
In this era where digital information has become the main source of knowledge across the globe, Alfresco has a relevant role to play as it manages huge volumes of data for companies and users.

Despite the services Alfresco provides to organise your repository, the volume of data is so high that you may not know where to find a document, or even whether it exists. At this point, a little help from your own repository is more than welcome.

Machine learning is an ideal approach for these situations: users can benefit from both their own preferences and those of similar users to get automated recommendations, reducing the time it takes to find what they may be interested in.

This is where Apache Mahout comes into play. Mahout is a project of the Apache Software Foundation that implements multiple machine learning algorithms, mainly focused on the areas of collaborative filtering, clustering and classification.

As Alfresco integrators, we see real value in integrating Mahout with Alfresco: users can rate any document and, in return, are offered the best documents based on those ratings. Mahout offers different classification and recommendation algorithms out of the box. Our integration uses “logLikelihoodSimilarity” as the default algorithm, but you can customise it and use whichever algorithm best fits your needs.
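To give an idea of what a log-likelihood similarity measures, here is a minimal Python sketch of Dunning's log-likelihood ratio applied to two documents, using only the sets of users who voted on each. It is an illustration, not the integration's code: the function names are hypothetical, and the final mapping of the ratio to a 0..1 score (1 - 1/(1 + LLR)) reflects our understanding of how Mahout scales it, so treat that step as an assumption.

```python
import math


def x_log_x(x):
    # x * ln(x), with the usual convention that 0 * ln(0) = 0.
    return 0.0 if x == 0 else x * math.log(x)


def entropy(*counts):
    # Unnormalised entropy of a list of counts.
    return x_log_x(sum(counts)) - sum(x_log_x(c) for c in counts)


def log_likelihood_ratio(k11, k12, k21, k22):
    # k11: users who voted on both documents
    # k12: users who voted only on the first
    # k21: users who voted only on the second
    # k22: users who voted on neither
    row = entropy(k11 + k12, k21 + k22)
    col = entropy(k11 + k21, k12 + k22)
    mat = entropy(k11, k12, k21, k22)
    # Clamp tiny negative values caused by floating-point rounding.
    return max(0.0, 2.0 * (row + col - mat))


def llr_similarity(users_a, users_b, total_users):
    # Assumption: scale the ratio into [0, 1) as 1 - 1 / (1 + LLR).
    both = len(users_a & users_b)
    only_a = len(users_a - users_b)
    only_b = len(users_b - users_a)
    neither = total_users - both - only_a - only_b
    llr = log_likelihood_ratio(both, only_a, only_b, neither)
    return 1.0 - 1.0 / (1.0 + llr)


if __name__ == "__main__":
    # Two documents voted on by exactly the same two users, out of four.
    print(llr_similarity({"admin", "fran"}, {"admin", "fran"}, 4))
```

Note that this measure only looks at who voted on what, not at the number of stars given: two documents are similar when the overlap between their voters is larger than chance would explain.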

The recommendation approach we are exploring is based on user likes. More precisely, it relies on a five-star rating scheme, where 5 stars is the maximum you can give and 0 stars the minimum.

To capture the votes, we used the Fivestar Ratings Widget for Alfresco Share by Jeff Potts, adapted to work with Alfresco 4.x.

You can find Jeff Potts's code on Google Code: https://code.google.com/p/alfresco-fivestar-ratings/

How it works

Go to your local Alfresco repository and create a folder. In this example it is called “FiveStars”.

Create a new rule with the parameters that you can see in the screenshot. 

Upload some documents to the folder.

Create some users to vote on the documents. For instance, the following table shows each document together with the users who voted on it and the vote each user gave.

Document | Votes [user,vote]
Alex Kingston – Lady Macbeth freaked me out.pdf | [aayala,4] [admin,4]
Barack Obama – NSA is not rifling through ordinary peoples emails.pdf | [aayala,5] [ivan,5] [fran,5]
China diversifies UK interests as Dalian Wanda invests £1bn in luxury brands.pdf | [ivan,4] [admin,4]
Federal Reserve hints it could end stimulus program next year.pdf | [admin,1] [fran,1]
Google is not in cahoots with NSA, says chief legal officer.pdf | [admin,5] [fran,5]
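As an illustration of how votes like these can be handed to a recommender, the sketch below flattens the table into "userID,itemID,preference" lines, the comma-separated layout that Mahout's file-based data model reads, with numeric IDs. Whether our integration feeds Mahout through a file or directly from the repository is not covered here, and the helper name and the ID numbering are hypothetical.

```python
# Votes from the table above: document -> {user: stars}.
RATINGS = {
    "Alex Kingston – Lady Macbeth freaked me out.pdf": {"aayala": 4, "admin": 4},
    "Barack Obama – NSA is not rifling through ordinary peoples emails.pdf": {
        "aayala": 5, "ivan": 5, "fran": 5},
    "China diversifies UK interests as Dalian Wanda invests £1bn in luxury brands.pdf": {
        "ivan": 4, "admin": 4},
    "Federal Reserve hints it could end stimulus program next year.pdf": {
        "admin": 1, "fran": 1},
    "Google is not in cahoots with NSA, says chief legal officer.pdf": {
        "admin": 5, "fran": 5},
}


def to_preference_rows(ratings):
    """Return CSV rows plus the user and document ID mappings used to build them."""
    user_ids, item_ids, rows = {}, {}, []
    for doc, votes in ratings.items():
        # Assign numeric IDs in order of first appearance, starting at 1.
        item_id = item_ids.setdefault(doc, len(item_ids) + 1)
        for user, stars in votes.items():
            user_id = user_ids.setdefault(user, len(user_ids) + 1)
            rows.append(f"{user_id},{item_id},{stars}")
    return rows, user_ids, item_ids


if __name__ == "__main__":
    rows, user_ids, item_ids = to_preference_rows(RATINGS)
    print("\n".join(rows))
```

In a real repository the numeric IDs would have to be mapped back to Alfresco users and documents; the two dictionaries returned here stand in for that lookup.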

The result should be something like this.

 

When users vote on documents, the recommendation algorithms run in the background and provide users with recommended documents based on those they like. These documents are displayed in a simple but useful “Recommended documents” dashlet, which should look like this:

 

Based on those ratings, these are the documents suggested to each user through the “Recommended documents” dashlet:

User: admin
Voted documents:
  Alex Kingston – Lady Macbeth freaked me out.pdf
  China diversifies UK interests as Dalian Wanda invests £1bn in luxury brands.pdf
  Federal Reserve hints it could end stimulus program next year.pdf
  Google is not in cahoots with NSA, says chief legal officer.pdf
Mahout recommendations (sorted by rating):
  1. Barack Obama – NSA is not rifling through ordinary peoples emails.pdf

User: aayala
Voted documents:
  Alex Kingston – Lady Macbeth freaked me out.pdf
  Barack Obama – NSA is not rifling through ordinary peoples emails.pdf
Mahout recommendations (sorted by rating):
  1. Google is not in cahoots with NSA, says chief legal officer.pdf
  2. China diversifies UK interests as Dalian Wanda invests £1bn in luxury brands.pdf
  3. Federal Reserve hints it could end stimulus program next year.pdf

User: ivan
Voted documents:
  Barack Obama – NSA is not rifling through ordinary peoples emails.pdf
  China diversifies UK interests as Dalian Wanda invests £1bn in luxury brands.pdf
Mahout recommendations (sorted by rating):
  1. Google is not in cahoots with NSA, says chief legal officer.pdf
  2. Alex Kingston – Lady Macbeth freaked me out.pdf
  3. Federal Reserve hints it could end stimulus program next year.pdf

User: fran
Voted documents:
  Barack Obama – NSA is not rifling through ordinary peoples emails.pdf
  Federal Reserve hints it could end stimulus program next year.pdf
  Google is not in cahoots with NSA, says chief legal officer.pdf
Mahout recommendations (sorted by rating):
  1. Alex Kingston – Lady Macbeth freaked me out.pdf
  2. China diversifies UK interests as Dalian Wanda invests £1bn in luxury brands.pdf
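To make the idea behind these suggestions concrete, here is a toy item-based recommender in Python run on the same votes. It is a sketch, not the Mahout pipeline: it swaps in the simpler Tanimoto coefficient (shared voters divided by all voters) for the similarity, the document titles are shortened for readability, and the function names are hypothetical. It suggests the same documents to each user as listed above, but the order can differ because the similarity measure is not the same.

```python
# Votes from the post, with shortened titles: document -> {user: stars}.
RATINGS = {
    "Alex Kingston": {"aayala": 4, "admin": 4},
    "Barack Obama": {"aayala": 5, "ivan": 5, "fran": 5},
    "China diversifies": {"ivan": 4, "admin": 4},
    "Federal Reserve": {"admin": 1, "fran": 1},
    "Google": {"admin": 5, "fran": 5},
}


def tanimoto(doc_a, doc_b):
    # Similarity of two documents from the overlap of their voters.
    users_a, users_b = set(RATINGS[doc_a]), set(RATINGS[doc_b])
    union = users_a | users_b
    return len(users_a & users_b) / len(union) if union else 0.0


def recommend(user):
    """Rank the documents the user has not voted on by estimated stars."""
    rated = {doc: votes[user] for doc, votes in RATINGS.items() if user in votes}
    scored = []
    for candidate in RATINGS:
        if candidate in rated:
            continue
        weights = {doc: tanimoto(candidate, doc) for doc in rated}
        total = sum(weights.values())
        if total == 0:
            continue  # nothing links this document to the user's votes
        # Similarity-weighted average of the user's own votes.
        estimate = sum(weights[doc] * stars for doc, stars in rated.items()) / total
        scored.append((candidate, estimate))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name in ("admin", "aayala", "ivan", "fran"):
        print(name, recommend(name))
```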

The source code is now available to download for free here: https://github.com/zaizi/alfresco-recommendations

If you want to know more about both the technical and the end-user aspects, we’ll be giving a talk soon with a live demo. Join Zaizi at the next web talk and learn how Alfresco learns from user experience within the repository to offer users recommended content based on the content they read, create or like.


iarroyo's picture

 
