algorithm - Behavioral Analysis : Combining DMOZ categories -


i trying analyse behaviour of user using urls guy has visited , make accurate prediction popular dmoz categories (http://dmoztools.net/).

we have built api maps url dmoz category quite accurately. api looks :

input - url

output - dmoz category in url lies maximum probability , probability value.

after passing urls user has visited api, come dmoz categories , respective probabilities. there may lot of dmoz categories number of input url increases. trying filter out relevant dmoz categories. dmoz category may not returned exactly, instead can limit dmoz hierarchy .

example:

1)

 url1 dmoz -> grocery/vegetable/tomato ,  url2 dmoz -> grocery/vegetable/potato,  url3 dmoz -> grocery/vegetable/onion  expected output : grocery/vegetable (since user interested across vegetables) 

2)

url1 dmoz -> grocery/vegetable/tomato ,  url2 dmoz -> grocery/vegetable/tomato,  url3 dmoz -> grocery/vegetable/tomato,  url4 dmoz -> grocery/vegetable/potato  expected output : grocery/vegetable/tomato (since user more biased towards tomato) 

as clear above, want filter out dmoz hierarchy has rare significance, giving priority subclasses if significant.is there standard way of merging dmoz categories, make accurate dmoz category predicition user ?


Comments

Popular posts from this blog

neo4j - finding mutual friends in a cypher statement starting with three or more persons -

php - How to remove letter in front of the word laravel -

minify - Minimizing css files -