TFIDF and machine learning

I have written an article about TFIDF and machine learning in German. This is an examination for implementation of my recommender system (without collaborative filter).

  • Training Data: web pages with labels (±1)
  • Test Data: web pages without labels.

I have tried several method: SVM, random forest, penalised logistic regression, k-means, hierarchical clustering, cosine similarity and ratio of distances to labelled data.

If you have any question, suggestion for machine learning, German, etc., please leave a comment at this entry or send me a message on twitter!

Share this page on        
Categories: #data-mining  #development