Introduction
During my internship at Infotact Solutions, the challenge was to improve user engagement by suggesting relevant content. We chose a Collaborative Filtering approach to leverage user-item interaction data.
Mathematical Framework
We utilized Singular Value Decomposition (SVD) to decompose the user-item interaction matrix into latent factors.
Where:
- : User preferences matrix
- : Item attributes matrix
- : Weights of latent factors
K-Nearest Neighbors (KNN)
For real-time "Similar Item" recommendations, we used KNN with Cosine Similarity:
Implementation
The pipeline involved cleaning the MovieLens dataset, creating a sparse matrix, and training the models.
Results
- RMSE (Root Mean Square Error): 0.766 (Lower is better)
- Precision@10: 81.02% (High relevance in top 10 results)
Conclusion
The hybrid approach of using SVD for matrix completion and KNN for localized similarity proved highly effective for this dataset scale.