Video Game Recommender System
Content-based video game recommendation engine using IGDB data, cosine similarity, and a Streamlit interface for interactive exploration.
01.Project Overview
Overview
A content-based video game recommendation engine built as part of the INFO-H501 course. Designed end-to-end, the system ingests data from the IGDB (Internet Games Database) API, engineers multi-dimensional game features, and uses cosine similarity to surface relevant recommendations.
Methodology
- Data Acquisition — Custom IGDB puller fetches game metadata, genres, platforms, ratings, community engagement metrics (follows, hypes), and time-to-beat data
- Feature Engineering — Multi-label genre encoding via MultiLabelBinarizer, combined with standardized numerical features (rating, community engagement, playtime)
- Similarity Computation — Cosine similarity matrix across the full feature space enables fast, precomputed recommendations
- Interactive Interface — Streamlit application for exploring recommendations with fuzzy game name matching
How It Works
Given a game title, the system:
- Locates the game in the feature matrix (case-insensitive, with fuzzy matching fallback)
- Retrieves the top-N most similar games based on cosine similarity scores
- Returns recommendations with key metadata (rating, community follows)
The feature space blends categorical signals (genre combinations) with numerical signals (rating, engagement, completion time) to capture both thematic and quality-based similarity.
My Role
I designed the system end-to-end: the data ingestion pipeline from IGDB, the feature engineering approach, the similarity computation, and the Streamlit interface.
Tech Stack
- Python, Pandas, NumPy
- Scikit-learn (MultiLabelBinarizer, StandardScaler, cosine_similarity)
- Streamlit (interactive UI)
- IGDB API (game metadata)
Technologies
Role
Lead Developer & System Designer
Timeline
Sep 2025 - Dec 2025
Category
Machine Learning / Recommendation Systems