Skip to content

A modular NLP pipeline for binary sentiment classification on IMDb movie reviews using Hugging Face Transformers. The project demonstrates scalable model evaluation, multilingual compatibility (tabularisai), and clean engineering practices.

License

Notifications You must be signed in to change notification settings

owenwienczkowski/sentiment-analysis-huggingface

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sentiment Classifier using Hugging Face Transformers

A modular NLP pipeline and interactive web app for sentiment analysis on text using Hugging Face Transformers. This project demonstrates scalable inference, binary and multi-class classification compatibility, model evaluation, and clean engineering practices.

Project Overview

Directory Structure

imdb-sentiment-classifier/
├── src/
│ ├── preprocess.py # Tokenization and decoding
│ ├── inference.py # Hugging Face model inference
├── app.py # Streamlit app
├── LICENSE # MIT License
├── README.md # You are here
└── requirements.txt # Project dependencies

App Demo

You can interact with the web app to upload review data, select the model type, and view predictions directly. Results can be filtered by label and downloaded as CSV.

Link to live app: https://sentiment-classifier-owen-wienczkowski.streamlit.app/

How to Run Locally

Make sure you're in the project root directory and have Python 3.8+.

  1. Install dependencies
pip install -r requirements.txt
  1. Run interactive web app
streamlit run app.py

Results for Web App (Example)

sentiment-metrics-1k

Skills Demonstrated

Hugging Face Transformers and inference pipelines

Streamlit interface design for real-time inference

Modular, production-style pipeline design

Tokenization, decoding, label mapping

Custom evaluation logic for multiclass-to-binary transitions

Multilingual model handling and class consolidation

Usable, downloadable web interface for non-technical users

About

A modular NLP pipeline for binary sentiment classification on IMDb movie reviews using Hugging Face Transformers. The project demonstrates scalable model evaluation, multilingual compatibility (tabularisai), and clean engineering practices.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages