RAG-based Hate Speech Classification Application

This Streamlit application uses Retrieval Augmented Generation (RAG) to classify text as hate speech or not hate speech, leveraging Google Cloud Vertex AI's embedding and generative models.

Features

RAG-based Classification: Retrieves similar examples from your dataset to provide context for more accurate classification
Single Prompt Classification: Classify individual text inputs
Batch Processing: Process multiple examples at once for evaluation
Results Analysis: View performance metrics and analyze classification errors
Context Visualization: See which similar examples influenced each classification

Prerequisites

A Google Cloud account with Vertex AI API enabled
A CSV dataset with hate speech examples (must contain "prompt" and "label" columns)
Python 3.8 or higher

Installation

Clone this repository:

git clone https://github.com/afroCoderHanane/rag-hate-speech-classification.git
cd rag-hate-speech-classification

Install the required packages:

pip install -r requirements.txt

Set up Google Cloud authentication:

gcloud auth application-default login

Docker Installation

You can also run the application and dataset generator using Docker.

Building the Docker Image

docker build -t rag-classifier .

Running the Docker Container

1. Running the Streamlit App

docker run -p 8501:8501 -v $(pwd):/data rag-classifier app

Access the app in your browser at: http://localhost:8501

2. Generating a Hate Speech Dataset

docker run -v $(pwd):/data rag-classifier generate --samples=200 --output=/data/my_dataset.csv

Options:

--samples=NUMBER: Number of examples to generate (default: 200)
--output=PATH: Where to save the dataset (default: timestamped filename in container)

Using Docker Compose

Start the application:

docker-compose up rag-app

Generate a dataset:

docker-compose run generator

Dataset Format

Your dataset should be a CSV file with at least these columns:

prompt: The text to classify
label: The classification label (should be either "hate_speech" or "not_hate_speech")

Example:

prompt,label
"I hate all people from that country",hate_speech
"I love sunny days at the beach",not_hate_speech

Running the Application

Start the Streamlit app:

streamlit run app.py

In your web browser, you'll see the application interface.
In the sidebar:
- Enter your Google Cloud Project ID
- Select a Vertex AI Location
- Click "Initialize Vertex AI API"
- Upload your hate speech dataset
- Click "Create Vector Store" to create embeddings
Use the tabs to perform single classifications or batch processing

How It Works

RAG is a method that combines retrieval and generation to deliver more accurate and context-aware results. The retrieval process pulls relevant documents from a knowledge base, while the generation process uses a language model to create a coherent response based on the retrieved content.

This application:

Creates embeddings for all examples in your hate speech dataset
Stores these embeddings in a vector store
When classifying new text:
- Creates an embedding for the new text
- Finds the 5 most similar examples from your dataset
- Sends these similar examples as context to the Vertex AI model
- Uses the model to classify the new text as hate speech or not

Tips for Better Results

Ensure your dataset has diverse examples
Use a large enough dataset for better retrieval (ideally 1000+ examples)
Experiment with different Vertex AI models
Adjust the number of similar examples retrieved (default is 5)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Dockerfile		Dockerfile
README.md		README.md
create_dataset.py		create_dataset.py
create_evaluation_dataset.py		create_evaluation_dataset.py
dataset_generator.py		dataset_generator.py
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
hate_speech_evaluation_dataset_20250420_144723.csv		hate_speech_evaluation_dataset_20250420_144723.csv
rag_app.py		rag_app.py
requirements.txt		requirements.txt
sample_hate_speech_dataset.csv		sample_hate_speech_dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG-based Hate Speech Classification Application

Features

Prerequisites

Installation

Docker Installation

Building the Docker Image

Running the Docker Container

1. Running the Streamlit App

2. Generating a Hate Speech Dataset

Using Docker Compose

Dataset Format

Running the Application

How It Works

Tips for Better Results

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

afroCoderHanane/rag_safety_classification

Folders and files

Latest commit

History

Repository files navigation

RAG-based Hate Speech Classification Application

Features

Prerequisites

Installation

Docker Installation

Building the Docker Image

Running the Docker Container

1. Running the Streamlit App

2. Generating a Hate Speech Dataset

Using Docker Compose

Dataset Format

Running the Application

How It Works

Tips for Better Results

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages