A comprehensive Python-based validation framework for ensuring data quality in AI agent training datasets.

qshytpolite/ai_data_validator

AI Data Validation Framework

A Python-based validation framework for ensuring data quality in AI agent training datasets. It checks data completeness, format integrity, consistency, and quality metrics that matter for AI/ML applications.

Features

  • Completeness Validation: Missing values, required columns, empty records
  • Format Validation: Data types, string patterns, date formats
  • Consistency Validation: Value ranges, categorical values, foreign key relations
  • Quality Validation: Class balance, feature correlation, outlier detection
  • Comprehensive Reporting: Multiple output formats (console, JSON, HTML)
  • Extensible Architecture: Easy to add custom validators
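To make the feature list concrete, here is a minimal standard-library sketch of how a few of these checks and a validator registry could fit together. It is not the framework's actual API: `ValidationResult`, `register`, `VALIDATORS`, and the three check functions are hypothetical names chosen for illustration, and the sample records are made up.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class ValidationResult:
    """Outcome of one check (hypothetical structure, not the framework's)."""
    check: str
    passed: bool
    details: dict = field(default_factory=dict)


# Hypothetical registry: custom validators are added by decorating a function.
VALIDATORS: dict[str, Callable[..., ValidationResult]] = {}


def register(name: str):
    """Store the decorated function in VALIDATORS under the given name."""
    def decorator(func):
        VALIDATORS[name] = func
        return func
    return decorator


@register("completeness")
def check_completeness(records, required_columns):
    """Count rows where a required column is absent, None, or an empty string."""
    missing = {col: 0 for col in required_columns}
    for row in records:
        for col in required_columns:
            if row.get(col) in (None, ""):
                missing[col] += 1
    missing = {col: n for col, n in missing.items() if n}
    return ValidationResult("completeness", not missing, {"missing_counts": missing})


@register("value_range")
def check_value_range(records, column, low, high):
    """Collect indexes of rows whose numeric value falls outside [low, high]."""
    bad = [
        i for i, row in enumerate(records)
        if isinstance(row.get(column), (int, float)) and not low <= row[column] <= high
    ]
    return ValidationResult("value_range", not bad, {"row_indexes": bad})


@register("class_balance")
def check_class_balance(records, label_column, max_ratio=3.0):
    """Fail when the largest class is more than max_ratio times the smallest."""
    counts = Counter(
        row[label_column] for row in records if row.get(label_column) is not None
    )
    ratio = max(counts.values()) / min(counts.values()) if counts else 0.0
    return ValidationResult(
        "class_balance", ratio <= max_ratio, {"counts": dict(counts), "ratio": ratio}
    )


# Made-up sample data: one empty text field and one out-of-range score.
records = [
    {"text": "hello", "label": "pos", "score": 0.9},
    {"text": "", "label": "neg", "score": 1.4},
    {"text": "bye", "label": "pos", "score": 0.2},
]

results = [
    VALIDATORS["completeness"](records, ["text", "label"]),
    VALIDATORS["value_range"](records, "score", 0.0, 1.0),
    VALIDATORS["class_balance"](records, "label"),
]

for result in results:
    print(result.check, "PASS" if result.passed else "FAIL", result.details)
```

In this sketch, adding a custom validator means writing one more function and decorating it with `register`, which is one plausible reading of the "extensible architecture" bullet. The repository may organize its validators differently.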

Installation

git clone https://github.com/qshytpolite/ai_data_validator.git
cd ai_data_validator
pip install -r requirements.txt
