Skip to content

A curated collection of projects demonstrating modern generative AI techniques — including image captioning, text generation, diffusion models, and Retrieval-Augmented Generation (RAG) applications — built using Transformer models, Hugging Face, BLIP, and LangChain.

Notifications You must be signed in to change notification settings

zahra-ahmadbeigloo/Generative-AI-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Generative AI Projects

This repository showcases advanced Generative AI projects that combine the power of transformer architectures, diffusion models, and LLM pipelines to generate images, text, and intelligent responses. Each project reflects state-of-the-art methods in modern AI, as explored in the IBM AI Engineering Professional Certificate coursework and independent research.


Projects Overview

Project Description Tools & Models
Aircraft Damage Captioning Classifies aircraft damage and generates natural language captions from images. VGG16, BLIP, Keras, Hugging Face
Diffusion Models Implements image generation using denoising diffusion probabilistic models. NumPy, Matplotlib, Forward/Reverse Sampling
Text Generation with Transformers Generates creative and coherent text using Transformer-based language models. GPT2, Hugging Face Transformers
Advanced Transformers Custom Transformer architecture from scratch with positional encoding and masked attention. TensorFlow/Keras

Tools & Libraries Used

  • Transformers (Hugging Face, BLIP, GPT2)
  • TensorFlow/Keras
  • LangChain (for RAG)
  • Matplotlib, NumPy, PIL
  • Diffusers, Datasets, Scikit-learn

Key Highlights

  • Image Captioning with BLIP using pretrained transformer encoders + decoders
  • Diffusion model training loop from scratch with noise schedules
  • Text generation with GPT2 (sampling, top-k, and temperature control)
  • Retrieval-Augmented Generation (RAG) architecture with external context and vector databases (coming soon)

Setup

pip install transformers tensorflow datasets diffusers langchain pillow matplotlib numpy

License

This project is licensed under the MIT License.

About

A curated collection of projects demonstrating modern generative AI techniques — including image captioning, text generation, diffusion models, and Retrieval-Augmented Generation (RAG) applications — built using Transformer models, Hugging Face, BLIP, and LangChain.

Topics

Resources

Stars

Watchers

Forks