🚀 i2yt: Instagram to YouTube/Google Drive Automation

i2yt is a powerful and highly customizable automation tool designed to streamline your content workflow. It scrapes Instagram Reels, intelligently organizes the data in Google Sheets, and can automatically upload the videos to Google Drive, preparing them for your YouTube channel or other content pipelines.

✨ Features

Multi-Account Scraping: Effortlessly scrape Reels from multiple Instagram accounts with parallel processing
Smart Data Management: Automatically saves data to Google Sheets with duplicate checks, status tracking, and clean formatting
Google Drive Integration: Seamlessly download and upload Reels to Google Drive with automated file management
Professional Parallel Processing: Leverages concurrent processing for maximum speed in scraping, description extraction, and uploads
Multiple Execution Methods: Run via Python CLI, PowerShell interactive menu, or Windows batch launcher
Highly Customizable: Fine-tune the entire workflow through a comprehensive config.py file with 50+ settings
Resilient & Robust: Built-in error handling, retry mechanisms, and intelligent rate limiting
Workflow Automation Ready: Designed for integration with n8n, Task Scheduler, or other automation platforms
Advanced Browser Management: Headless mode, session persistence, and optimized Chrome profile handling
Comprehensive Logging: Detailed logging system with multiple verbosity levels and structured output
Modular Architecture: Separate modules for scraping, description extraction, and Google Drive uploads
Professional Status Tracking: Color-coded status system in Google Sheets with workflow state management

⚙️ How It Works

The automation process follows these steps:

Scrape Reels: The tool scrapes the latest Reels from the specified Instagram accounts.
Populate Google Sheets: New Reel URLs, along with metadata like username and Reel ID, are added to a Google Sheet. Duplicates are automatically skipped.
Extract Descriptions: (Optional) The description for each Reel is extracted and added to the sheet.
Upload to Google Drive: (Optional) The videos are downloaded and uploaded to your Google Drive.
Update Status: The status of each Reel is updated in the Google Sheet, giving you a clear overview of the workflow (pending, processing, completed, failed).

🏁 Getting Started

Prerequisites

Python 3.8+
Google Chrome or Chromium
A Google Cloud project with the Google Sheets and Google Drive APIs enabled.
An Instagram account (for authentication to extract descriptions).

1. Clone the Repository

git clone https://github.com/your-username/i2yt.git
cd i2yt

2. Install Dependencies

pip install -r requirements.txt

3. Automated Setup (Recommended)

For quick setup, use the automated setup script:

python setup.py

This script will:

Install all Python dependencies
Create config.py from the template
Check Chrome WebDriver availability
Display next steps with clear instructions

4. Configure Google API Access (Manual Setup)

If you prefer manual setup or need to understand each step:

You'''ll need to set up a Google Service Account to allow the application to access your Google Sheet and Google Drive.

Follow the official Google Cloud documentation to create a service account and download the credentials.json file.
- Enable Google Sheets API
- Enable Google Drive API
Place the credentials.json file in the root directory of the project.
Share your Google Sheet with the service account email address (Editor permissions).

⚠️ Google Drive Important Note: Service accounts cannot upload to their own Google Drive due to storage quota limitations. For Google Drive uploads, you need to either:

Use a Google Workspace Shared Drive (add service account as Content Manager)
Share a personal folder with the service account (Editor permissions)
See detailed setup guide for complete instructions

5. Create Your Configuration

Copy the template to create your own configuration file:

copy config_template.py config.py

Now, open config.py and customize it with your details:

INSTAGRAM_URLS: A list of Instagram accounts to scrape.
GOOGLE_SHEETS_ID: The ID of your Google Sheet (from its URL).
UPLOAD_TO_GOOGLE_DRIVE: Set to True to enable uploads.
DRIVE_FOLDER_ID: The ID of the Google Drive folder for uploads.

6. Run the Scraper

You can run the scraper in multiple ways:

Option A: Direct Python Execution

python run_scraper.py

Option B: PowerShell Script

.\run_scraper.ps1

Option C: Windows Batch Launcher (Recommended for Windows)

For Windows users, a convenient batch file launcher is provided:

⚠️ IMPORTANT: You must update the path in the batch file before first use!

Open Launch_Instagram_Scraper.bat in a text editor

Find this line:

start wt -p "PowerShell" --title "Instagram Reel Scraper" pwsh -NoExit -ExecutionPolicy Bypass -File "d:\Kodo\i2yt\run_scraper.ps1"

Replace "d:\Kodo\i2yt\run_scraper.ps1" with the full path to your project directory

Example: If your project is in C:\Users\YourName\Documents\i2yt\, change it to:
```
start wt -p "PowerShell" --title "Instagram Reel Scraper" pwsh -NoExit -ExecutionPolicy Bypass -File "C:\Users\YourName\Documents\i2yt\run_scraper.ps1"
```
Save the file and double-click to run

Features of the Batch Launcher:

Opens a new Windows Terminal with PowerShell
Sets the correct working directory automatically
Runs with bypass execution policy
Can be launched from anywhere (desktop, taskbar, etc.)
Keeps the terminal open so you can see the results
Provides an interactive menu for different operations

📖 For detailed information on all execution methods, see Running Guide

🔧 Configuration

All settings are managed in the config.py file. Here are some of the key options:

Essential Configuration

Setting	Description	Example
`INSTAGRAM_URLS`	List of Instagram profile URLs to scrape	`["https://www.instagram.com/account/"]`
`GOOGLE_SHEETS_ID`	The ID of the target Google Sheet	`"1abcd1234efgh5678..."`
`CREDENTIALS_FILE`	Path to Google API credentials	`"credentials.json"`
`TARGET_LINKS`	Number of Reels to scrape per account	`50` (0 = unlimited)
`DAYS_LIMIT`	Only scrape Reels from the last N days	`30` (0 = all time)

Performance & Processing

Setting	Description	Default
`HEADLESS`	Run browser in headless mode	`False`
`FAST_MODE`	Enable performance optimizations	`True`
`ENABLE_CONCURRENT_PROCESSING`	Use parallel processing	`True`
`MAX_SCRAPING_WORKERS`	Concurrent threads for scraping	`4`
`BATCH_SIZE`	Save to sheets every N new reels	`25`

Content Processing

Setting	Description	Default
`EXTRACT_DESCRIPTIONS`	Extract Reel descriptions	`True`
`UPLOAD_TO_GOOGLE_DRIVE`	Enable Google Drive uploads	`False`
`DRIVE_FOLDER_ID`	Google Drive destination folder	`""`
`DELETE_LOCAL_AFTER_UPLOAD`	Clean up local files	`True`

For complete configuration options, see Configuration Guide.

📂 Project Structure

i2yt/
├── .gitignore
├── config.py                     # Your configuration (created from template)
├── config_template.py            # Configuration template with examples
├── Launch_Instagram_Scraper.bat  # Windows launcher (requires path customization)
├── login_to_instagram.bat        # Instagram login helper for Windows
├── run_scraper.ps1               # PowerShell script for Windows
├── credentials.json              # Google API credentials (you create this)
├── requirements.txt              # Python dependencies
├── description_extractor.py      # Extract descriptions using yt-dlp
├── google_drive_manager.py       # Handles Google Drive video uploads
├── google_sheets_manager.py      # Manages Google Sheets interactions
├── instagram_scraper.py          # Main Instagram scraping engine
├── instagram_scraper_clean.py    # Alternative clean scraper implementation
├── main_processor.py             # Orchestrates the complete workflow
├── parallel_processor.py         # Handles concurrent processing for performance
├── run_scraper.py                # Python entry point with CLI options
├── setup.py                      # Automated setup and configuration helper
├── n8n_workflow.json            # n8n workflow template for automation
├── docs/                         # Comprehensive documentation
│   ├── quick_start.md            # Get started in 10 minutes
│   ├── running_guide.md          # Complete guide to execution methods
│   ├── configuration.md          # Complete configuration guide
│   ├── google_sheets_setup.md    # Step-by-step Google API setup
│   ├── n8n_integration.md       # YouTube automation with n8n
│   ├── advanced_usage.md         # Power user features
│   ├── developer_guide.md        # For developers and contributors
│   ├── troubleshooting.md        # Common issues and solutions
│   └── technical_sheets_permissions.md  # Google Sheets permissions fix
├── tests/                        # Testing and validation tools
│   ├── test_google_api.py        # Test Google API connectivity
│   ├── test_sheets.py           # Test Google Sheets integration
│   └── demo.py                  # Quick demo and setup verification
├── downloaded_reels/             # Local video storage (auto-created)
├── instagram_profile/            # Chrome profile for Instagram login
└── __pycache__/                 # Python bytecode cache

🤝 Contributing

Contributions are welcome! Please feel free to submit a pull request.

Fork the repository.
Create your feature branch (git checkout -b feature/AmazingFeature).
Commit your changes (git commit -m '''Add some AmazingFeature''').
Push to the branch (git push origin feature/AmazingFeature).
Open a pull request.

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

⚠️ Disclaimer

This tool is for educational purposes only. Please be responsible and respect Instagram'''s terms of service. The developers are not responsible for any misuse of this tool.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🚀 i2yt: Instagram to YouTube/Google Drive Automation

✨ Features

⚙️ How It Works

🏁 Getting Started

Prerequisites

1. Clone the Repository

2. Install Dependencies

3. Automated Setup (Recommended)

4. Configure Google API Access (Manual Setup)

5. Create Your Configuration

6. Run the Scraper

Option A: Direct Python Execution

Option B: PowerShell Script

Option C: Windows Batch Launcher (Recommended for Windows)

🔧 Configuration

Essential Configuration

Performance & Processing

Content Processing

📂 Project Structure

🤝 Contributing

📄 License

⚠️ Disclaimer

About

Uh oh!

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.github/workflows		.github/workflows
docs		docs
tests		tests
.gitignore		.gitignore
Launch_Instagram_Scraper.bat		Launch_Instagram_Scraper.bat
README.md		README.md
config_template.py		config_template.py
description_extractor.py		description_extractor.py
google_drive_manager.py		google_drive_manager.py
google_sheets_manager.py		google_sheets_manager.py
instagram_scraper.py		instagram_scraper.py
instagram_scraper_clean.py		instagram_scraper_clean.py
login_to_instagram.bat		login_to_instagram.bat
main_processor.py		main_processor.py
n8n_workflow.json		n8n_workflow.json
parallel_processor.py		parallel_processor.py
requirements.txt		requirements.txt
run_scraper.ps1		run_scraper.ps1
run_scraper.py		run_scraper.py
setup.py		setup.py

ByteTrix/i2yt

Folders and files

Latest commit

History

Repository files navigation

🚀 i2yt: Instagram to YouTube/Google Drive Automation

✨ Features

⚙️ How It Works

🏁 Getting Started

Prerequisites

1. Clone the Repository

2. Install Dependencies

3. Automated Setup (Recommended)

4. Configure Google API Access (Manual Setup)

5. Create Your Configuration

6. Run the Scraper

Option A: Direct Python Execution

Option B: PowerShell Script

Option C: Windows Batch Launcher (Recommended for Windows)

🔧 Configuration

Essential Configuration

Performance & Processing

Content Processing

📂 Project Structure

🤝 Contributing

📄 License

⚠️ Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 2

Uh oh!

Languages