This project showcases end-to-end data extraction, cleaning, and interactive visualization using Selenium, Pandas, and Power BI. Data was scraped from Croma across various product categories such as Mobiles, Air Conditioners, Refrigerators, and Televisions.
To scrape product listings from the Croma eCommerce website, clean the data, and create a Power BI dashboard for price-based insights and product exploration.
- Python 3
- Selenium β for automating web scraping
- Pandas β for data processing
- Power BI β for creating interactive dashboards
- ChromeDriver
- Automated scraping from Croma.com using Selenium.
- Extracted fields:
Category
,Title
,Price
, andProduct Link
. - Cleaned prices (βΉ symbol removed, converted to integer).
- Final dataset saved as
croma_cleaned_products.csv
. - Created Power BI dashboard:
- π¦ Category-wise product price distribution
- π° Top 10 expensive products
- π Price range histogram
- π Table with clickable product links
croma-web-scraping/
βββ project_notebook.ipynb # All scraping and cleaning code
βββ croma_cleaned_products.csv # Final cleaned dataset
βββ dashboard.png # Screenshot of Power BI dashboard
βββ README.md # Project overview