Skip to content

DepthStrider-x/deodap_bestseller

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🛒 Deodap Bestseller Analytics & Scraper

Python Streamlit Pandas Status

A professional web scraping and analytics tool designed to extract, visualize, and export data from the Deodap Best-Selling Products catalog.

This project features both a Production-Ready Python Script for automation and a Modern Interactive Web Interface (built with Streamlit) for ease of use.


� Project Demo

lv_0_20260129175806.mp4

�🌟 Key Features

🖥️ Interactive Web Dashboard

  • One-Click Scraping: Trigger the entire process with a single button.
  • Real-Time Feedback: View live progress bars and streaming logs as the scraper works.
  • Data Preview: Sort and filter scraped products instantly in a responsive table.
  • Instant Export: Download data in CSV, JSON, or Excel formats directly from the browser.

⚙️ Core Scraping Logic

  • Robust Extraction: Custom safe_text and safe_attr handlers prevent crashes from missing data.
  • Auto-Pagination: Automatically navigates through all bestseller pages (1-9).
  • Polished Logging: Detailed file-based logging (scraper.log) for debugging and auditing.
  • Rate Limiting: Built-in delays to respect server load and mimic human behavior.

📥 Output Data

The tool extracts the following fields for every product:

  • 📝 Product Description
  • 💰 Price (Current Sale Price)
  • Star Rating
  • 💬 Review Count
  • 🔗 Product URL

🚀 Quick Start

Prerequisites

  • Python 3.8 or higher

Installation

  1. Clone the repository

    git clone https://github.com/AryanPrajapati9456/deodap_bestseller.git
    cd deodap_bestseller
  2. Install Dependencies

    pip install -r requirements.txt

Usage

Option A: Interactive Dashboard (Recommended)

Launch the visual interface in your browser:

streamlit run app.py

Option B: Headless CLI Scraper

Run the automation script directly in your terminal:

python scraper.py

📁 Project Structure

├── app.py             # 🎨 Streamlit Web Application Entry Point
├── scraper.py         # 🧠 Core Scraping Logic & Data Extraction
├── scraper.log        # 📝 Runtime Logs
├── requirements.txt   # 📦 Project Dependencies
├── README.md          # 📄 Documentation
└── [Output Files]     # 📊 Generated CSV/JSON/Excel reports

⚠️ Ethical Considerations

This tool is designed for educational purposes and personal portfolio demonstration.

  • It respects the target site by implementing delays between requests.
  • Users are responsible for adhering to deodap.in's Terms of Service and robots.txt policy.

👤 Author

Aryan Prajapati Python Developer • Web Scraper • Automation Engineer

GitHub


📝 License

This project is open-source and available for usage under the MIT License.

About

Python-based tool to scrape and log Deodap bestseller product info (price, rating, reviews) for analysis using BeautifulSoup and Request.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages