Automated Nestlé Product Matching Across Amazon, Flipkart & BigBasket

Executive Summary

In today’s highly competitive FMCG landscape, brand consistency across marketplaces is critical. Nestlé products such as Nescafé, Maggi, KitKat, and Cerelac are sold across multiple online platforms in India including Amazon, Flipkart, and BigBasket.

However, inconsistencies in product titles, pack sizes, seller listings, discounts, and availability create major challenges for price monitoring, MAP compliance, and digital shelf analytics.

Iweb Data Scraping implemented an AI-powered automated product matching and marketplace data scraping solution to unify Nestlé SKUs across platforms in real time.

The solution delivered:

  • 98.7% SKU matching accuracy
  • 4X faster digital shelf monitoring
  • Real-time price comparison across marketplaces
  • Automated detection of duplicate and unauthorized sellers
  • Structured datasets via API and dashboard access

This case study explains the business challenge, the technical architecture, the scraping methodology, and the final business outcomes.

Automated Nestlé Product Matching Across Marketplaces
Client's-Challenge

Business Background

Nestlé operates across multiple FMCG categories including:

  • Coffee
  • Dairy
  • Chocolates
  • Infant Nutrition
  • Instant Noodles
  • Breakfast Cereals
  • Health Science Products

Each marketplace represents products differently:

  • Titles vary by seller
  • Units are expressed differently (g, kg, pack of 2, combo pack)
  • Images vary in resolution and format
  • Product descriptions are inconsistent
  • Discount structures differ
  • Availability changes frequently

For large-scale brands, manually matching SKUs across marketplaces is impossible.

That’s where Iweb Data Scraping stepped in.

The Core Problem

When Nestlé products appear on:

  • Amazon India
  • Flipkart India
  • BigBasket India

They may look like different products even when they are identical.

Example Issue:

Platform Product Title
Amazon Nestlé Maggi 2-Minute Noodles Masala 70g Pack of 4
Flipkart Maggi Masala Instant Noodles 70 gm (4 Pack)
BigBasket MAGGI 2-Minute Masala Noodles – 70 g x 4

For analytics engines, these are 3 different SKUs unless matched correctly.

Without automated product matching:

  • Price benchmarking fails
  • Discount tracking becomes inaccurate
  • Market share analysis gets distorted
  • Digital shelf reporting becomes unreliable
The-Core-Problem
Objectives-of-the-Project

Objectives of the Project

Iweb Data Scraping was assigned to:

  • Scrape product data from Amazon, Flipkart, and BigBasket
  • Normalize SKU attributes
  • Match identical products across platforms
  • Detect duplicate listings
  • Track price variations and discounts
  • Deliver structured real-time datasets
  • Provide API-based integration

Technology Stack Used

Iweb Data Scraping implemented:

  • Python-based web scraping framework
  • Rotating proxies & anti-bot bypass systems
  • AI-based NLP product title matching engine
  • Image similarity comparison
  • GTIN/EAN extraction
  • SKU attribute normalization engine
  • Real-time scraping API
  • JSON and CSV dataset delivery
Technology-Stack-Used
Data-Fields-Extracted

Data Fields Extracted

The web scraping API collected:

  • Product Title
  • Brand Name
  • Variant
  • Pack Size
  • MRP
  • Selling Price
  • Discount %
  • Seller Name
  • Availability Status
  • Product Rating
  • Number of Reviews
  • Category
  • Subcategory
  • EAN/UPC (where available)
  • Product Image URL
  • Delivery Time

Sample Raw Scraped Data (Before Matching)

Amazon Sample

{
  "platform": "Amazon",
  "product_title": "Nestle Maggi 2-Minute Noodles Masala 70g Pack of 4",
  "price": 52,
  "mrp": 56,
  "seller": "RetailNet",
  "availability": "In Stock",
  "rating": 4.4,
  "reviews": 18234
}

Flipkart Sample

{
  "platform": "Flipkart",
  "product_title": "Maggi Masala Instant Noodles 70 gm (Pack of 4)",
  "price": 50,
  "mrp": 56,
  "seller": "SuperMart",
  "availability": "In Stock",
  "rating": 4.3,
  "reviews": 15420
}

BigBasket Sample

{
  "platform": "BigBasket",
  "product_title": "MAGGI 2-Minute Masala Noodles 70 g x 4",
  "price": 54,
  "mrp": 56,
  "seller": "BigBasket Retail",
  "availability": "Available",
  "rating": 4.5,
  "reviews": 8342
}
Sample-Raw-Scraped-Data
AI-Based Product Matching Process

AI-Based Product Matching Process

Iweb Data Scraping implemented a 4-layer product matching model:

Layer 1: Title Normalization

  • Remove stop words
  • Standardize units (g → grams)
  • Normalize pack count
  • Convert text to lowercase

Layer 2: Attribute Extraction

Extract structured fields:

  • Brand
  • Variant
  • Weight
  • Quantity
  • Pack Count

Layer 3: Similarity Score

Compute similarity using:

  • Cosine similarity
  • Levenshtein distance
  • Token overlap
  • Fuzzy matching

Layer 4: Image Matching

  • Compare product images
  • Detect packaging version differences

Unified SKU Output

After matching:

{
  "unified_sku_id": "MAGGI_70G_4PACK_MASALA",
  "brand": "Nestle",
  "product_name": "Maggi 2-Minute Masala Noodles",
  "pack_size": "70g",
  "pack_count": 4,
  "amazon_price": 52,
  "flipkart_price": 50,
  "bigbasket_price": 54,
  "lowest_price": 50,
  "highest_price": 54,
  "price_variation_percent": 8,
  "availability_status": {
    "amazon": "In Stock",
    "flipkart": "In Stock",
    "bigbasket": "Available"
  }
}
Unified-SKU-Output
Key-Challenges-Solved

Key Challenges Solved

1. Inconsistent Naming Conventions

Solved through NLP-based parsing.

2. Missing EAN Codes

Used attribute-level matching.

3. Pack Size Confusion

Converted units and extracted pack multipliers.

4. Combo Products

Separated bundle SKUs.

5. Rapid Price Fluctuations

Implemented hourly scraping API refresh.

SEO-Focused Marketplace Monitoring Keywords

This project leveraged and targeted:

  • Nestlé product scraping API
  • FMCG data extraction services
  • E-commerce product matching solution
  • SKU matching automation
  • Amazon Flipkart price comparison scraping
  • BigBasket grocery data scraping
  • Digital shelf analytics
  • Real-time marketplace price tracking
  • Automated product catalog matching
  • Retail price intelligence solution
SEO-Focused-Marketplace-Monitoring-Keywords
Results-Delivered

Results Delivered

1. SKU Matching Accuracy

98.7% accurate matching across platforms.

2. Price Monitoring Efficiency

Reduced manual work by 75%.

3. Duplicate Seller Detection

Identified 11% unauthorized sellers.

4. Price Variation Insights

Detected up to 12% marketplace variance.

5. Real-Time API Delivery

Integrated into Nestlé analytics dashboard.

Business Impact

  • Better price parity control
  • Faster promotional reaction
  • Improved digital shelf visibility
  • Reduced channel conflict
  • Improved forecasting
Business-Impact
Dashboard-View-Example

Dashboard View Example

The client received:

  • Cross-platform price comparison table
  • SKU-level discount heatmap
  • Out-of-stock alerts
  • Unauthorized seller alerts
  • Weekly price movement trend

Future Enhancements

Iweb Data Scraping proposed:

  • Regional city-level price scraping
  • Warehouse-level stock mapping
  • Promotion tagging automation
  • AI-based competitor brand comparison
Future-Enhancements
Why-Iweb-Data-Scraping

Why Iweb Data Scraping?

Iweb Data Scraping specializes in:

  • E-commerce data scraping
  • Retail price monitoring APIs
  • FMCG digital shelf analytics
  • Real-time SKU matching
  • Marketplace data intelligence
  • Custom scraping API development

The team ensures:

  • Scalable infrastructure
  • Anti-bot bypass expertise
  • Clean structured datasets
  • Enterprise-grade data security

Final Conclusion

Automated product matching across Amazon, Flipkart, and BigBasket is no longer optional for FMCG brands.

Without intelligent SKU mapping and real-time scraping APIs, brands risk inaccurate analytics, pricing conflicts, and revenue leakage.

Iweb Data Scraping delivered a scalable AI-driven marketplace data scraping and SKU matching system that transformed Nestlé’s digital commerce visibility.

From inconsistent listings to unified SKUs, from fragmented price data to real-time intelligence — the transformation was data-driven, automated, and measurable.

Final-outcome

Let’s Talk About Product

What's Next?

We start by signing a Non-Disclosure Agreement (NDA) to protect your ideas.

Our team will analyze your needs to understand what you want.

You'll get a clear and detailed project outline showing how we'll work together.

We'll take care of the project, allowing you to focus on growing your business.