Instacart Project

Instacart Sales Analysis

Business Context

Instacart, a leading app-based online grocery store, sought to gain deeper insights into its sales patterns to optimize operations and marketing strategies. This analysis explores customer behavior, purchasing patterns, and demographic trends to provide actionable recommendations.

Objective

Conduct comprehensive data exploration and analysis to uncover insights and recommend strategies for improved customer segmentation and targeted marketing initiatives

Population Flow

The following population flow diagram illustrates the data processing pipeline used in this analysis:

Orders Dataset

→  3,421,083 rows

→  7 columns

Orders Products Prior Dataset

→  32,434,489 rows

→  4 columns

Products Dataset

49,693 rows

4 columns

Customers Dataset

 206,209 rows

→  10 columns

Data Merging Process

orders + orders_products_prior → order_products_combined → + products → orders_products_mergred → + customers → orders_products_all

Exclusion flag:

max_order ≤ 5

Observations to be removed: low activity customers
Total rows is 30M after exclusion

Exclusion not applied for all analysis to maintain sample diversity

Population Flow Diagram

Technical Approach

Data Sources

Open-source Instacart datasets + custom customer dataset

The customers dataset was created by CareerFoundry Team for for the purpose of this project. 

  • Download the Instacart datasets here.
  • Download the customers dataset here.

Data Processing

Wrangling, merging, and deriving variables

Analysis Methods

Grouping, aggregation, and population flow analysis

Visualization

Python Visualizations and Excel reporting

Analysis & Insights

1. Optimal Ad Scheduling: When are customers most active?

Finding: Peak Shopping Times Identified

Busiest days: Saturdays (0) and Sundays (1)

Peak hours: 10 AM – 4 PM

Schedule ads during off-peak hours to maximize visibility when fewer orders compete for attention

2. Spending Patterns: When do customers spend the most?

Finding: Peak Spending Window

Highest spending occurs from 10 AM to early afternoon

Aligns with overall peak ordering times

> Opportunity to promote premium products during these high-value time slots

3. Price Segmentation: What price ranges drive the most purchases?

Finding: Mid-Range Products Dominate

Clear preference for affordable, everyday items

4. Product Popularity: Which departments and products lead in orders?

Finding: Fresh Produce Leads Sales

Top departments: Produce → Dairy & Eggs → Snacks

> Most ordered item: Bananas (453,164 orders)

> Strong preference for organic products in top 10 items

Health-conscious shopping behavior evident

Top_Products_by_Orders

5. Customer Loyalty: How do ordering habits vary by loyalty status?

Finding: Regular Customers Drive Volume

> Regular customers: > 10 && ≤ 40 orders/customer

> Loyal customers: > 40 orders/customer

> New customers: ≤ 10 orders/customer

> Key insight: Regular customer, low-spenders are the core customer base

6. Demographic Patterns: How do customer characteristics influence shopping?

Finding: Regional and Demographic Trends

 Top region: Southern

Primary Customer Profile: Southern Middle-aged customers with dependents

> 75% of orders come from customers with dependents (data not shown)

> Low-income customers drive 87.4% of sale transactions across all departments

7. Customer Profiles: What are the ordering habits of different segments?

Finding: Family Status Drives Behavior

The highest number of orders are placed by:

Frequent customers

Married customers

Customers with dependents

Impact Summary


10AM-4PM
Peak shopping window


87.4%
Orders from low-income segment


70.2%
Orders from married customers


$5-$15
Sweet spot price range

Strategic Recommendations

1. Optimize Ad Scheduling & Budget Allocation

> Schedule promotional campaigns during weekday mornings and evenings when competition for attention is lower
> Reserve premium ad slots for Saturday and Sunday mornings (9-11 AM) to capture pre-shopping attention
> Implement dynamic pricing strategies with slight discounts during off-peak hours to balance order volume

2. Targeted Product Promotion Strategy

> Feature mid-range products ($5-$15) prominently in marketing materials as they drive volume
Create “Family Value Bundles” combining popular produce items (bananas, organic vegetables) with dairy essentials
Develop organic product highlights given the strong preference shown in top-ordered items
Cross-sell snacks and beverages with produce orders through intelligent recommendations

3. Customer Segmentation & Personalization

> Create “Family Essentials” subscription programs targeting married customers with dependents

Develop budget-friendly meal planning tools for the dominant low-income segment

Design loyalty progression programs to convert regular customers (10-40 orders) into loyal customers (> 40 orders)

4. Regional Marketing Focus

> Allocate 40% of marketing budget to Southern and Midwestern regions where sales are highest

Partner with local Southern and Midwestern brands to increase regional relevance

Develop region-specific promotions that align with local shopping preferences and cultural events

Test expansion strategies in Western regions where dependent-based households show promise

5. New Customer Acquisition

> Create “First Family Shop” promotions targeting recently married couples and  new parents

Implement referral programs leveraging the large regular customer base

Develop simplified onboarding focusing on popular items (produce, dairy) to reduce choice paralysis

Offer incentives for new customers to reach 10+ orders and enter the regular customer segment

Project Impact

> This analysis provided Instacart with actionable insights into customer behavior, purchasing patterns, and demographic trends.

By understanding that their core customer base consists of budget-conscious families shopping primarily for fresh produce and everyday essentials, Instacart can optimize their marketing spend, improve customer targeting, and enhance the shopping experience for their most valuable segments.

The data-driven recommendations focus on leveraging existing strengths (high-volume regular customers, popular product categories) while identifying growth opportunities (converting regular to loyal customers, expanding in promising regions).

> This strategic approach ensures marketing efforts align with actual customer behavior rather than assumptions.

Deliverables

Population Flow & Excel Report can be found here

Codes, additional Visualizations & Insights can be found here