Back to selected work
Big Data · 2026

Flight Ticket Meta Search & Analysis

Big-data internship project at Suncaper Data (State Information Center base) — collected, cleaned and analysed large-scale flight-ticket datasets to surface pricing & demand insights for stakeholders.

Stack
4 tools
Outcome
Awarded Excellent rating
PythonPandasData pipelinesAnalytics
01Overview

Industry-rated big-data internship project completed at Chengdu Suncaper Data — the Big Data Innovation & Entrepreneurship Base of the State Information Center.

I owned the end-to-end pipeline: collecting raw flight-ticket data from multiple sources, cleaning and normalising it into a unified schema, then running price-trend, route-popularity and seasonal-demand analyses to surface insights stakeholders could act on.

The project was awarded an overall evaluation of Excellent — the highest grade given to interns on that programme.

02What I built
  • Multi-source data collection

    Reliable scrapers and importers across multiple ticketing data sources, with retry logic and rate-limit handling.

  • Cleaning & normalisation

    Pandas-based ETL pipeline that deduplicates, enriches and validates records before they hit the analysis layer.

  • Pricing & demand analytics

    Trend analysis on routes, seasons and price elasticity — packaged as charts that non-technical stakeholders can read in seconds.

03Outcomes
  • Awarded an overall Excellent evaluation by the supervising team
  • Processed and cleaned millions of flight-ticket records into a single analytics-ready table
  • Surfaced pricing & demand insights through clear, decision-ready dashboards
· Next case study
Lead-research automation pipeline

For a small B2B agency.

· Ready when you are

Ship your first automation this week.

Free 30-min discovery call. If we’re a fit, you’ll have a working v1 in your inbox before next weekend.