|
| 1 | +# Analyzing and Optimizing Airplane Connectivity |
| 2 | + |
| 3 | +This project focuses on analyzing and optimizing airplane connectivity through Graph, Machine Learning, and Linear Programming algorithms to improve operational efficiency, customer satisfaction, and economic performance. |
| 4 | + |
| 5 | + |
| 6 | +## Table of Contents |
| 7 | +- [Project Overview](#project-overview) |
| 8 | +- [Problem Statement and Objectives](#problem-statement-and-objectives) |
| 9 | +- [Importance of the Study](#importance-of-the-study) |
| 10 | +- [Dataset](#dataset) |
| 11 | +- [Key Analyses](#key-analyses) |
| 12 | + - [Predicting Flight Arrival Delays](#predicting-flight-arrival-delays) |
| 13 | + - [Airport Profiling (Clustering Analysis)](#airport-profiling-clustering-analysis) |
| 14 | + - [Shortest Path between Two Airports](#shortest-path-between-two-airports) |
| 15 | + - [Optimize Airport Connectivity within a State](#optimize-airport-connectivity-within-a-state) |
| 16 | + - [Optimization of Flight Schedules to Minimize Delay Penalties](#optimization-of-flight-schedules-to-minimize-delay-penalties) |
| 17 | + |
| 18 | +## Project Overview |
| 19 | + |
| 20 | +In this project, we leverage a detailed dataset of nationwide flight operations to tackle key challenges in the aviation industry, aiming to enhance operational efficiency, customer satisfaction, and economic performance. |
| 21 | + |
| 22 | +## Problem Statement and Objectives |
| 23 | + |
| 24 | +The aviation industry faces numerous challenges that directly affect operational efficiency, customer satisfaction, and economic performance. This project aims to address five key problems: |
| 25 | + |
| 26 | +1. **Predict Delays:** Develop predictive models for flight delays to improve passenger satisfaction and optimize airline operations. |
| 27 | +2. **Cluster Airports and Identify Hotspots:** Identify and profile hotspot airports to understand regional dynamics and propose targeted improvements. |
| 28 | +3. **Find the Shortest Route in Terms of Flying Time:** Enhance route planning to save fuel, reduce carbon emissions, and better allocate resources. |
| 29 | +4. **Optimize Routes Within a State:** Improve intra-state connectivity to boost local economies by enhancing access to markets. |
| 30 | +5. **Optimize Schedule by Minimizing Total Delay Penalty:** Develop strategies to minimize total delay across the network and maximize flights. |
| 31 | + |
| 32 | +## Importance of the Study |
| 33 | + |
| 34 | +1. **Economic Efficiency:** Reducing delays and optimizing routes can lower operational costs significantly. |
| 35 | +2. **Environmental Impact:** Optimizing flight paths and reducing delays can lessen the aviation industry's environmental footprint. |
| 36 | +3. **Passenger Experience:** Enhancing predictability and efficiency can increase customer satisfaction and loyalty. |
| 37 | +4. **Strategic Planning:** Inform long-term infrastructure and policy decisions for airport authorities and government agencies. |
| 38 | + |
| 39 | +## Dataset |
| 40 | + |
| 41 | +### Flight Data Attributes Description |
| 42 | +The dataset provides a comprehensive overview of flight information, including operational details, timings, and delays. Key attributes include: |
| 43 | + |
| 44 | +- **Date and Identification:** FL_DATE, OP_CARRIER_AIRLINE_ID, TAIL_NUM, OP_CARRIER_FL_NUM |
| 45 | +- **Airport and Location Codes:** ORIGIN_AIRPORT_ID, ORIGIN, DEST_AIRPORT_ID, DEST |
| 46 | +- **Timing and Delays:** CRS_DEP_TIME, DEP_TIME, DEP_DELAY, ARR_TIME, ARR_DELAY, CANCELLED |
| 47 | +- **Performance Metrics:** CRS_ELAPSED_TIME, ACTUAL_ELAPSED_TIME, CARRIER_DELAY, WEATHER_DELAY, NAS_DELAY, SECURITY_DELAY, LATE_AIRCRAFT_DELAY |
| 48 | + |
| 49 | +## Key Analyses |
| 50 | + |
| 51 | +### Predicting Flight Arrival Delays |
| 52 | +- **Objective:** Develop a predictive model for flight delays to improve scheduling reliability. |
| 53 | +- **Methods:** Used Gradient Boosting Classifier with feature engineering and SMOTE for class imbalance. |
| 54 | +- **Outcome:** Achieved an accuracy of 71.01% and an ROC AUC of 76.85%. |
| 55 | + |
| 56 | +### Airport Profiling (Clustering Analysis) |
| 57 | +- **Objective:** Identify patterns in airport operations related to delays and cancellations. |
| 58 | +- **Methods:** Used K-means clustering to group airports based on delay and cancellation characteristics. |
| 59 | +- **Outcome:** Identified four clusters with unique delay and cancellation patterns. |
| 60 | + |
| 61 | +### Shortest Path between Two Airports |
| 62 | +- **Objective:** Determine the shortest flying time between any two airports. |
| 63 | +- **Methods:** Implemented Dijkstra’s Algorithm using NetworkX. |
| 64 | +- **Outcome:** Provided actionable insights into the fastest routes based on historical data. |
| 65 | + |
| 66 | +### Optimize Airport Connectivity within a State |
| 67 | +- **Objective:** Enhance intra-state connectivity by finding the most efficient routes. |
| 68 | +- **Methods:** Used Kruskal’s algorithm to compute the Minimum Spanning Tree (MST) of the airport network. |
| 69 | +- **Outcome:** Achieved a significant reduction in total delay by over 93%. |
| 70 | + |
| 71 | +### Optimization of Flight Schedules to Minimize Delay Penalties |
| 72 | +- **Objective:** Optimize flight schedules to reduce delay penalties and maximize flights. |
| 73 | +- **Methods:** Used linear programming with penalty cost minimization. |
| 74 | +- **Outcome:** Reduced total penalty costs by 20.15% with minimal reduction in total flights. |
0 commit comments