Trajectory Data Processing

Project information

Summary: The project mainly aimed at pre-processing the trajectory path of taxi data, collected using GPS, conducted different studies on processing the dataset.

Tools: Python, NumPy, SciPy, Matplotlib, and Pandas

Duties: It was a team project. I performed detailed exploratory data analysis, identified some crucial problems in the dataset, and performed one of the steps in pre-processing which included handling of infeasible values by basically removing outliers that were identified outside the region of interest on the map for this study.

Outcome: Got hands-on experience on the most prominent part of the data science life cycle which was pre-processing. It includes formatting the data appropriately which could be useful and feasible in performing efficient predictions.

Result: Obtained High Distinction score for this project.