Wrapped up the second project in Udacity’s Data Analyst Nanodegree last night. In a nutshell, the goal of the project was to:
- Download a .osm file larger than 50 MB covering any geographic area in the world (I used Denver-Boulder, CO, which is 616 MB)
- Parse the .osm file and clean the data specific to the desired data model
- Import the data into MongoDB
- Analyze the dataset
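
To make those steps concrete, here is a minimal sketch of the parse, clean, and analyze stages in Python. It is not the code from my repository: the sample XML, the document layout, and the helper names (`shape_element`, `clean_elevation`, `tallest_peak`) are all made up for illustration, and it assumes elevations are stored in an `ele` tag in meters with a comma used as a thousands separator.

```python
import io
import xml.etree.ElementTree as ET

# Tiny made-up OSM extract so the sketch runs without a real download.
SAMPLE_OSM = """<?xml version="1.0" encoding="UTF-8"?>
<osm version="0.6">
  <node id="1" lat="40.0" lon="-105.0" user="alice">
    <tag k="natural" v="peak"/>
    <tag k="name" v="Example Peak"/>
    <tag k="ele" v="4000"/>
  </node>
  <node id="2" lat="40.1" lon="-105.1" user="bob">
    <tag k="natural" v="peak"/>
    <tag k="name" v="Smaller Peak"/>
    <tag k="ele" v="3,500 m"/>
  </node>
  <way id="3" user="carol">
    <nd ref="1"/>
    <nd ref="2"/>
    <tag k="highway" v="path"/>
  </way>
</osm>
"""


def shape_element(elem):
    """Turn a <node> or <way> element into a dict (hypothetical data model)."""
    if elem.tag not in ("node", "way"):
        return None
    doc = {"type": elem.tag, "id": elem.get("id"),
           "user": elem.get("user"), "tags": {}}
    if elem.get("lat") is not None and elem.get("lon") is not None:
        doc["pos"] = [float(elem.get("lat")), float(elem.get("lon"))]
    for tag in elem.iter("tag"):
        doc["tags"][tag.get("k")] = tag.get("v")
    refs = [nd.get("ref") for nd in elem.iter("nd")]
    if refs:
        doc["node_refs"] = refs
    return doc


def parse_osm(source):
    """Stream the file with iterparse so a large extract is not loaded whole."""
    docs = []
    for _, elem in ET.iterparse(source, events=("end",)):
        doc = shape_element(elem)
        if doc is not None:
            docs.append(doc)
            elem.clear()  # free the element's children once it has been shaped
    return docs


def clean_elevation(raw):
    """Parse values like '4000' or '3,500 m' into meters; None if unparseable."""
    if raw is None:
        return None
    # Assumption: a comma is a thousands separator and the unit is meters.
    text = raw.lower().replace(",", "").replace("m", "").strip()
    try:
        return float(text)
    except ValueError:
        return None


def tallest_peak(docs):
    """Return name, elevation, and contributor of the highest natural=peak."""
    peaks = []
    for d in docs:
        if d["tags"].get("natural") == "peak":
            ele = clean_elevation(d["tags"].get("ele"))
            if ele is not None:
                peaks.append((ele, d))
    if not peaks:
        return None
    ele, doc = max(peaks, key=lambda p: p[0])
    return {"name": doc["tags"].get("name"), "ele_m": ele, "user": doc["user"]}


docs = parse_osm(io.BytesIO(SAMPLE_OSM.encode("utf-8")))
top = tallest_peak(docs)
print(top)
```

With a real extract you would pass the path of the .osm file to `parse_osm` instead of the in-memory sample. The resulting list of dicts could then be loaded into MongoDB, for example with PyMongo's `insert_many`, and the same question asked as a database query rather than in plain Python.
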
Admittedly, it took me a few weeks to complete the course and project. I kept all the code for the course and project in this GitHub repository. It also contains a PDF write-up of the process and findings.

Since the analysis was focused on a geographic location in the Rocky Mountains, I decided to explore the trails and peaks. The tallest peak in the data was Longs Peak, measuring 4,340 meters (14,240 ft.) tall, recorded by HolgerJeromin.

I enjoyed the project a lot and I worked hard on it… but I doubt it was as good as the Udacity reviewer made it sound. Would love to discuss!