Captone Project IBM (SpaceX dataset)

The following is a captone project of the course “IBM Data Science”. For more details about the Professional certificate, click here

ibm data science capstone project github spacex

Executive Summary

For this project, the data was collected from the public SpaceX API and the SpaceX Wikipedia page. Then,added a column called ‘class’ to classify successful landings. To analyze the data, SQL, visualizations, folium maps, and dashboards were used. The relevant columns as features for further analysis were selected.

Next, The categorical variables were transformed into binary using a technique called one hot encoding. The data was standardized and used GridSearchCV to find the best parameters for the machine learning models. Finally, the accuracy scores were visualized of all the models. Four machine learning models were considered: Logistic Regression, Support Vector Machine, Decision Tree Classifier, and K Nearest Neighbors. Surprisingly, all the models produced similar results, with an accuracy rate of around 83.33%. However, it’s important to note that all the models tended to over-predict successful landings. The following pdf file shows the final presentation for this project.

Tools: Python Management Studio,SQL Server Integration Services (SSIS) Languajes: SQL

Andrés Camilo Viloria García

Data scientist | data analyst | python | r | sql | machine learning | artificial intelligence | deep learning | tableau | power bi | business analyst.

To me, the magic of data science lies in the adventure of uncovering three amazing outcomes: discovery, insights, and innovation.

Winning Space Race with Data Science

November 1, 2022

This is the presentation of the capstone project in the IBM Data Science Professional Certificate .

Note that this presentation is much more detailed and technical than regular high-level and abstracted presentations for executive teams.

I assume the role of a Data Scientist working for a startup intending to compete with SpaceX , and in the process follow the Data Science methodology involving data collection, data wrangling, exploratory data analysis, data visualization, model development, model evaluation, and reporting results to stakeholders.

In this capstone, we will predict if the Falcon 9 first stage will land successfully, SpaceX advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore if we can determine if the first stage will land, we can determine the cost of a launch. This information can be used if an alternate company wants to bid against SpaceX for a rocket launch.

ibm data science capstone project github spacex

Executive Summary

ibm data science capstone project github spacex

Introduction

ibm data science capstone project github spacex

Methodology

ibm data science capstone project github spacex

Data collection API notebook

ibm data science capstone project github spacex

Web scraping notebook

ibm data science capstone project github spacex

Data wrangling notebook

ibm data science capstone project github spacex

EDA with Visualization notebook

ibm data science capstone project github spacex

EDA with SQL notebook

ibm data science capstone project github spacex

Launch Sites Locations Analysis with Folium notebook

ibm data science capstone project github spacex

Interactive Dashboard with Ploty Dash

ibm data science capstone project github spacex

Machine Learning Prediction notebook

ibm data science capstone project github spacex

Insights Drawn from EDA

ibm data science capstone project github spacex

Launch Sites Proximities Analysis

ibm data science capstone project github spacex

Build a Dashboard with Plotly Dash

ibm data science capstone project github spacex

Predictive Analysis (Classification)

ibm data science capstone project github spacex

Conclusions

ibm data science capstone project github spacex

For notebooks, datasets and scripts, follow this GitHub repository link: Applied Data Science Capstone

ibm data science capstone project github spacex

Instantly share code, notes, and snippets.

@spemmaraju5

spemmaraju5 / space_dash_app.py

  • Download ZIP
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Embed Embed this gist in your website.
  • Share Copy sharable link for this gist.
  • Clone via HTTPS Clone using the web URL.
  • Learn more about clone URLs
  • Save spemmaraju5/251897f19bac3fe8c554c664557f769c to your computer and use it in GitHub Desktop.

Nick Belgau

Nick Belgau

SpaceX: IBM Capstone

ibm data science capstone project github spacex

Leave a comment Cancel reply

' src=

  • Copy shortlink
  • Report this content
  • Manage subscriptions

IMAGES

  1. GitHub

    ibm data science capstone project github spacex

  2. GitHub

    ibm data science capstone project github spacex

  3. GitHub

    ibm data science capstone project github spacex

  4. GitHub

    ibm data science capstone project github spacex

  5. GitHub

    ibm data science capstone project github spacex

  6. GitHub

    ibm data science capstone project github spacex

VIDEO

  1. IBM Coursera Advanced Data Science Capstone

  2. IBM Coursera Advanced Data Science Capstone

  3. IBM Data Science capstone

  4. Aldie Adrian

  5. Aldie Adrian

  6. Data Science Capstone Project Spotlight: Language Detection App

COMMENTS

  1. GitHub

    You will assume the role of a Data Scientist working for a startup intending to compete with SpaceX, and in the process follow the Data Science methodology involving data collection, data wrangling, exploratory data analysis, data visualization, model development, model evaluation, and reporting your results to stakeholders. You are tasked with ...

  2. GitHub

    SpaceX can do this because the rocket launches are relatively inexpensive ($62 million per launch) due to its novel reuse of the first stage of its Falcon 9 rocket. Other providers, which are not able to reuse the first stage, cost upwards of $165 million each. By determining if the first stage will land, we can determine the price of the launch.

  3. adgsenpai/IBM-DataScience-SpaceX-Capstone

    About. I predicted if the Falcon 9 first stage will land successfully. SpaceX advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore, if we can determine if the first stage will land ...

  4. GitHub

    This capstone project course will give you a taste of what data scientists go through in real life when working with real datasets. You will assume the role of a Data Scientist working for a startup intending to compete with SpaceX, and in the process follow the Data Science methodology involving data collection, data wrangling, exploratory ...

  5. IBM Applied Data Science Capstone Project

    This capstone project course will give you a taste of what data scientists go through in real life when working with real datasets. You will assume the role of a Data Scientist working for a startup intending to compete with SpaceX, and in the process follow the Data Science methodology involving data collection, data wrangling, exploratory ...

  6. GitHub

    The date as of writing this readme is 2/19/2022. I hope you enjoy the project! The Denis O'Byrne IBM Data Science Capstone Project.pdf file is the presentation for the assignment. The notebook files were completed as part of the assignment and all data is hosted online except for the data for the dashboard project so that is provided in the ...

  7. GitHub

    IBM-Data-Science-Capstone-Project In this capstone, we will predict if the Falcon 9 first stage will land successfully. SpaceX advertises Falcon 9 rocket launches on its website with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage.

  8. GitHub

    Executive summary. In this capstone project, we will predict if the SpaceX Falcon 9 first stage will land successfully using several machine learning classification algorithms. The main steps in this project include: Data collection, wrangling, and formatting. Exploratory data analysis. Interactive data visualization. Machine learning prediction.

  9. applied-data-science-capstone-project-spacex-ibm/spacex_dash ...

    Contribute to SukanyaGuhaRoy/applied-data-science-capstone-project-spacex-ibm development by creating an account on GitHub.

  10. Captone Project IBM (SpaceX dataset)

    The following is a captone project of the course "IBM Data Science". For more details about the Professional certificate, click here. Full project on github, click here. Executive Summary. For this project, the data was collected from the public SpaceX API and the SpaceX Wikipedia page.

  11. SukanyaGuhaRoy/applied-data-science-capstone-project-spacex-ibm

    Find and fix vulnerabilities Codespaces. Instant dev environments

  12. PDF Data Science Capstone Project

    Perform exploratory Data Analysis and determine Training Labels. Calculate the number of launches on each site. Calculate the number and occurrence of each orbit. Calculate the number and occurrence of mission outcome per orbit type. Create a landing outcome label from Outcome column. Exporting the data to CSV. GitHub URL: Data Wrangling.

  13. IBM-Data-Science-Professional-Certificate-Capstone-Project ...

    The repository contains all of the resources in the IBM Data Science Professional Certificate Capstone Project. - odetara/IBM-Data-Science-Professional-Certificate-Capstone-Project

  14. Ibm data science capstone project-SpaceX launch analysis

    Ibm data science capstone project-SpaceX launch analysis. SpaceX advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. The project task is to predicting if the first stage of the SpaceX ...

  15. LVM

    Winning Space Race with Data Science. Data Science. Coursera. IBM. Capstone Project. Published. November 1, 2022. This is the presentation of the capstone project in the IBM Data Science Professional Certificate. Note that this presentation is much more detailed and technical than regular high-level and abstracted presentations for executive teams.

  16. IBM Data Science Capstone Project -Space X

    Methodology. Methodology. 6. • The following datasets was collected by. • We worked with SpaceX launch data that is gathered from the SpaceX REST API. • This API will give us data about ...

  17. IBM-Data-Science-Capstone-Project/spacex_web_scraped.csv at ...

    Contribute to SYEUNG114/IBM-Data-Science-Capstone-Project development by creating an account on GitHub.

  18. Applied Data Science Capstone Course by IBM

    This is the final course in the IBM Data Science Professional Certificate as well as the Applied Data Science with Python Specialization. This capstone project course will give you the chance to practice the work that data scientists do in real life when working with datasets. In this course you will assume the role of a Data Scientist working ...

  19. IBM Data Science Capstone Project Dashboard · GitHub

    IBM Data Science Capstone Project Dashboard. Raw. space_dash_app.py. # Import required libraries. import pandas as pd. import dash. import dash_html_components as html. import dash_core_components as dcc. from dash.dependencies import Input, Output.

  20. PDF Ibm Data Science Capstone Project

    DATA COLLECTION - SPACE X REST API. 1. Using the SpaceX API to retrieve data about launches, including. information about the rocket used, payload delivered, launch specifications, landing specifications, and landing outcome. 2. Make a GET response to the SpaceX REST API. Convert the response to a .json file then to a Pandas DataFrame.

  21. andreiramani/IBM-Data-Science-Capstone-Project

    This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. main

  22. SpaceX: IBM Capstone

    Example Data Analysis (Python) Tableau Portfolio; SQL & APIs; SpaceX: IBM Capstone; Inferential Generator; Engineering App for Android; About Me; SpaceX: IBM Capstone. Related. 7. Post navigation. Previous Project Engineering App for Android. Next Project Inferential Generator. Leave a comment Cancel reply.

  23. IBM-Data-Science-Capstone-Project/Spacex.csv at main

    Find and fix vulnerabilities Codespaces. Instant dev environments