Skip to main content

Using GitHub for Data Scientists

Using GitHub for Data Scientists

Total video time: 44m
Award-winning instructor: Sara Anstey
View pricing 14-day money-back guarantee
Beginner No prior experience needed
Bite-sized content Learn at your own pace
Get certified Verified by GoSkills

What you’ll learn

Master version control in GitHub
Explore common GitHub terminology
Create and use repositories in GitHub
Create a GitHub portfolio
Resolve common merge issues that can arise
Connect with communities and resources in GitHub

Skills you’ll gain

Data analysis Version control

GitHub is the premier platform for managing version control. In this course from instructor and business intelligence architect Sara Anstey, you'll learn how to use GitHub as an essential skill in data science. you'll learn how to use GitHub both as a code repository and as a resource for connecting to the wider data science community. After reviewing the foundations of GitHub, you'll progress to practicing using key terminology, as well as unlocking how to use GitHub to facilitate collaboration across multiple data science projects and teams. you'll also learn how to create and clone repositories, resolve common merge issues, and communicate effectively with others through the GitHub interface. Finally, you'll learn how to contribute to open-source repositories and create a stand-out data science portfolio. By the end of this course, you will gain a full understanding of the functionality GitHub provides to data scientists.

  • 1
    Collaboration is the key 1m
  • 1
    What is GitHub? GitHub is one of the platforms you will likely encounter when working on projects with teams. 2m
  • 2
    How does GitHub work? GitHub's main function is to transfers and shares code among teams. 1m
  • 3
    Common GitHub terminology GitHub has many words and phrases that are unique to its platform. After this lesson you'll be able to use common definitions to operate in GitHub. 2m
  • 4
    Using GitHub for collaboration Data scientists are constantly collaborating on the same projects or tackling similar issues. 2m
  • 5
    Accessing learning resources As data science expands, so do the resources to understand it. 2m
  • 1
    Creating a repository in GitHub In order to collaborate with your peers and work on code in GitHub, you will first need to create a repository to hold the code. 2m
  • 2
    Cloning a repository in GitHub Once you’ve created a repository in GitHub, you will need to “clone” a version that is copied to your local machine. 1m
  • 3
    Branching in GitHub Branches are different versions of the code that live within the repository. 2m
  • 4
    Commit in GitHub A commit adds your latest changes to the source code to the repository. 2m
  • 5
    Pull requests in GitHub Pull requests let you notify others about changes you've pushed to a repository. 1m
  • 6
    Resolving common merge issues GitHub is a great collaboration tool, but sometimes there are issues with merging codes. 3m
  • 1
    The value of a GitHub portfolio Your data science portfolio highlights your skills and capabilities. 3m
  • 2
    Creating a powerful portfolio Creating a strong portfolio can establish you as a qualified data scientist. 3m
  • 1
    Finding collaborators to follow Collaboration is critical to getting the most out of GitHub. 1m
  • 2
    Contributing to open source projects Open source repositories are a great way to collaborate with others. After this lesson, you'll be able to contribute code to public repositories. 2m
  • 1
    Where to find answers If you're confused or have a question, someone else has likely had the same issue. 1m
  • 2
    Graphic user interfaces User interfaces make it easier to plug GitHub into your local machine. 3m
  • 1
    Moving beyond the basics of GitHub 1m

Certificate

Certificate of Completion

Awarded upon successful completion of the course.

Certificate sample

Instructor

Sara Anstey

Sara Anstey is a Business Intelligence Architect at Novacoast Inc., a cybersecurity-focused consulting firm. Sara is passionate about empowering businesses to use everyday data to make strategic business decisions. She believes that the intentional adoption of a data-driven culture can be a key differentiator to companies in today's corporate climate. Sara has experience in custom web development, artificial intelligence, data analytics, business intelligence, and applied statistics.

Business Intelligence Architect at Novacoast Sara Anstey

Sara Anstey

Business Intelligence Architect at Novacoast

Accreditations

Link to awards

How GoSkills helped Chris

I got the promotion largely because of the skills I could develop, thanks to the GoSkills courses I took. I set aside at least 30 minutes daily to invest in myself and my professional growth. Seeing how much this has helped me become a more efficient employee is a big motivation.

Chris Sanchez GoSkills learner
Chris Sanchez, GoSkills learner