Instructor Name

Fatai A. Abdusalam

Category

School of Data

Reviews

0 (0 Rating)

Course Description

Program Description

  • This program will teach you the fundamental concepts and skills of data science, using Python as the programming language. You will learn about the data science workflow, mathematics and statistics for data science, Python programming, data manipulation, data visualization, exploratory data analysis, machine learning foundations, advanced machine learning, Git and GitHub, and deployment.

    -Manipulate and clean data
    -Create informative visualizations
    -Build and train machine learning models
    -Deploy your models to production
    -Perform exploratory data analysis

    What You Will Learn

      • The basics of data science, including its applications, the data science workflow, and mathematics and statistics for data science.
      • Python programming, including basic programming concepts, data types, variables, operators, control structures, functions, modules, and object-oriented programming.
    • Data manipulation with Python, using NumPy and Pandas for numerical data manipulation and data cleaning and preprocessing
    • Data visualization with Python, using Matplotlib and Seaborn to create informative and visually appealing data visualizations
    • Exploratory data analysis (EDA) with Python, using Pandas and NumPy to identify patterns and trends in data
    • Machine learning foundations, including supervised learning, unsupervised learning, model evaluation, and validation
    • Advanced machine learning algorithms, including linear regression, logistic regression, decision trees, random forests, gradient boosting, and K-means clustering
    • Model selection and hyperparameter tuning
    • Git and GitHub for version control and collaboration
    • Deployment of data science projects using Flask and Streamlit

Course Curriculum

1 Course Introduction
9 Min

Welcome to Module 1 of our comprehensive Data Science course! In this module, we embark on an exciting journey into the world of data science, unraveling its core concepts, applications, and transformative impact across various industries.


2 Environment Setup
22 Min

This brief course guides you through the quick and effective setup of a Python environment using Anaconda and Jupyter. Learn to install Anaconda, manage Python packages effortlessly, and harness the power of Jupyter notebooks for interactive coding and visualization. Kickstart your Python journey with a streamlined and efficient development environment.


3 Jupyter Overview
41 Min

This introductory course is designed for beginners to learn the fundamentals of Python programming using Jupyter Notebook. Explore essential concepts such as data types, variables, and operators. Gain hands-on experience with Python syntax, understand how to declare variables, and grasp the basics of operators for effective programming. Build a strong foundation in Python to kickstart your coding journey.


1 Data Types
51 Min

Quickly grasp Python data types, variables, and fundamental structures. Ideal for beginners, this concise course ensures a strong foundation for efficient programming. Enroll for rapid mastery.


2 Comparison Operators (If, Elif, Else Statement)
14 Min

This course covers the use of comparison operators (e.g., ==, !=, >) in Python's "if," "elif," and "else" statements. Learn to make decisions in your code efficiently and master essential programming logic.


3 For Loops, While Loops, and Range
21 Min

Explore the power of loops in Python with a focus on "for" and "while" loops, along with the versatile "range" function. Learn to iterate through sequences, automate repetitive tasks, and efficiently manage code flow.


4 List Comprehension
6 Min

Dive into the concise and powerful world of Python list comprehensions. This course equips you with the skills to create compact and expressive code for creating lists, eliminating the need for traditional loops. Master the art of efficient and readable list creation in Python.


1 Functions
11 Min

Unlock the full potential of Python by mastering functions. This course covers the creation, utilization, and optimization of functions, providing essential skills for modular and efficient programming. Learn to design reusable code and enhance the clarity and scalability of your Python projects.


2 Lambda Expression
4 Min

Delve into the world of lambda expressions in Python through this concise course. Learn how to create anonymous functions for streamlined and efficient code. Explore practical applications and understand how lambda expressions contribute to writing expressive and functional Python program


3 Map and Filter
11 Min

This course explores the powerful functions of map and filter in Python. Discover how to apply functions to iterable elements using map and selectively filter data with filter. Streamline your code and enhance your proficiency in data manipulation and processing using these versatile tools.


4 Methods
7 Min

Unlock the versatility of Python through a comprehensive exploration of essential methods. This course covers a wide array of built-in methods for strings, lists, dictionaries, and more. Learn how to manipulate data, streamline workflows, and optimize your code with these fundamental Python methods.


1 Numpy Arrays
29 Min

This course focuses on the fundamental concepts of NumPy arrays, a core component of numerical computing in Python. Explore array creation, manipulation, and operations, gaining proficiency in handling large datasets efficiently. Elevate your scientific computing skills by mastering NumPy arrays for data analysis, machine learning, and scientific research.


2 Numpy Indexing and Selections
22 Min

This course provides an in-depth exploration of NumPy's powerful indexing and selection capabilities. Dive into array manipulation techniques, including slicing, boolean indexing, and fancy indexing. Gain the skills to efficiently extract, modify, and analyze data, essential for advanced scientific computing, data manipulation, and machine learning tasks.


3 Numpy Operations
6 Min

Unlock the potential of NumPy's versatile operations and array manipulation with this comprehensive course. Explore essential functions for mathematical operations, broadcasting, and element-wise manipulation. Learn to transform and analyze data efficiently, providing a strong foundation for scientific computing, data analysis, and machine learning applications in Python.


1 Introduction to Pandas
1 Min

This course offers a foundational introduction to Pandas, a powerful Python library for data manipulation and analysis. Explore key concepts such as Series, DataFrames, and Indexing. Learn how to handle and clean data, perform basic statistical operations, and gain the skills necessary for effective data exploration and manipulation using Pandas.


2 Data Input and Output
15 Min

Delve into the essential aspects of data input and output with Pandas in this comprehensive course. Learn to seamlessly read and write data in various formats, including CSV, Excel, SQL databases, and more. Explore techniques for importing and exporting data, enabling efficient data processing and analysis within your Python projects.


3 DataFrames
27 Min

This course focuses on DataFrames, a core component of Pandas, providing a deep dive into their structure, manipulation, and analysis. Learn to create, modify, and analyze tabular data efficiently, gaining essential skills for data exploration, cleaning, and transformation in Python.


4 Groupby
10 Min

Explore the power of Pandas GroupBy in this specialized course, where you'll learn to aggregate, transform, and analyze data based on different criteria. Master the art of grouping data efficiently, gaining valuable skills for data segmentation, statistical analysis, and insightful data exploration in Python.


5 Merging, Joining, and Concatenating
13 Min

Dive into the intricacies of Pandas as you master the techniques of merging, joining, and concatenating DataFrames. This course equips you with the skills to combine and manipulate data effectively, enabling seamless integration of disparate datasets for thorough analysis and exploration in Python.


6 Missing Data
7 Min

his course focuses on strategies for handling missing data using Pandas, a crucial skill in data analysis. Learn effective techniques to identify, handle, and impute missing values, ensuring your data is ready for meaningful analysis and visualization in Python.


7 Operations
13 Min

This course explores advanced data manipulation techniques using the Pandas library in Python. Topics include data cleaning, aggregation, filtering, and merging datasets. Students will learn how to perform complex data operations efficiently and effectively using Pandas, a powerful tool for data analysis and manipulation. Prerequisites: Basic Python programming skills and familiarity with data structures.


8 Series
11 Min

This course focuses specifically on the Pandas Series data structure in Python. Students will learn how to create, manipulate, and analyze one-dimensional labeled arrays using Pandas Series. Topics include indexing, slicing, arithmetic operations, and data alignment. By the end of the course, students will have a solid understanding of how to work with Series objects effectively for data analysis tasks. Prerequisites: Basic Python programming skills.


1 Matplotlib Concepts
38 Min

This course delves deeply into Matplotlib, one of the most powerful and versatile plotting libraries in Python. Students will explore fundamental concepts and advanced techniques for creating high-quality visualizations for data analysis and presentation. Topics include understanding the anatomy of a Matplotlib figure, creating various types of plots such as line plots, scatter plots, bar plots, and histograms, customizing plot aesthetics with colors, markers, and labels, adding annotations and text to plots, working with multiple subplots and axes, and saving plots to different file formats. By the end of the course, students will have gained the skills to effectively communicate insights and tell compelling stories through data visualization using Matplotlib. Prerequisites: Basic Python programming skills and familiarity with NumPy arrays.


2 Categorical Plots
21 Min

This course focuses on mastering categorical plots using Seaborn in Python. Participants will learn to visualize categorical data effectively, covering various plot types like bar plots, point plots, and box plots. Topics include handling missing data, customizing aesthetics, and creating multi-panel visualizations. By the end of this course course, students will proficiently communicate insights from categorical data. Prerequisites: Basic Python skills and familiarity with Pandas.


3 Distribution Plots
12 Min

This course is dedicated to mastering distribution plots with Seaborn in Python. Participants will learn to visualize data distributions effectively using techniques such as histograms, kernel density estimation (KDE), and rug plots. Topics include exploring univariate and bivariate distributions, fitting parametric distributions, and customizing plot aesthetics. By the course's conclusion, students will adeptly communicate insights derived from data distributions. Prerequisites: Basic Python skills and familiarity with data manipulation using Pandas.


4 Grids
10 Min

This course focuses on harnessing the power of grids in data visualization using Seaborn in Python. Participants will explore the versatility of grids for creating multi-plot layouts and effectively visualizing complex datasets. Topics include understanding grid types such as FacetGrid and PairGrid, creating grid-based plots for different data structures, customizing grid aesthetics, and incorporating advanced techniques like conditional plotting and data aggregation. By the end of the course, students will possess the skills to create insightful visualizations using grids that facilitate clear interpretation of complex data relationships. Prerequisites: Basic Python programming skills and familiarity with data manipulation using Pandas.


5 Matrix Plots
9 Min

This course delves into Matrix Plots using Seaborn in Python, focusing on visualizing relationships within large datasets. Participants will learn to create various matrix plot types such as heatmaps and clustermaps to uncover patterns and correlations in data. Topics include customizing plot aesthetics, handling missing data, and interpreting matrix plots effectively. By the end of the course, students will be equipped to visualize complex data structures and derive actionable insights using matrix plots. Prerequisites: Basic Python programming skills and familiarity with data manipulation using Pandas.


6 Style and Color
7 Min

This course focuses on mastering Style and Color in data visualization with Python. Participants will learn advanced techniques for manipulating plot aesthetics, selecting color palettes, and customizing visual styles to create impactful and informative plots. By the end, students will be equipped to craft polished and visually appealing visualizations that effectively convey insights from complex datasets. Prerequisites: Basic Python programming skills and familiarity with plotting libraries like Matplotlib and Seaborn.


7 Regression Plots
7 Min

This course dives into Regression Plots using Python, particularly with Matplotlib and Seaborn libraries. Participants will explore various regression techniques and learn to create insightful plots to visualize relationships between variables in datasets. Topics include simple linear regression, multiple regression, and polynomial regression, along with techniques for interpreting regression plots effectively. By the end, students will be adept at using regression plots to analyze data, make predictions, and derive actionable insights. Prerequisites: Basic Python programming skills and familiarity with data manipulation using Pandas.


1 Introduction to Machine learning
7 Min

"Introduction to Machine Learning" provides an overview of fundamental concepts, techniques, and applications in machine learning. Topics include supervised and unsupervised learning, classification, regression, clustering, and evaluation methods. Students will gain hands-on experience through practical exercises and projects, enabling them to understand how machine-learning algorithms work and how to apply them to real-world problems.


2 Supervised learning vs Unsupervised learning
17 Min

This course explores the key distinctions between supervised and unsupervised learning in machine learning. Students will delve into supervised learning, where models are trained on labeled data to make predictions, and unsupervised learning, which focuses on finding patterns and structures in unlabeled data without explicit guidance. Through practical exercises, learners will grasp the applications and limitations of each approach.


3 Cross Validation and Bias Variance
11 Min

This course explores cross-validation, a technique to evaluate model performance, and Bias-Variance tradeoff, balancing model complexity and flexibility. Participants gain insights into validating models effectively and managing the tradeoff for robust machine learning solutions.


1 Linear Regression with Python
34 Min

This course delves into the application of cross-validation techniques in the context of Linear Regression using Python. Participants will gain a deep understanding of how to effectively evaluate model performance through cross-validation, a pivotal technique in the realm of machine learning. Additionally, the course will delve into the intricate balance between bias and variance, crucial for optimizing model complexity and flexibility. Through hands-on exercises and practical examples, participants will learn how to navigate the bias-variance tradeoff to develop robust Linear Regression models. By the end of the course, participants will possess the skills needed to validate models effectively and make informed decisions regarding model complexity for creating reliable machine-learning solutions.


1. Python Crash Course Exercises
2. Numpy Exercise
3. PANDAS 1 - Ecommerce Purchases Exercise
4. Ecommerce Purchases Exercise Data
5. PANDAS 2 - SF Salaries Exercise
6. SF Salaries Exercise Data

Instructor

Fatai A. Abdusalam

0 Rating
0 Reviews
2 Students
1 Courses

Student Feedback

Data Science

0

Course Rating
0.00%
0.00%
0.00%
0.00%
0.00%

No Review found

Sign In or Sign Up as student to post a review

Reviews