
Curriculum
Curriculum
- 65 Sections
- 425 Lessons
- 96 Hours
- The Field of Data Science - The Various Data Science Disciplines5
- The Field of Data Science - Connecting the Data Science Disciplines1
- The Field of Data Science - The Benefits of Each Discipline1
- The Field of Data Science - Popular Data Science Techniques11
- 4.0Techniques for Working with Traditional Data
- 4.1Real Life Examples of Traditional Data
- 4.2Techniques for Working with Big Data
- 4.3Real Life Examples of Big Data
- 4.4Business Intelligence (BI) Techniques
- 4.5Real Life Examples of Business Intelligence (BI)
- 4.6Techniques for Working with Traditional Methods
- 4.7Real Life Examples of Traditional Methods
- 4.8Machine Learning (ML) Techniques
- 4.9Types of Machine Learning
- 4.10Real Life Examples of Machine Learning (ML)
- The Field of Data Science - Popular Data Science Tools1
- The Field of Data Science - Careers in Data Science1
- The Field of Data Science - Debunking Common Misconceptions1
- Probability4
- Probability - Combinatorics11
- 9.0Fundamentals of Combinatorics
- 9.1Permutations and How to Use Them
- 9.2Simple Operations with Factorials
- 9.3Solving Variations with Repetition
- 9.4Solving Variations without Repetition
- 9.5Solving Combinations
- 9.6Symmetry of Combinations
- 9.7Solving Combinations with Separate Sample Spaces
- 9.8Combinatorics in Real-Life: The Lottery
- 9.9A Recap of Combinatorics
- 9.10A Practical Example of Combinatorics
- Probability - Bayesian Inference12
- 10.0Sets and Events
- 10.1Ways Sets Can Interact
- 10.2Intersection of Sets
- 10.3Union of Sets
- 10.4Mutually Exclusive Sets
- 10.5Dependence and Independence of Sets
- 10.6The Conditional Probability Formula
- 10.7The Law of Total Probability
- 10.8The Additive Rule
- 10.9The Multiplication Law
- 10.10Bayes’ Law
- 10.11A Practical Example of Bayesian Inference
- Probability - Distributions15
- 11.0Fundamentals of Probability Distributions
- 11.1Types of Probability Distributions
- 11.2Characteristics of Discrete Distributions
- 11.3Discrete Distributions: The Uniform Distribution
- 11.4Discrete Distributions: The Bernoulli Distribution
- 11.5Discrete Distributions: The Binomial Distribution
- 11.6Discrete Distributions: The Poisson Distribution
- 11.7Characteristics of Continuous Distributions
- 11.8Continuous Distributions: The Normal Distribution
- 11.9Continuous Distributions: The Standard Normal Distribution
- 11.10Continuous Distributions: The Students’ T Distribution
- 11.11Continuous Distributions: The Chi-Squared Distribution
- 11.12Continuous Distributions: The Exponential Distribution
- 11.13Continuous Distributions: The Logistic Distribution
- 11.14A Practical Example of Probability Distributions
- Probability - Probability in Other Fields3
- Statistics1
- Statistics - Descriptive Statistics24
- 14.0Types of Data
- 14.1Levels of Measurement
- 14.2Categorical Variables – Visualization Techniques
- 14.3Categorical Variables Exercise
- 14.4Numerical Variables – Frequency Distribution Table
- 14.5Numerical Variables Exercise
- 14.6The Histogram
- 14.7Histogram Exercise
- 14.8Cross Tables and Scatter Plots
- 14.9Cross Tables and Scatter Plots Exercise
- 14.10Mean, median and mode
- 14.11Mean, Median and Mode Exercise
- 14.12Skewness
- 14.13Skewness Exercise
- 14.14Variance
- 14.15Variance Exercise
- 14.16Standard Deviation and Coefficient of Variation
- 14.17Standard Deviation
- 14.18Standard Deviation and Coefficient of Variation Exercise
- 14.19Covariance
- 14.20Covariance Exercise
- 14.21Correlation Coefficient
- 14.22Correlation
- 14.23Correlation Coefficient Exercise
- Statistics - Practical Example: Descriptive Statistics2
- Statistics - Inferential Statistics Fundamentals8
- Statistics - Inferential Statistics: Confidence Intervals3
- Confidence Interval Clarifications11
- 18.0Student’s T Distribution
- 18.1Confidence Intervals; Population Variance Unknown; T-score
- 18.2Confidence Intervals; Population Variance Unknown; T-score; Exercise
- 18.3Margin of Error
- 18.4Confidence intervals. Two means. Dependent samples
- 18.5Confidence intervals. Two means. Dependent samples Exercise
- 18.6Confidence intervals. Two means. Independent Samples (Part 1)
- 18.7Confidence intervals. Two means. Independent Samples (Part 1). Exercise
- 18.8Confidence intervals. Two means. Independent Samples (Part 2)
- 18.9Confidence intervals. Two means. Independent Samples (Part 2). Exercise
- 18.10Confidence intervals. Two means. Independent Samples (Part 3)
- Statistics - Practical Example: Inferential Statistics2
- Statistics - Hypothesis Testing15
- 20.0Null vs Alternative Hypothesis
- 20.1Further Reading on Null and Alternative Hypothesis
- 20.2Rejection Region and Significance Level
- 20.3Type I Error and Type II Error
- 20.4Test for the Mean. Population Variance Known
- 20.5Test for the Mean. Population Variance Known Exercise
- 20.6p-value
- 20.7Test for the Mean. Population Variance Unknown
- 20.8Test for the Mean. Population Variance Unknown Exercise
- 20.9Test for the Mean. Dependent Samples
- 20.10Test for the Mean. Dependent Samples Exercise
- 20.11Test for the mean. Independent Samples (Part 1)
- 20.12Test for the mean. Independent Samples (Part 1). Exercise
- 20.13Test for the mean. Independent Samples (Part 2)
- 20.14Test for the mean. Independent Samples (Part 2). Exercise
- Statistics - Practical Example: Hypothesis Testing2
- Introduction to Python9
- Python - Variables and Data Types3
- Python - Basic Python Syntax7
- Python - Other Python Operators2
- Python - Conditional Statements3
- Python - Python Functions8
- Python - Sequences5
- Python - Iterations6
- Python for Data Analysis - NumPy7
- Python for Data Analysis – Pandas10
- Python for Data Analysis - Pandas Exercises4
- Python for Data Visualization - Matplotlib6
- Python for Data Visualization - Seaborn9
- Python for Data Visualization - Pandas Built-in Data Visualization3
- Python for Data Visualization - Plotly and Cufflinks2
- Python for Data Visualization - Geographical Plotting5
- Big Data and Spark with Python12
- 38.0Welcome to the Big Data Section!
- 38.1Big Data Overview
- 38.2Spark Overview
- 38.3Local Spark Set-Up
- 38.4AWS Account Set-Up
- 38.5Quick Note on AWS Security
- 38.6EC2 Instance Set-Up
- 38.7SSH with Mac or Linux
- 38.8PySpark Setup
- 38.9Lambda Expressions Review
- 38.10Introduction to Spark and Python
- 38.11RDD Transformations and Actions
- Python - Advanced Python Tools4
- Advanced Statistical Methods in Python1
- Advanced Statistical Methods - Linear Regression with StatsModels8
- Decomposition of Variability2
- Advanced Statistical Methods - Multiple Linear Regression with StatsModels12
- 43.0Multiple Linear Regression
- 43.1Adjusted R-Squared
- 43.2Multiple Linear Regression Exercise
- 43.3Test for Significance of the Model (F-Test)
- 43.4OLS Assumptions
- 43.5A1: Linearity
- 43.6A2: No Endogeneity
- 43.7A3: Normality and Homoscedasticity
- 43.8A4: No Autocorrelation
- 43.9A5: No Multicollinearity
- 43.10Dealing with Categorical Data – Dummy Variables
- 43.11Making Predictions with the Linear Regression
- Advanced Statistical Methods - Linear Regression with sklearn19
- 44.0What is sklearn and how is it Different from Other Packages
- 44.1How are we Going to Approach this Section?
- 44.2Simple Linear Regression with sklearn
- 44.3Simple Linear Regression with sklearn – A StatsModels-like Summary Table
- 44.4A Note on Normalization
- 44.5Simple Linear Regression with sklearn – Exercise
- 44.6Multiple Linear Regression with sklearn
- 44.7Calculating the Adjusted R-Squared in sklearn
- 44.8Calculating the Adjusted R-Squared in sklearn – Exercise
- 44.9Feature Selection (F-regression)
- 44.10A Note on Calculation of P-values with sklearn
- 44.11Creating a Summary Table with P-values
- 44.12Multiple Linear Regression – Exercise
- 44.13Feature Scaling (Standardization)
- 44.14Feature Selection through Standardization of Weights
- 44.15Predicting with the Standardized Coefficients
- 44.16Feature Scaling (Standardization) – Exercise
- 44.17Underfitting and Overfitting
- 44.18Train – Test Split Explained
- Advanced Statistical Methods - Practical Example: Linear Regression8
- 45.0Practical Example: Linear Regression (Part 1)
- 45.1Practical Example: Linear Regression (Part 2)
- 45.2A Note on Multicollinearity
- 45.3Practical Example: Linear Regression (Part 3)
- 45.4Dummies and Variance Inflation Factor – Exercise
- 45.5Practical Example: Linear Regression (Part 4)
- 45.6Dummy Variables – Exercise
- 45.7Practical Example: Linear Regression (Part 5)
- Linear Regression – Exercise16
- 46.0Advanced Statistical Methods – Logistic Regression
- 46.1Introduction to Logistic Regression
- 46.2A Simple Example in Python
- 46.3Logistic vs Logit Function
- 46.4Building a Logistic Regression
- 46.5Building a Logistic Regression – Exercise
- 46.6An Invaluable Coding Tip
- 46.7Understanding Logistic Regression Tables
- 46.8Understanding Logistic Regression Tables – Exercise
- 46.9What do the Odds Actually Mean
- 46.10Binary Predictors in a Logistic Regression
- 46.11Binary Predictors in a Logistic Regression – Exercise
- 46.12Calculating the Accuracy of the Model
- 46.13Underfitting and Overfitting
- 46.14Testing the Model
- 46.15Testing the Model – Exercise
- Advanced Statistical Methods - Cluster Analysis4
- Advanced Statistical Methods - K-Means Clustering15
- 48.0K-Means Clustering
- 48.1A Simple Example of Clustering
- 48.2A Simple Example of Clustering – Exercise
- 48.3Clustering Categorical Data
- 48.4Clustering Categorical Data – Exercise
- 48.5How to Choose the Number of Clusters
- 48.6How to Choose the Number of Clusters – Exercise
- 48.7Pros and Cons of K-Means Clustering
- 48.8To Standardize or not to Standardize
- 48.9Relationship between Clustering and Regression
- 48.10Market Segmentation with Cluster Analysis (Part 1)
- 48.11Market Segmentation with Cluster Analysis (Part 2)
- 48.12How is Clustering Useful?
- 48.13EXERCISE: Species Segmentation with Cluster Analysis (Part 1)
- 48.14EXERCISE: Species Segmentation with Cluster Analysis (Part 2)
- Advanced Statistical Methods - Other Types of Clustering3
- Mathematics11
- 50.0What is a Matrix?
- 50.1Scalars and Vectors
- 50.2Linear Algebra and Geometry
- 50.3Arrays in Python – A Convenient Way to Represent Matrices
- 50.4What is a Tensor?
- 50.5Addition and Subtraction of Matrices
- 50.6Errors when Adding Matrices
- 50.7Transpose of a Matrix
- 50.8Dot Product
- 50.9Dot Product of Matrices
- 50.10Why is Linear Algebra Useful?
- Deep Learning1
- Deep Learning - Introduction to Neural Networks13
- 52.0Introduction to Neural Networks
- 52.1Training the Model
- 52.2Types of Machine Learning
- 52.3The Linear Model (Linear Algebraic Version)
- 52.4The Linear Model
- 52.5The Linear Model with Multiple Inputs
- 52.6The Linear model with Multiple Inputs and Multiple Outputs
- 52.7Graphical Representation of Simple Neural Networks
- 52.8What is the Objective Function?
- 52.9Common Objective Functions: L2-norm Loss
- 52.10Common Objective Functions: Cross-Entropy Loss
- 52.11Optimization Algorithm: 1-Parameter Gradient Descent
- 52.12Optimization Algorithm: n-Parameter Gradient Descent
- Deep Learning - How to Build a Neural Network from Scratch with NumPy5
- JavaScript0
- Deep Learning - TensorFlow2
- TensorFlow Outline and Comparison with Other Libraries4
- Interpreting the Result and Extracting the Weights and Bias2
- Deep Learning - Digging Deeper into NNs: Introducing Deep Neural Networks9
- Deep Learning - Overfitting6
- Deep Learning - Initialization3
- Deep Learning - Digging into Gradient Descent and Learning Rate Schedules7
- Deep Learning - Preprocessing5
- Deep Learning - Classifying on the MNIST Dataset12
- 63.0MNIST: The Dataset
- 63.1MNIST: How to Tackle the MNIST
- 63.2MNIST: Importing the Relevant Packages and Loading the Data
- 63.3MNIST: Preprocess the Data – Create a Validation Set and Scale It
- 63.4MNIST: Preprocess the Data – Scale the Test Data – Exercise
- 63.5MNIST: Preprocess the Data – Shuffle and Batch
- 63.6MNIST: Preprocess the Data – Shuffle and Batch – Exercise
- 63.7MNIST: Outline the Model
- 63.8MNIST: Select the Loss and the Optimizer
- 63.9MNIST: Learning
- 63.10MNIST – Exercises
- 63.11MNIST: Testing the Model
- Deep Learning - Business Case Example12
- 64.0Business Case: Exploring the Dataset and Identifying Predictors
- 64.1Business Case: Outlining the Solution
- 64.2Business Case: Balancing the Dataset
- 64.3Business Case: Preprocessing the Data
- 64.4Business Case: Preprocessing the Data – Exercise
- 64.5Business Case: Load the Preprocessed Data
- 64.6Business Case: Load the Preprocessed Data – Exercise
- 64.7Business Case: Learning and Interpreting the Result
- 64.8Business Case: Setting an Early Stopping Mechanism
- 64.9Setting an Early Stopping Mechanism – Exercise
- 64.10Business Case: Testing the Model
- 64.11Business Case: Final Exercise
- Deep Learning - Conclusion6
Overview
What is Data science?
Data science combines math and statistics, specialized programming, advanced analytics, artificial intelligence (AI), and machine learning with specific subject matter expertise to uncover actionable insights hidden in an organization’s data.
These insights can be used to guide decision making and strategic planning.
Why learn data science?
Data science has the potential to improve the way we live and work, and it can empower others to make better decisions, solve problems, discover new advancements, and address some of the world’s most pressing issues. With a data science career, you can be a part of this transformation.
- Demand: Data Scientists are in High Demand.
- Growth: Data Science Careers Have High Earning Potential.
- Job Security: Data Science is a Fast-Growing Field.
- Opportunity: Data Science Has a Range of Potential Job Opportunities.
- Flexibility: Data Scientists are Needed in Various Sectors.
Who is called a data scientist?
A data scientist is a professional who creates programming code and combines it with statistical knowledge to create insights from data.
How to learn data Science?
The best way to learn data science is to work on projects so you can gain data science skills that can be applied immediately and are useful from a real-world implementation perspective. The sooner you start working on diverse data science projects, the faster you will learn the related concepts.
TalhaTraining teaches you the essential concepts of the Various Data Science Disciplines like Data, Big Data, BI, Traditional Data Science and ML.
Training Objectives
We will teach you how to build technical knowledge and skills to be a Data Scientist.
Prerequisites
The course can be customized to any level of programming and relational database familiarity.
Hands-on/Lecture Ratio
This training class is 80% hands-on, and 20% lecture. Students learn by doing, with immediate opportunities to apply their learning material to real-world problems.
Training Materials
All related software and lecture sheets will provide in class.
Training Objectives
- Data Science – The Benefits
- Popular Data Science Techniques
- Popular Data Science Tools
- Careers in Data Science
- Debunking Common Misconceptions
- Probability
- Statistics
- Python Programming for Data Scientist
- Big Data and Spark with Python
- Advanced Statistical Methods
- Mathematics
- JavaScript for Data Scientist
- Deep Learning
Training Outline
The Field of Data Science – The Various Data Science Disciplines
- Data Science and Business Buzzwords: Why are there so Many?
- What is the difference between Analysis and Analytics?
- Business Analytics, Data Analytics, and Data Science: An Introduction
- Continuing with BI, ML, and AI
- A Breakdown of our Data Science Infographic
The Field of Data Science – Connecting the Data Science Disciplines
- Applying Traditional Data, Big Data, BI, Traditional Data Science and ML
The Field of Data Science – The Benefits of Each Discipline
- The Reason Behind These Disciplines
The Field of Data Science – Popular Data Science Techniques
- Techniques for Working with Traditional Data
- Real Life Examples of Traditional Data
- Techniques for Working with Big Data
- Real Life Examples of Big Data
- Business Intelligence (BI) Techniques
- Real Life Examples of Business Intelligence (BI)
- Techniques for Working with Traditional Methods
- Real Life Examples of Traditional Methods
- Machine Learning (ML) Techniques
- Types of Machine Learning
- Real Life Examples of Machine Learning (ML)
The Field of Data Science – Popular Data Science Tools
- Necessary Programming Languages and Software Used in Data Science
The Field of Data Science – Careers in Data Science
- Finding the Job – What to Expect and What to Look for
The Field of Data Science – Debunking Common Misconceptions
- Debunking Common Misconceptions
Probability
- The Basic Probability Formula
- Computing Expected Values
- Frequency
- Events and Their Complements
Probability – Combinatorics
- Fundamentals of Combinatorics
- Permutations and How to Use Them
- Simple Operations with Factorials
- Solving Variations with Repetition
- Solving Variations without Repetition
- Solving Combinations
- Symmetry of Combinations
- Solving Combinations with Separate Sample Spaces
- Combinatorics in Real-Life: The Lottery
- A Recap of Combinatorics
- A Practical Example of Combinatorics
Probability – Bayesian Inference
- Sets and Events
- Ways Sets Can Interact
- Intersection of Sets
- Union of Sets
- Mutually Exclusive Sets
- Dependence and Independence of Sets
- The Conditional Probability Formula
- The Law of Total Probability
- The Additive Rule
- The Multiplication Law
- Bayes’ Law
- A Practical Example of Bayesian Inference
Probability – Distributions
- Fundamentals of Probability Distributions
- Types of Probability Distributions
- Characteristics of Discrete Distributions
- Discrete Distributions: The Uniform Distribution
- Discrete Distributions: The Bernoulli Distribution
- Discrete Distributions: The Binomial Distribution
- Discrete Distributions: The Poisson Distribution
- Characteristics of Continuous Distributions
- Continuous Distributions: The Normal Distribution
- Continuous Distributions: The Standard Normal Distribution
- Continuous Distributions: The Students’ T Distribution
- Continuous Distributions: The Chi-Squared Distribution
- Continuous Distributions: The Exponential Distribution
- Continuous Distributions: The Logistic Distribution
- A Practical Example of Probability Distributions
Probability – Probability in Other Fields
- Probability in Finance
- Probability in Statistics
- Probability in Data Science
Statistics
- Population and Sample
Statistics – Descriptive Statistics
- Types of Data
- Levels of Measurement
- Categorical Variables – Visualization Techniques
- Categorical Variables Exercise
- Numerical Variables – Frequency Distribution Table
- Numerical Variables Exercise
- The Histogram
- Histogram Exercise
- Cross Tables and Scatter Plots
- Cross Tables and Scatter Plots Exercise
- Mean, median and mode
- Mean, Median and Mode Exercise
- Skewness
- Skewness Exercise
- Variance
- Variance Exercise
- Standard Deviation and Coefficient of Variation
- Standard Deviation
- Standard Deviation and Coefficient of Variation Exercise
- Covariance
- Covariance Exercise
- Correlation Coefficient
- Correlation
- Correlation Coefficient Exercise
Statistics – Practical Example: Descriptive Statistics
- Practical Example: Descriptive Statistics
- Practical Example: Descriptive Statistics Exercise
Statistics – Inferential Statistics Fundamentals
- Introduction
- What is a Distribution
- What is a Distribution
- The Normal Distribution
- The Standard Normal Distribution
- The Standard Normal Distribution Exercise
- Central Limit Theorem
- Standard error
- Estimators and Estimates
Statistics – Inferential Statistics: Confidence Intervals
- What are Confidence Intervals?
- Confidence Intervals; Population Variance Known; Z-score
- Confidence Intervals; Population Variance Known; Z-score; Exercise
Statistics – Confidence Interval Clarifications
- Student’s T Distribution
- Confidence Intervals; Population Variance Unknown; T-score
- Confidence Intervals; Population Variance Unknown; T-score; Exercise
- Margin of Error
- Confidence intervals. Two means. Dependent samples
- Confidence intervals. Two means. Dependent samples Exercise
- Confidence intervals. Two means. Independent Samples (Part 1)
- Confidence intervals. Two means. Independent Samples (Part 1). Exercise
- Confidence intervals. Two means. Independent Samples (Part 2)
- Confidence intervals. Two means. Independent Samples (Part 2). Exercise
- Confidence intervals. Two means. Independent Samples (Part 3)
Statistics – Practical Example: Inferential Statistics
- Practical Example: Inferential Statistics
- Practical Example: Inferential Statistics Exercise
Statistics – Hypothesis Testing
- Null vs Alternative Hypothesis
- Further Reading on Null and Alternative Hypothesis
- Rejection Region and Significance Level
- Type I Error and Type II Error
- Test for the Mean. Population Variance Known
- Test for the Mean. Population Variance Known Exercise
- p-value
- Test for the Mean. Population Variance Unknown
- Test for the Mean. Population Variance Unknown Exercise
- Test for the Mean. Dependent Samples
- Test for the Mean. Dependent Samples Exercise
- Test for the mean. Independent Samples (Part 1)
- Test for the mean. Independent Samples (Part 1). Exercise
- Test for the mean. Independent Samples (Part 2)
- Test for the mean. Independent Samples (Part 2). Exercise
Statistics – Practical Example: Hypothesis Testing
- Practical Example: Hypothesis Testing
- Practical Example: Hypothesis Testing Exercise
Introduction to Python
- Introduction to Programming
- Why Python?
- Anaconda Details
- Why Jupyter?
- Installing Python and Jupyter
- Understanding Jupyter’s Interface – the Notebook Dashboard
- Prerequisites for Coding in the Jupyter Notebooks
- Jupyter Notebooks
- Jupyter’s Interface
Python – Variables and Data Types
- Variables
- Numbers and Boolean Values in Python
- Python Strings
Python – Basic Python Syntax
- Using Arithmetic Operators in Python
- The Double Equality Sign
- How to Reassign Values
- Add Comments
- Understanding Line Continuation
- Indexing Elements
- Structuring with Indentation
Python – Other Python Operators
- Comparison Operators
- Logical and Identity Operators
Python – Conditional Statements
- The IF Statement
- The ELSE Statement
- A Note on Boolean Values
Python – Python Functions
- Defining a Function in Python
- How to Create a Function with a Parameter
- Defining a Function in Python – Part II
- How to Use a Function within a Function
- Conditional Statements and Functions
- Functions Containing a Few Arguments
- Built-in Functions in Python
- Python Functions
Python – Sequences
- Lists
- Using Methods
- List Slicing
- Tuples
- Dictionaries
Python – Iterations
- For Loops
- While Loops and Incrementing
- Lists with the range() Function
- Conditional Statements and Loops
- Conditional Statements, Functions, and Loops
- How to Iterate over Dictionaries
Python for Data Analysis – NumPy
- Introduction to Numpy
- Numpy Arrays
- Array Indexing
- Numpy Array Indexing
- Numpy Operations
- Numpy Exercises Overview
- Numpy Exercises Solutions
Python for Data Analysis – Pandas
- Introduction to Pandas
- Series
- DataFrames – Part 1
- DataFrames – Part 2
- DataFrames – Part 3
- Missing Data
- Groupby
- Merging Joining and Concatenating
- Operations
- Data Input and Output
Python for Data Analysis – Pandas Exercises
- SF Salaries Exercise Overview
- SF Salaries Solutions
- Ecommerce Purchases Exercise Overview
- Ecommerce Purchases Exercise Solutions
Python for Data Visualization – Matplotlib
- Introduction to Matplotlib
- Matplotlib Part 1
- Matplotlib Part 2
- Matplotlib Part 3
- Matplotlib Exercises Overview
- Matplotlib Exercises – Solutions
Python for Data Visualization – Seaborn
- Introduction to Seaborn
- Distribution Plots
- Categorical Plots
- Matrix Plots
- Grids
- Regression Plots
- Style and Color
- Seaborn Exercise Overview
- Seaborn Exercise Solutions
Python for Data Visualization – Pandas Built-in Data Visualization
- Pandas Built-in Data Visualization
- Pandas Data Visualization Exercise
- Pandas Data Visualization Exercise- Solutions
Python for Data Visualization – Plotly and Cufflinks
- Introduction to Plotly and Cufflinks
- Plotly and Cufflinks
Python for Data Visualization – Geographical Plotting
- Introduction to Geographical Plotting
- Choropleth Maps – Part 1
- Choropleth Maps – Part 2
- Choropleth Exercises
- Choropleth Exercises – Solutions
Big Data and Spark with Python
- Welcome to the Big Data Section!
- Big Data Overview
- Spark Overview
- Local Spark Set-Up
- AWS Account Set-Up
- Quick Note on AWS Security
- EC2 Instance Set-Up
- SSH with Mac or Linux
- PySpark Setup
- Lambda Expressions Review
- Introduction to Spark and Python
- RDD Transformations and Actions
Python – Advanced Python Tools
- Object Oriented Programming
- Modules and Packages
- What is the Standard Library?
- Importing Modules in Python
Advanced Statistical Methods in Python
- Introduction to Regression Analysis
Advanced Statistical Methods – Linear Regression with StatsModels
- The Linear Regression Model
- Correlation vs Regression
- Geometrical Representation of the Linear Regression Model
- Python Packages Installation
- First Regression in Python
- First Regression in Python Exercise
- Using Seaborn for Graphs
- How to Interpret the Regression Table
Decomposition of Variability
- What is the OLS?
- R-Squared
Advanced Statistical Methods – Multiple Linear Regression with StatsModels
- Multiple Linear Regression
- Adjusted R-Squared
- Multiple Linear Regression Exercise
- Test for Significance of the Model (F-Test)
- OLS Assumptions
- A1: Linearity
- A2: No Endogeneity
- A3: Normality and Homoscedasticity
- A4: No Autocorrelation
- A5: No Multicollinearity
- Dealing with Categorical Data – Dummy Variables
- Making Predictions with the Linear Regression
Advanced Statistical Methods – Linear Regression with sklearn
- What is sklearn and how is it Different from Other Packages
- How are we Going to Approach this Section?
- Simple Linear Regression with sklearn
- Simple Linear Regression with sklearn – A StatsModels-like Summary Table
- A Note on Normalization
- Simple Linear Regression with sklearn – Exercise
- Multiple Linear Regression with sklearn
- Calculating the Adjusted R-Squared in sklearn
- Calculating the Adjusted R-Squared in sklearn – Exercise
- Feature Selection (F-regression)
- A Note on Calculation of P-values with sklearn
- Creating a Summary Table with P-values
- Multiple Linear Regression – Exercise
- Feature Scaling (Standardization)
- Feature Selection through Standardization of Weights
- Predicting with the Standardized Coefficients
- Feature Scaling (Standardization) – Exercise
- Underfitting and Overfitting
- Train – Test Split Explained
Advanced Statistical Methods – Practical Example: Linear Regression
- Practical Example: Linear Regression (Part 1)
- Practical Example: Linear Regression (Part 2)
- A Note on Multicollinearity
- Practical Example: Linear Regression (Part 3)
- Dummies and Variance Inflation Factor – Exercise
- Practical Example: Linear Regression (Part 4)
- Dummy Variables – Exercise
- Practical Example: Linear Regression (Part 5)
Linear Regression – Exercise
- Advanced Statistical Methods – Logistic Regression
- Introduction to Logistic Regression
- A Simple Example in Python
- Logistic vs Logit Function
- Building a Logistic Regression
- Building a Logistic Regression – Exercise
- An Invaluable Coding Tip
- Understanding Logistic Regression Tables
- Understanding Logistic Regression Tables – Exercise
- What do the Odds Actually Mean
- Binary Predictors in a Logistic Regression
- Binary Predictors in a Logistic Regression – Exercise
- Calculating the Accuracy of the Model
- Underfitting and Overfitting
- Testing the Model
- Testing the Model – Exercise
Advanced Statistical Methods – Cluster Analysis
- Introduction to Cluster Analysis
- Some Examples of Clusters
- Difference between Classification and Clustering
- Math Prerequisites
Advanced Statistical Methods – K-Means Clustering
- K-Means Clustering
- A Simple Example of Clustering
- A Simple Example of Clustering – Exercise
- Clustering Categorical Data
- Clustering Categorical Data – Exercise
- How to Choose the Number of Clusters
- How to Choose the Number of Clusters – Exercise
- Pros and Cons of K-Means Clustering
- To Standardize or not to Standardize
- Relationship between Clustering and Regression
- Market Segmentation with Cluster Analysis (Part 1)
- Market Segmentation with Cluster Analysis (Part 2)
- How is Clustering Useful?
- EXERCISE: Species Segmentation with Cluster Analysis (Part 1)
- EXERCISE: Species Segmentation with Cluster Analysis (Part 2)
Advanced Statistical Methods – Other Types of Clustering
- Types of Clustering
- Dendrogram
- Heatmaps
Mathematics
- What is a Matrix?
- Scalars and Vectors
- Linear Algebra and Geometry
- Arrays in Python – A Convenient Way to Represent Matrices
- What is a Tensor?
- Addition and Subtraction of Matrices
- Errors when Adding Matrices
- Transpose of a Matrix
- Dot Product
- Dot Product of Matrices
- Why is Linear Algebra Useful?
Deep Learning
- What to Expect from this Part?
Deep Learning – Introduction to Neural Networks
- Introduction to Neural Networks
- Training the Model
- Types of Machine Learning
- The Linear Model (Linear Algebraic Version)
- The Linear Model
- The Linear Model with Multiple Inputs
- The Linear model with Multiple Inputs and Multiple Outputs
- Graphical Representation of Simple Neural Networks
- What is the Objective Function?
- Common Objective Functions: L2-norm Loss
- Common Objective Functions: Cross-Entropy Loss
- Optimization Algorithm: 1-Parameter Gradient Descent
- Optimization Algorithm: n-Parameter Gradient Descent
Deep Learning – How to Build a Neural Network from Scratch with NumPy
- Basic NN Example (Part 1)
- Basic NN Example (Part 2)
- Basic NN Example (Part 3)
- Basic NN Example (Part 4)
- Basic NN Example Exercises
JavaScript
Deep Learning – TensorFlow
- Deep Learning – TensorFlow 2.0: Introduction
- How to Install TensorFlow 2.0
TensorFlow Outline and Comparison with Other Libraries
- TensorFlow 1 vs TensorFlow 2
- A Note on TensorFlow 2 Syntax
- Types of File Formats Supporting TensorFlow
- Outlining the Model with TensorFlow 2
Interpreting the Result and Extracting the Weights and Bias
- Customizing a TensorFlow 2 Model
- Basic NN with TensorFlow: Exercises
Deep Learning – Digging Deeper into NNs: Introducing Deep Neural Networks
- What is a Layer?
- What is a Deep Net?
- Digging into a Deep Net
- Non-Linearities and their Purpose
- Activation Functions
- Activation Functions: Softmax Activation
- Backpropagation
- Backpropagation Picture
- Backpropagation – A Peek into the Mathematics of Optimization
Deep Learning – Overfitting
- What is Overfitting?
- Underfitting and Overfitting for Classification
- What is Validation?
- Training, Validation, and Test Datasets
- N-Fold Cross Validation
- Early Stopping or When to Stop Training
Deep Learning – Initialization
- What is Initialization?
- Types of Simple Initializations
- State-of-the-Art Method – (Xavier) Glorot Initialization
Deep Learning – Digging into Gradient Descent and Learning Rate Schedules
- Stochastic Gradient Descent
- Problems with Gradient Descent
- Momentum
- Learning Rate Schedules, or How to Choose the Optimal Learning Rate
- Learning Rate Schedules Visualized
- Adaptive Learning Rate Schedules (AdaGrad and RMSprop )
- Adam (Adaptive Moment Estimation)
Deep Learning – Preprocessing
- Preprocessing Introduction
- Types of Basic Preprocessing
- Standardization
- Preprocessing Categorical Data
- Binary and One-Hot Encoding
Deep Learning – Classifying on the MNIST Dataset
- MNIST: The Dataset
- MNIST: How to Tackle the MNIST
- MNIST: Importing the Relevant Packages and Loading the Data
- MNIST: Preprocess the Data – Create a Validation Set and Scale It
- MNIST: Preprocess the Data – Scale the Test Data – Exercise
- MNIST: Preprocess the Data – Shuffle and Batch
- MNIST: Preprocess the Data – Shuffle and Batch – Exercise
- MNIST: Outline the Model
- MNIST: Select the Loss and the Optimizer
- MNIST: Learning
- MNIST – Exercises
- MNIST: Testing the Model
Deep Learning – Business Case Example
- Business Case: Exploring the Dataset and Identifying Predictors
- Business Case: Outlining the Solution
- Business Case: Balancing the Dataset
- Business Case: Preprocessing the Data
- Business Case: Preprocessing the Data – Exercise
- Business Case: Load the Preprocessed Data
- Business Case: Load the Preprocessed Data – Exercise
- Business Case: Learning and Interpreting the Result
- Business Case: Setting an Early Stopping Mechanism
- Setting an Early Stopping Mechanism – Exercise
- Business Case: Testing the Model
- Business Case: Final Exercise
Deep Learning – Conclusion
- Summary on What You’ve Learned
- What’s Further out there in terms of Machine Learning
- DeepMind and Deep Learning
- An overview of CNNs
- An Overview of RNNs
- An Overview of non-NN Approaches
Certificates will be awarded to participants at the end of training.
Seats are limited. To confirm your enrollment, pay the course fee @
A/C Name: | TalhaTraining |
A/C No.: | 2141116000973 |
Bank Name: | Prime Bank Limited |
And mail us after paying the course fee.
For registration or information please call or contact any of the following addresses
TalhaTraining
Mobile & WhatsApp 01712742217
Email: training@talhatraining.com or talhatraining@gmail.com
Website: talhatraining.com
Course Features
- Lectures 425
- Quiz 0
- Duration 96 hours
- Skill level All levels
- Language English
- Students 21
- Certificate Yes
- Assessments Yes