Statistics for Data Science

The topics covered in this course are summarized below. This is a 25-hour course, and completing it within 3 weeks is suggested.

Introduction to Statistics for Data Scientists

Introduction to basic statistics terms
Types of statistics
Types of data
Levels of measurement (nominal, ordinal, and interval/ratio)
Measures of central tendency
Measures of dispersion
Random variables
Concept of sets
Skewness and kurtosis
Covariance and correlation
Data Visualization
Data summarization methods
Tables, graphs, charts, histograms
Frequency distributions
Box Plot
Chebyshev’s inequality (how the mean and standard deviation bound the spread of the data)
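
As a rough illustration of several topics in this list (central tendency, dispersion, and Chebyshev's inequality), here is a minimal sketch that uses only Python's standard `statistics` module. The data values and the helper names `chebyshev_bound` and `share_within` are hypothetical and chosen only for this example. They are not part of the course material.

```python
import statistics

# Hypothetical sample data, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Measures of central tendency.
mean = statistics.mean(data)
median = statistics.median(data)
mode = statistics.mode(data)

# Measure of dispersion: population standard deviation.
std = statistics.pstdev(data)


def chebyshev_bound(k):
    """Chebyshev lower bound on the share of values within k standard deviations."""
    return 1 - 1 / k**2


def share_within(values, k):
    """Observed share of values within k population standard deviations of the mean."""
    m = statistics.mean(values)
    s = statistics.pstdev(values)
    return sum(abs(x - m) <= k * s for x in values) / len(values)


k = 2
print("mean:", mean, "median:", median, "mode:", mode, "std:", std)
print("observed share within", k, "std:", share_within(data, k))
print("Chebyshev lower bound:", chebyshev_bound(k))
```

Chebyshev's inequality holds for any distribution with a finite variance, so the observed share should never fall below the bound. For bell-shaped data the bound is usually far from tight.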

Descriptive & Inferential Statistics for Data Scientists

Types of probability distributions – discrete vs. continuous distributions
Cumulative probabilities, normal and standard normal distributions
Discrete Distributions
Binomial Distribution
Poisson Distribution
Continuous Distributions
Uniform Distribution
Normal Distribution
Standard Normal Distribution
Exponential Distribution
Sampling methods
Interval Estimation
Central limit theorem – sampling, sampling distribution, properties of sampling distribution, central limit theorem, estimating mean using CLT
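
The central limit theorem item above can be explored with a small simulation. The sketch below assumes an exponential population with rate 1 (so the population mean and standard deviation are both 1), draws repeated samples, and compares the spread of the sample means with the standard error predicted by the theorem. The constants and the `sample_mean` helper are hypothetical choices for illustration.

```python
import math
import random
import statistics

random.seed(42)  # fixed seed so the run is repeatable

RATE = 1.0          # exponential rate; population mean and std are both 1 / RATE
SAMPLE_SIZE = 50    # observations per sample
NUM_SAMPLES = 2000  # number of repeated samples


def sample_mean(n):
    """Mean of n independent draws from the exponential population."""
    return statistics.fmean(random.expovariate(RATE) for _ in range(n))


# Empirical sampling distribution of the mean.
sample_means = [sample_mean(SAMPLE_SIZE) for _ in range(NUM_SAMPLES)]

observed_center = statistics.fmean(sample_means)
observed_spread = statistics.stdev(sample_means)

# CLT prediction: the standard error is sigma / sqrt(n).
expected_spread = (1 / RATE) / math.sqrt(SAMPLE_SIZE)

print("mean of sample means:", observed_center)
print("std of sample means:", observed_spread)
print("predicted standard error:", expected_spread)
```

Although the exponential population is strongly skewed, the sample means cluster around the population mean with a spread close to the predicted standard error.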

Hypothesis Testing for Data Scientists

Concepts of hypothesis testing – business relevance, framing hypotheses, hypothesis testing process and p-value
Types of hypothesis tests – left- and right-tailed tests, two-tailed tests, types of errors, hypothesis testing using T-distribution
Industry demos on hypothesis testing (Excel) – two-sample mean test, two-sample proportion test, A/B testing
Z-test, standard normal distribution
T-test, t-statistic, Student's t-distribution
T-statistic vs. z-statistic
Type I and Type II errors
Bayesian statistics (Bayes' theorem)
Confidence interval (CI), margin of error
Interpreting confidence levels and confidence intervals
Chi-square test
Chi-square distribution using Python
Chi-square for goodness of fit test
When to use which statistical distribution?
Analysis of variance (ANOVA)
Assumptions to use ANOVA
Three types of ANOVA
Partitioning of variance in the ANOVA
Calculating ANOVA using Python
F-distribution
F-test (variance ratio test)
Determining the values of F
F-distribution using Python
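
Two of the ideas in this list, the z-test with its confidence interval and the partitioning of variance in a one-way ANOVA, can be sketched with the Python standard library alone. The sample values, the assumed known population standard deviation `sigma`, and the function names are hypothetical. This is one possible way to compute the quantities and may differ from the approach used in the course.

```python
import math
import statistics
from statistics import NormalDist


def one_sample_z_test(sample, mu0, sigma):
    """Two-tailed z-test of H0: mean == mu0, assuming sigma is known."""
    standard_error = sigma / math.sqrt(len(sample))
    z = (statistics.fmean(sample) - mu0) / standard_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


def confidence_interval(sample, sigma, level=0.95):
    """Confidence interval for the mean: sample mean +/- margin of error."""
    z_crit = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z_crit * sigma / math.sqrt(len(sample))
    center = statistics.fmean(sample)
    return center - margin, center + margin


def one_way_anova_f(groups):
    """Partition the total sum of squares and return the F statistic."""
    all_values = [x for g in groups for x in g]
    grand_mean = statistics.fmean(all_values)
    ss_between = sum(len(g) * (statistics.fmean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - statistics.fmean(g)) ** 2 for g in groups for x in g)
    ss_total = sum((x - grand_mean) ** 2 for x in all_values)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, ss_between, ss_within, ss_total


# Hypothetical data, for illustration only.
sample = [52, 48, 55, 51, 49, 53, 50, 54]
z, p_value = one_sample_z_test(sample, mu0=50, sigma=3)
low, high = confidence_interval(sample, sigma=3)
print("z:", z, "p-value:", p_value, "95% CI:", (low, high))

groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, ss_between, ss_within, ss_total = one_way_anova_f(groups)
print("F:", f_stat, "SS between:", ss_between, "SS within:", ss_within)
```

The between-group and within-group sums of squares add up to the total sum of squares, which is the partitioning that ANOVA relies on. Turning the F statistic into a p-value requires the F-distribution, which the standard library does not provide. A statistics package such as SciPy is typically used for that step.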

Project & Resources

Resources for practice
A final assignment

Dr. Hari Thapliyaal

Dr. Hari Thapliyal is a seasoned professional and prolific blogger with a multifaceted background that spans the realms of Data Science, Project Management, and Advait-Vedanta Philosophy. Holding a Doctorate in AI/NLP from SSBM (Geneva, Switzerland), Hari has earned Master's degrees in Computers, Business Management, Data Science, and Economics, reflecting his dedication to continuous learning and a diverse skill set. With over three decades of experience in management and leadership, Hari has proven expertise in training, consulting, and coaching within the technology sector. His extensive 16+ years in all phases of software product development are complemented by a decade-long focus on course design, training, coaching, and consulting in Project Management. In the dynamic field of Data Science, Hari stands out with more than three years of hands-on experience in software development, training course development, training, and mentoring professionals. His areas of specialization include Data Science, AI, Computer Vision, NLP, complex machine learning algorithms, statistical modeling, pattern identification, and extraction of valuable insights. Hari's professional journey showcases his diverse experience in planning and executing multiple types of projects. He excels in driving stakeholders to identify and resolve business problems, consistently delivering excellent results. Beyond the professional sphere, Hari finds solace in long meditation, often seeking secluded places or immersing himself in the embrace of nature.
