Description
Data Science Course Content
Introduction to Data Science, importance of Data
Science, statistical and analytical methods, deploying Data Science for
Business Intelligence, transforming data, machine learning and
introduction to Recommender systems.
Reasons to Use Data Science – Project Life cycle
Reasons to Use Data Science – Project Life cycle
How Data Science solves real world problems, Data
Science Project Life Cycle, principles of Data Science, introduction to
various BI and Analytical tools, data collection, introduction to
statistical packages, data visualization tools, R Programming,
predictive modelling, machine learning, artificial intelligence and
statistical analysis.
Data Conversion
Data Conversion
Converting data into useful information, Collecting
the data, Understand the data, Finding useful information in the data,
Interpreting the data, Visualizing the data
Terms of Statistics
Terms of Statistics
Descriptive statistics, Let us understand some terms in statistics, Variable
Plots
Plots
Dot Plots, Histogram, Stemplots, Box and whisker plots, Outlier detection from box plots and Box and whisker plots
Set & rules of probability, Bayes Theorem
Set & rules of probability, Bayes Theorem
What is probability?, Set & rules of probability, Bayes Theorem
Distributions
Distributions
Probability Distributions, Few Examples, Student T-
Distribution, Sampling Distribution, Student t- Distribution, Poison
distribution
Sampling
Sampling
Stratified Sampling, Proportionate Sampling, Systematic Sampling, P – Value, Stratified Sampling
Tables & Analysis
Tables & Analysis
Cross Tables, Bivariate Analysis, Multi variate
Analysis, Dependence and Independence tests ( Chi-Square ), Analysis of
Variance, Correlation between Nominal variables
Acquiring Data
Boxplot in R programming, understanding distribution and percentile, identifying outliers, Rstudio Tool, various types of distribution like Normal, Uniform and Skewed.
Machine Learning in Data Science
Deploying machine learning for data analysis, solving business problems, using algorithms for searching patterns in data, relationship between variables, multivariate analysis, interpreting correlation, negative correlation.
Deep dive into Data Transformation & Apache Mahout
Data Transformation key phases Data Mapping and Code Generation, Data Processing operation, data patterns, data sampling, sampling distribution, normal and continuous variable, data extrapolation, regression, linear regression model.
Data Testing and Assessment
Data analysis, hypothesis testing, simple linear regression, Chi-square for assessing compatibility between theoretical and observed data, implementing data testing on data warehouse, validating data, checking for accuracy, data operational monitoring capabilities.
Data Model, Algorithms & Prediction
Various techniques of data modelling and generating algorithms, methods of business prediction, prediction approaches, data sampling, disproportionate sampling, data modelling rules, data iteration, and deploying data for mission-critical applications.
Data Segmentation and Analysis
Acquiring Data
Boxplot in R programming, understanding distribution and percentile, identifying outliers, Rstudio Tool, various types of distribution like Normal, Uniform and Skewed.
Machine Learning in Data Science
Deploying machine learning for data analysis, solving business problems, using algorithms for searching patterns in data, relationship between variables, multivariate analysis, interpreting correlation, negative correlation.
Deep dive into Data Transformation & Apache Mahout
Data Transformation key phases Data Mapping and Code Generation, Data Processing operation, data patterns, data sampling, sampling distribution, normal and continuous variable, data extrapolation, regression, linear regression model.
Data Testing and Assessment
Data analysis, hypothesis testing, simple linear regression, Chi-square for assessing compatibility between theoretical and observed data, implementing data testing on data warehouse, validating data, checking for accuracy, data operational monitoring capabilities.
Data Model, Algorithms & Prediction
Various techniques of data modelling and generating algorithms, methods of business prediction, prediction approaches, data sampling, disproportionate sampling, data modelling rules, data iteration, and deploying data for mission-critical applications.
Data Segmentation and Analysis