99402 89059 | 99402 19052 | 90030 15832
pandian@accordedu.com

For Fees Details Enquiry Here


GET NOTICED BY TOP HIRING COMPANIES WITH ACCORD JOB ASSIST PROGRAM


Data Science and Machine Learning Course Training Syllabus

Statistics
Statistics and Probability Basics

Introduction to Statistics and Probability - Importance of statistics in data science - Definitions: Population vs Sample - Types of data (Qualitative vs Quantitative) - Levels of measurement (Nominal, Ordinal, Interval, Ratio) - Introduction to Probability (Probability theory, axioms of probability) - Probability Distributions - Probability Mass Function (PMF) vs Probability Density Function (PDF) - Cumulative Distribution Function (CDF) - Common probability distributions - Binomial distribution - Poisson distribution - Uniform distribution - Exponential distribution - Normal distribution (Gaussian) - Basic Probability Concepts - Conditional probability - Bayes' theorem and applications- Independent and dependent events

Standard Deviation and Central Tendency

Measures of Central Tendency - Measures of Dispersion - Standard Deviation - Calculating mean, median, mode using `numpy`, `scipy`, and `statistics` libraries - Standard deviation and variance calculation using `numpy`- Plotting and visualizing normal distributions using `matplotlib` and `seaborn`- Calculating probabilities for normal distribution using `scipy.stats.norm`

Z-Scores and Bias-Variance Trade-off

What is a Z-score - Z-scores for standard normal distribution - Bias-Variance Trade-off-Bias - Definition and examples (underfitting) - Variance Definition and examples (overfitting) - Z-score calculations

Distance Metrics and Outlier Analysis

Distance Metrics - Euclidean - Manhattan Distance - Cosine Similarity - Minkowski Distance - Outlier Analysis - Causes and effects of outliers in data - Calculating distance metrics using scipy

Missing Value Treatment

Types of missing data (Missing Completely at Random (MCAR), Missing at Random (MAR), Missing Not at Random (MNAR)) - Techniques for Handling Missing Values - Identifying missing values using `pandas`- Imputation techniques using `SimpleImputer` from `sklearn`- Visualizing missing data patterns

Correlation

Definition and types of correlation (Positive, Negative, No correlation) - Pearson correlation coefficient - Spearman's rank correlation – Covariance - Covariance

Python Libraries
Introduction to Python Libraries and Installation

What are Python Libraries - Purpose and benefits of using libraries in data science - Overview of popular data science libraries (NumPy, Pandas, Matplotlib, Scikit-learn, etc.) Installing Python Libraries - Introduction to `pip` - Installing libraries using `pip - Popular IDEs and Tools for Data Science - Overview of Jupyter Notebook, VSCode, PyCharm

NumPy Library

Introduction to NumPy - Importance of NumPy for numerical computing - Understanding NumPy Arrays (ndarrays) vs Python Lists - Installation of NumPy (`pip install numpy`) - Basic Operations in NumPy - Creating NumPy arrays (`array()`, `arange()`, `linspace()`) - Understanding shape, dimensions, and data types of arrays - Indexing and slicing NumPy arrays - Reshaping arrays, Transposing arrays, Flattening arrays. - NumPy Mathematical Functions - Element-wise operations on arrays - Aggregate functions (`sum()`, `mean()`, `median()`, `std()`, `var()`) - Matrix operation

Pandas Library

Introduction to Pandas - Importance of Pandas for data manipulation - Pandas structures: Series and DataFrames - Installation of Pandas (`pip install pandas`) - Working with Pandas Series - Creating a Series from lists, NumPy arrays, and dictionaries - Working with Pandas DataFrames - Creating DataFrames from dictionaries, lists, NumPy arrays, CSV files, etc - Reading and writing data to/from files (CSV, Excel, JSON) - Accessing data from DataFrames (`loc[]`, `iloc[]`) - Basic DataFrame operations - Renaming columns, Adding and removing columns/rows - Sorting data, Filtering data, Handling duplicates - Pandas Data Cleaning and Manipulation - Handling missing data (`isnull()`, `dropna()`, `fillna()`)- Changing data types of columns- Applying functions to columns (`apply()`, `map()`)- Grouping data and aggregation (`groupby()`, `agg()` - Merging, joining, and concatenating DataFrames (`merge()`, `concat()`, `join()`) - Pivot tables and cross- tabulations - Reshaping DataFrames using `melt()`, `stack()`, `unstack()`- Handling large datasets (chunking, memory optimization)

Matplotlib Library for Data Visualization

Introduction to Matplotlib - Importance of data visualization in data science - Basic architecture of Matplotlib (Figure, Axes, Subplots) - Installing Matplotlib - Basic Plots using Matplotlib - Line plots - Bar plots and horizontal bar plots - Scatter plots - Histograms - Pie charts - Customizing Plots - Adding titles, labels, legends, gridlines - Changing plot styles- Setting axis limits, ticks, and scales (log scale, etc.) - Subplots and multiple plots on the same figure - Creating stacked plots and bar charts - Contour plots and heatmaps - Working with dates on x-axis - 3D plotting using `mpl_toolkits.mplot3d`

SQL
Introduction and Installation

Introduction to Databases - What is a Database? Importance of databases in applications - Introduction to Database Management Systems (DBMS) - Types of DBMS: Relational, NoSQL, Object-Oriented, etc. - Overview of SQL and its role in relational databases (RDBMS

Introduction and Installation

Introduction to SQL Server - What is SQL Server? Overview of RDBMS concepts - Overview of different SQL Server Editions (Express, Standard, Enterprise) - SQL Server Management Studio (SSMS) Introduction and Setup. - Installing and using SQL Server Management Studio (SSMS) for database management

DDL (Data Definition Language)

Creating and Modifying Databases – Creating and Modifying Tables - Table Constraints - - Primary Keys (`PRIMARY KEY` constraint) - Foreign Keys (`FOREIGN KEY` constraint) - Unique Constraints (`UNIQUE` constraint) - Default Values (`DEFAULT` constraint) - Check Constraints (`CHECK` constraint). - Operators, Data Types, and Type Conversion - Arithmetic Operators - Comparison Operators - Logical Operators – BETWEEN, IN, LIKE for pattern matching - IS NULL and IS NOT NULL operators. - SQL Server Data Types - Type Conversion

DML (Data Manipulation Language

Inserting Data - Updating Data - Deleting Data

DQL (Data Query Language)

Basic Querying with `SELECT - Filtering Data with `WHERE - Sorting Results Using `ORDER BY` to sort records in ascending (`ASC`) and descending (`DESC`) order. - Conditional Logic with `CASE`- Using Aggregate Functions - Group By, Having, - Difference between `WHERE` and `HAVING`- Using `HAVING` with aggregate functions.

Predefined Functions (String, Date/Time Functions

Common string functions: `LEN()`, `SUBSTRING()`, `CHARINDEX()`, `UPPER()`, `LOWER()`, `REPLACE()`, `LEFT()`, `RIGHT()` - Concatenating strings with `+` or `CONCAT()`. Date/Time Functions in SQL Server - Getting the current date/time (`GETDATE()`, `SYSDATETIME()`) - Adding/subtracting dates (`DATEADD()`) - Finding the difference between dates (`DATEDIFF()`) - Extracting parts of a date (`YEAR()`, `MONTH()`, `DAY()`)

Joins in SQL Server

Introduction to Joins - What are joins? Why do we use them - Types of Joins - INNER JOIN, LEFT JOIN, RIGHT JOIN, FULL OUTER JOIN, CROSS JOIN, SELF JOIN

Transaction Control and Permission

Understanding Transactions - `BEGIN TRANSACTION` - `COMMIT TRANSACTION`- Committing the transaction - `ROLLBACK TRANSACTION` - `SAVEPOINT` - SQL Server Privileges and Subqueries - Granting Permissions - Revoking permissions - Role-based permissions in SQL Server.

Subqueries and Index in SQL Server

Using subqueries in `SELECT`, `WHERE`, `FROM`, and `HAVING` clauses - Correlated Subqueries vs Non-Correlated Subqueries - Subqueries in `JOIN` conditions - Indexes - Creating indexes - Removing indexes - Clustered vs Non-Clustered Indexes. - Impact of indexes on performance (index maintenance, over-indexing).

Database Objects

Views – Creating Views - Modifying views – Removing Views - Using views for complex query simplification - Stored Procedures - Creating stored procedures - Executing stored procedures (`EXEC`) - Input/output parameters in stored procedures - Error handling in stored procedures with `TRY...CATCH – Triggers - Creating triggers `INSERT`, `UPDATE`, `DELETE` operations - BEFORE and AFTER triggers.

Regular Expressions in SQL Server

Introduction to Regular Expression - Using `PATINDEX()` and `LIKE` for Pattern Matching - Simple pattern matching with `LIKE`- Finding patterns using `PATINDEX()`- Complex pattern matching using wildcards (`%`, `_`) - Combining Regular Expressions with String Functions

Machine Learning
Introduction to Machine Learning

Overview of Machine Learning - Types of Machine Learning: Supervised, Unsupervised, Reinforcement Learning - Applications of Machine Learning

Supervised Learning

Classification and Regression
- Classification vs. Regression - Differences and similarities –
Regression Algorithms
    Linear Regression
  • Simple Linear Regression
  • Multiple Linear Regression
  • Implementation using Python (NumPy, scikit-learn)
  • Non-Linear Regression
  • Polynomial Regression
  • Implementation using Python
  • Regression Trees
  • Decision Trees for Regression
  • Implementation using Python (scikit-learn)
Classification Algorithms
    Logistic Regression
  • Binary and Multiclass Logistic Regression
  • Implementation using Python (scikit-learn)
  • Decision Trees
  • Understanding Decision Trees
  • Implementation using Python (scikit-learn)
    Random Forest
  • Ensemble Methods
  • Implementation using Python (scikit-learn)
  • Support Vector Machines (SVM)
  • Linear and Non-Linear SVM
  • Implementation using Python (scikit-learn)
3. Unsupervised Learning
Clustering
    K-Means Clustering
  • Algorithm and Applications
  • Implementation using Python (scikit-learn)
  • Hierarchical Clustering
  • Algorithm and Applications
  • Implementation using Python (scikit-learn)
  • Association
    Apriori Algorithm
  • Market Basket Analysis
  • Implementation using Python (mlxtend)
4. Reinforcement Learning
    Introduction to Reinforcement Learning
  • Key Concepts: Agent, Environment, Reward, Policy
  • Types of Reinforcement Learning
  • Model-Based vs. Model-Free
  • Value-Based vs. Policy-Based
  • Q-Learning
  • Algorithm and Applications
  • Implementation using Python (OpenAI Gym, TensorFlow)
5. Project
    Project Planning and Design
  • Problem Definition
  • Data Collection and Preprocessing
  • Model Building and Evaluation
  • Selecting the appropriate algorithms
  • Training and Testing the Model
  • Model Evaluation Metrics
  • Deployment
  • Model Deployment Techniques