A structured, notebook-first repository for learning Python and applying it to data science through hands-on practice, projects, and real datasets.
- Overview
- What You Will Learn
- Repository Structure
- Learning Sequence
- Datasets
- Quick Start
- Tools and Technologies
- Contributing
- License
This repository is designed as a complete Python-for-data-science learning track. It begins with core Python concepts and gradually progresses to data analysis and visualization using NumPy, Pandas, Matplotlib, and Seaborn.
- Python fundamentals and syntax
- Intermediate and advanced Python concepts
- Object-oriented programming (OOP)
- Exception and file handling
- NumPy fundamentals and advanced operations
- Data analysis with Pandas
- Data visualization with Matplotlib and Seaborn
- Practice exercises and mini projects
| Folder | Focus Area |
|---|---|
Fundamentals of Python/ |
Core Python basics, control flow, data structures, functions |
Advance Python/ |
Decorators, scope, and advanced language concepts |
Exception Handling in Python / |
Error handling patterns and custom exceptions |
File Handling in Python/ |
File I/O, binary files, and serialization |
OOPS in Python/ |
Classes, inheritance, polymorphism, abstraction, encapsulation |
Python fundamental Questions/ |
Practice questions for core and intermediate topics |
My projects/ |
Mini projects for applied learning |
NumPy For Python / |
NumPy fundamentals and array operations |
NumPy For Practice/ |
NumPy practice notebooks |
NumPy Advance/ |
Broadcasting, advanced indexing, utilities |
Matplotlib in Python/ |
Plotting and charting exercises |
Seaborn in Python/ |
Statistical and relational visualizations |
Pandas in Python/ |
Series, DataFrames, DateTime operations, and practice |
Database from Kaggle/ |
Dataset collection used across notebooks |
Exploratory Data Analysis (EDA)/ |
EDA-focused notebook work |
Note: Some folder names include spaces, and a few intentionally include trailing spaces (for example,
Exception Handling in Python /andNumPy For Python /) to preserve the repository's original notebook and dataset paths. Copy paths carefully when using terminal commands.
Recommended study order:
- Fundamentals of Python
- Advance Python
- Exception Handling in Python
- File Handling in Python
- OOPS in Python
- Python fundamental Questions
- My projects
- NumPy For Python
- NumPy For Practice
- NumPy Advance
- Matplotlib in Python
- Seaborn in Python
- Pandas in Python
- Exploratory Data Analysis (EDA)
The Database from Kaggle/ directory contains datasets used throughout NumPy, Pandas, and visualization notebooks, including:
- IPL match and delivery datasets
- Movie and IMDb datasets
- Student performance and expense datasets
- Additional CSV/Excel files in
Database from Kaggle/datasets /(note the intentional trailing space afterdatasets).
git clone https://github.com/aayushmanz/Python-For-Data-Science.git
cd Python-For-Data-Science
python -m venv .venv
source .venv/bin/activate
pip install jupyter numpy pandas matplotlib seaborn
jupyter notebookFor Windows PowerShell:
.venv\Scripts\Activate.ps1- Python 3
- Jupyter Notebook
- NumPy
- Pandas
- Matplotlib
- Seaborn
- Git and GitHub
Contributions are welcome.
- Fork the repository
- Create a feature branch
- Make and commit your changes
- Open a pull request with a clear summary
This project is available under the MIT License.
Maintained by Ayush Suthar.