Data Preprocessing
- Data Cleaning
- Removing duplicates
- Handling missing values
- Data Transformation
- Scaling
- Encoding
- Data integration (from multiple sources)
- Joining (combine rows from multiple tables based on an index)
- Merging (based on various columns)
- Data Reduction
- Sampling
- Dimensionality Reduction