Table of Contents
Getting Started
Theoretical Overview
Data Management
- Data Management Overview
- Data Management Techniques
- Path directories
- Adding Unique Identifiers
- Trailing Period Removal
- Standardized Dates
- DataFrame Analysis
- Generating Summary Tables for Variable Combinations
- Saving DataFrames to Excel with Customized Formatting
- Creating Contingency Tables
- Highlighting Specific Columns in a DataFrame
- Binning Numerical Columns
Plotting Heuristics
- Creating Effective Visualizations
- KDE and Histogram Distribution Plots
- Feature Scaling and Outliers
- Stacked Crosstab Plots
- Box and Violin Plots
- Scatter Plots and Best Fit Lines
- Correlation Matrices
- Partial Dependence Plots
About EDA Toolkit
- ASCII Art
- Acknowledgements
- Contributors/Maintainers
- Citing EDA Toolkit
- Changelog
- Version 0.0.15
- Version 0.0.14
- Version 0.0.13
- Add
ValueError
for Insufficient Pool Size inadd_ids
and Enhance ID Deduplication - Enhance
strip_trailing_period
to Support Strings and Mixed Data Types - Changes in
stacked_crosstab_plot
- Add Environment Detection to
dataframe_columns
Function - Add
tqdm
Progress Bar todataframe_columns
Function - Other Enhancements and Fixes
- Add
- Version 0.0.12
- Version 0.0.11
- Version 0.0.10
- Version 0.0.9
- Version 0.0.8
- Version 0.0.8c
- Version 0.0.8b
- Version 0.0.8a
- Version 0.0.7
- Version 0.0.6
- Version 0.0.5
- Version 0.0.4
- Version 0.0.3
- Version 0.0.2
- Version 0.0.1rc0
- Version 0.0.1b0
- Version 0.0.1b0
- Version 0.0.1b0
- Version 0.0.1b0
- References