A Python-based ETL solution for automating employee timesheet analysis and project status reporting from Excel data sources.
EXTRACTOR is a robust data processing tool that transforms raw employee logs and project data into actionable insights. Built with Python, Pandas, and Tkinter, this application features:
- Automated Excel data extraction and transformation
- Intelligent status tracking (Approved/Missing/Rejected)
- GUI-powered workflow for non-technical users
- Executable packaging for Windows environments
- Data Extraction: Processes Cisco-generated RDMS exports and employee lists
- Smart Transformation:
- Name standardization and title removal
- Dynamic week-of-year calculations
- Multi-dimensional status categorization
- Visual Reporting:
- Color-coded Excel output
- Auto-adjusted column widths
- Merged project grouping
- Desktop GUI:
- File picker interface
- Sheet selection dropdowns
- Progress feedback
- Core:
Python 3.9+
- Data Processing:
Pandas
,XlsxWriter
,NumPy
- GUI:
Tkinter
,PIL (Python Imaging Library)
- Packaging:
PyInstaller
pip install pandas xlsxwriter numpy pillow
- Select source Excel files:
- Main dataset (GTE export)
- Employee list
- Choose output path
- Click "Démarrer" to generate the analysis report
Employee logs (header row 14)
Project Number | Employee Name | Hours | Status
- Pivot table with Y/W columns
- Status colors
- Merged project groups
pyinstaller --onefile --windowed --noconsole \
--add-data "logo.png:." \
--add-data "fav.ico:." \
--icon=fav.ico app.py
Find complete technical specifications and workflow diagrams here:
EXTRACTOR Documentation