Skip to content

open-data-kazakhstan/education-mcda-kz

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Education in Kazakhstan

Description

This dataset provides various educational and socio-economic indicators across the regions of Kazakhstan. The main data is stored in data/data_norm.csv, with columns covering metrics such as:

  • Internal and External R&D Expenses: This column includes expenses related to research and development within and outside each region.
  • Innovation Activity Indicators: Shows regional data on enterprise innovation, providing insights into each region's activity level.
  • Financial Operations in Educational Organizations: Includes revenue and expenditure data for educational institutions.
  • Preschool Availability: Indicates the number of children per 100 places in preschool organizations.
  • Technical and Vocational Education Organizations: Represents the number of such institutions per region.
  • General Education Teachers: Contains teacher counts in general education schools.

Some data in the dataset has been manually adjusted due to the recent reorganization of regions in Kazakhstan. These adjustments include proportional estimations or synthetic data additions to maintain dataset consistency.

Installation

To set up this project:

  1. Clone the repository:

    git clone https://github.com/open-data-kazakhstan/education-kz.git
  2. Create and activate a virtual environment:

    python -m venv /path/to/localrepo
    cd /path/to/localrepo
    Scripts/activate  # For Windows users
  3. Install dependencies:

    pip install -r requirements.txt
  4. Generate or update the datapackage.json file:

    python scripts/package.py

Data

Data sources for the educational indicators include Kazakhstan National Bureau of Statistics, containing raw and normalized versions:

  • data/data.csv: Raw data with various educational indicators for Kazakhstan's regions, using a semicolon (;) delimiter.
  • data/data_normalized.csv: Normalized and cleaned data with adjustments for recent regional changes.
  • archive/innovations.xls: Source file for innovation activity indicators.
  • archive/research_and_development_works.xls: R&D expenses by region.
  • archive/revenues_and_expenditures.xls: Financial data for education institutions, including revenues and expenditures by region.

Scripts

  • package.py: Generates or updates the datapackage.json file, capturing metadata about the dataset for easier sharing and usage.
  • transform.py: Transforms the raw data files from their original format (Excel or other) into the structured CSV format data_normalized.csv for analysis.
  • transform.ipynb: Jupyter notebook version of the transform.py script with additional information on choosing weights for the score calculation.

License

This dataset is available under the Open Data Commons Public Domain Dedication and License (PDDL).

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published