HilbertSeriesML

Scripts to generate and analyse HS data.

Data files are in the Data folder. They provide datasets of (fake) Hilbert series (HS) coefficients, as well as datasets of the equivalent HS function parameters used in the machine learning (ML) investigations.
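
As an illustration of how HS coefficients relate to HS function parameters, here is a minimal sketch in Python. It assumes the HS is a rational function P(t) / prod_i (1 - t^a_i), with P given by its list of coefficients and the a_i positive integer weights. The exact parameterisation used in the datasets is not stated here, and the function name `hilbert_series_coefficients` is hypothetical (it is not taken from the repository's scripts or notebooks).

```python
def hilbert_series_coefficients(numerator, weights, order):
    """Return the Taylor coefficients c_0..c_order of
    P(t) / prod_i (1 - t**a_i), with P given by its coefficient list.

    Assumption for illustration: every weight a_i is a positive integer.
    """
    # Start from the numerator, truncated or zero-padded to order + 1 terms.
    coeffs = [0] * (order + 1)
    for k, p in enumerate(numerator[: order + 1]):
        coeffs[k] = p
    # Dividing a power series by (1 - t**a) is the recurrence
    # c[k] += c[k - a], applied in increasing k, once per factor.
    for a in weights:
        for k in range(a, order + 1):
            coeffs[k] += coeffs[k - a]
    return coeffs


# Example: P(t) = 1 with weights (1, 1) gives 1/(1 - t)**2 = sum (k + 1) t**k.
example = hilbert_series_coefficients([1], [1, 1], 5)
print(example)  # [1, 2, 3, 4, 5, 6]
```

The recurrence uses integer arithmetic only, so no symbolic expansion is needed for this form.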

Code-scripts are listed in the Scripts folder:

DataGeneration.nb - Mathematica notebook to generate the datasets listed in the Data folder (Mathematica was an order of magnitude faster at computing Taylor expansions than Python's SymPy). Run each section to generate the data for the corresponding investigation, as described. Note also that the BC-realHS.txt dataset is a list of HS coefficients for 3-dimensional Fano varieties taken from the GRDB.
ML_Regressor.py - Python script to ML the embedding weights of fake-generated HS in refined form from their HS expansion coefficients of orders 0-100 or 1000-1009, as described in section III.B. Set the 'HS_low_check' variable to '1' or '0' to use the lower- or higher-order coefficients respectively, then run the cells sequentially.
ML_Classifier.py - Python script to ML the Gorenstein index or dimension of fake-generated HS in Palindromic-refined form from their HS expansion coefficients of orders 0-100 or 1000-1009, as described in section III.B. Set the 'Param_check' variable to '1' or '0' to ML the Gorenstein index or the dimension respectively; then set the 'HS_low_check' variable to '1' or '0' to use the lower- or higher-order coefficients respectively, then run the cells sequentially.
ML_BinaryClassification.py - Python script to use ML to distinguish between the refined and Palindromic-refined forms of fake-generated HS, or to distinguish HS coefficients of fake-generated HS from those of real HS from the GRDB, as described in sections III.C and III.D respectively. Set the 'Palin_check' variable to '1' to classify the HS function form, or to '0' to distinguish fake from real HS; then set the 'HS_low_check' variable to '1' or '0' to use the lower- or higher-order coefficients respectively, then run the cells sequentially.
DataGeneration_CI.nb - Mathematica notebook to generate the data used in the complete intersection (CI) investigation of section III.E. The notebook has two subsections, depending on whether the generated HS have palindromic numerators or not. Within each subsection, run the cells sequentially to define the generation functions, then to generate and export the data (the labelled subsubsections determine whether numerator coefficients or Taylor expansion coefficients are saved).
ML_CompleteIntersection.py - Python script to ML whether a HS corresponds to a complete intersection (CI) or not, from either numerator coefficients or Taylor expansion coefficients, as described in section III.E. Please first unzip the compressed CI.db datafile in the Data folder before running. Run the script's first cell to import the libraries used, then run sequentially the cells of the chosen investigation, then run the script's final cell to repeat the investigation with 5-fold cross-validation if desired.
ML_Complete_Intersection/ - Directory containing Python scripts to ML whether a fake HS corresponds to a complete intersection (CI) or not. The input is Data/ci_big.db, which contains lists of successive quotients of Taylor expansion coefficients. Further details about this investigation can be found in section III.E of the paper. Please first unzip the compressed Data/ci_big.db file before running the scripts. Run ml_ci_rf.py for the principal component analysis + random forest experiment, and ml_ci_nn.py for the principal component analysis + neural network experiment. A list of parameters at the beginning of each script facilitates fine-tuning of the machine learning investigations.
PCAs.py - Python script to perform principal component analysis of the binary classification problems. Run the script's first cell to import the libraries used, then the second cell to import the data, ensuring the filepaths are correct. The later cells then create and fit the scalers and PCAs.
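
Several of the scripts above choose between coefficient orders 0-100 and 1000-1009 via 'HS_low_check', and the ci_big.db data is described as successive quotients of Taylor expansion coefficients. The sketch below shows one way these two preprocessing steps could look. It is an illustration only: the helper names `select_orders` and `successive_quotients` are hypothetical, both order ranges are assumed to be inclusive, and the quotient is assumed to be c_(k+1) / c_k. The scripts themselves may do this differently.

```python
from fractions import Fraction

# Flag named as in the script descriptions: 1 = orders 0-100, 0 = orders 1000-1009.
HS_low_check = 1


def select_orders(coeffs, low_check):
    """Pick the coefficient window used as ML input.

    Assumes coeffs[k] is the order-k coefficient and both ranges are inclusive.
    """
    if low_check == 1:
        return coeffs[0:101]
    return coeffs[1000:1010]


def successive_quotients(coeffs):
    """Return c[k+1] / c[k] as exact fractions (assumes non-zero coefficients)."""
    return [Fraction(b, a) for a, b in zip(coeffs, coeffs[1:])]


# Toy series 1/(1 - t)**2, whose order-k coefficient is k + 1.
toy = [k + 1 for k in range(1010)]

window = select_orders(toy, HS_low_check)
ratios = successive_quotients(select_orders(toy, 0))

print(len(window), window[:3])  # 101 [1, 2, 3]
print(ratios[0])                # 1002/1001
```

Exact fractions are used here only to keep the toy example free of rounding; floating-point quotients would be the more usual choice as ML input.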
