Skip to content
/ INF554 Public

INF554 @ école polytechnique 2020. Team ==Baseline==

Notifications You must be signed in to change notification settings

fv316/INF554

Folders and files

NameName
Last commit message
Last commit date

Latest commit

149f455 · Feb 5, 2020

History

50 Commits
Dec 30, 2019
Dec 30, 2019
Dec 25, 2019
Feb 5, 2020
Dec 2, 2019
Dec 7, 2019

Repository files navigation

INF554 Project Repository for Team ==Baseline==

Onboarding

Create virtualenv with

$ virtualenv -p python3 venv

Activate it and install depenencies

$ . ./env.sh  or $ . ./venv/bin/activate
$ pip install -r requirements.txt

Download the data from:

Link

Alternatively:

$ kaggle competitions download -c link-prediction-data-challenge-2019

Whenever you install something new, add it with

$ pip freeze > requirements.txt

Running the Notebooks and Files:

Install the necassary libraries.

  • Original data files are expected in the main directory.

Files/folders:

  • Preprocessing.ipynb preprocessing of text data to create created stemmed corpus. Must be run first.
  • INF554_Simple.ipynb run after Preprocessing.ipynb, calculates the main graph and text based features.
  • INF554_Node2Vec.ipynb run after INF554_Simple.ipynb, calculated node2vec embeddings.
  • INF554_Ensemble.ipynb run last. This file created various models and combined their results in a model ensemble to improve performance.
  • ./pickles stored the pickles features.

About

INF554 @ école polytechnique 2020. Team ==Baseline==

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published