Reputation Economics: IBM Indonesia Intern Project

Using IBM Cloud Services to analyze company's financial performance based on CXO personality and company's reputation. Programmers: Gilang Ramadhan Ilhami, Nicholas Dwiarto Wirasbawa.

Reputation Economics Outline

Getting all the basic information data consisting of company name, CXO, annual report link, and president director's message page on the annual report.
Getting all the financial data from the company annual report. Transform it into a CSV.
3 programs were needed: ORM, Inference, and Scraper.
Utilizing the scraper to get all the basic information data.
Using the ORM to put the gathered data into IBM DB2 on Cloud.
Utilizing the scraper again to scrape president director messages and storing them in IBM COS.
Using the ORM to get all the paths/data from the IBM DB2 on Cloud.
Using Inference to gain an insight about the personality of each President Directors.
The Inference program also pulls (if data is a path) and parses the data from IBM COS.
Utilizing the Inference again to get an insight about corporate's reputation.
Merge all of resulting CSV's into one CSV.
Data analysis!

Initial Setup

Use Python 3.6+ to run this script.
Download or clone the repository.
Create a virtual environment with Python.
Helper link to install virtual environment for Windows: https://programwithus.com/learn-to-code/Pip-and-virtualenv-on-Windows/.
Helper link to install virtual environment for MacOS: https://sourabhbajaj.com/mac-setup/Python/virtualenv.html.
Activate the virtual environment by running the commands (in the Command Prompt) on Mac/Linux: source venv/bin/activate. On Windows: C:\path\to\venv\Scripts\activate.bat. Replace venv with the appropriate virtual environment name.
Install the required dependencies listed in requirements.txt by running pip install -r requirements.txt. For conveniency, the virtual environment should be created in the same directory level as the repository. Suggested virtual environment name is .venv.

Quick Start Guide - Running Personality Insight and Discovery.

Make sure that the Initial Setup is done.
Download phantomjs. Link is here: https://phantomjs.org/download.html.
Open (in file explorer) venv/lib/site-packages/selenium/webdriver/common/service.py and change the cmd path (in def start function, line 70) to your phantomjs.exe file. It should be located in C:\Windows\phantomjs.exe so Windows knows exactly what the program was doing. Set the cmd from [self.path] to be = [path-to-phantomjs].
For Mac/Linux, simply run sudo nano /etc/paths/ and edit the path from there.
Run the test by using python insight.py. Make sure that you are in the inference folder by cd inference in the command line.
Make sure the code in def main() is uncommented for DiscoveryProcessor() or the personality_insight_processor() function!

Additional Notes

Scraper does not have anything to do with IBM Cloud Services, but if anybody wants to try the scraper, please follow the following procedures:

How to run Scraper

Use cd path-to-repo-folder.
Use cd scraper.
Run for Mac/Linux: export PYTHONPATH=/path/to/orm:$PYTHONPATH. For Windows: Change the system variable to include the path to orm folder. Replace /path/to/orm to the full path of the orm folder. Helper link could be found here: https://www.computerhope.com/issues/ch000549.htm.
An alternative: Copy the orm.py file into the same directory that you run your script (whether scraper or inference/insight).
In main_scraper.py, change the index value of list of company names as required (line 87-89).
Run the command python main_scraper.py --path_to_excel path/to/excel_file inside the scraper folder. Replace path/to/excel_file with the actual path to the excel file.

Driver for Scraper

Download chromedriver.
Open venv/lib/site-packages/selenium/webdriver/common/service.py and change the cmd path (in def start function, line 70) to your chromedriver.exe file. It should be located in C:\Windows\chromedriver.exe so Windows knows exactly what the program was doing. Set the cmd to be = [path-to-chromedriver].
For Mac/Linux, simply run sudo nano /etc/paths/ and edit the path from there.
Run the test by using python insight.py. Make sure that you are in the inference folder.
Make sure the code in def main() is uncommented for personality_insight_processor function!

Additional Information regarding the `translate_selenium.py` file

An issue was encountered with using Python's Google Translate API for translation. In order to overcome this, a file named translate_selenium.py was created to handle translation.

However, there are some limitations:

Some setups are required to be able to run the script. Please refer to https://github.com/huseinzol05/Malaya/tree/master/translator
The setup provided is only available for Linux and MacOs device. A recommended work-around for Microsoft user is to use the linux sub-system provided by the operating system.

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
inference		inference
orm		orm
prediction		prediction
scraper		scraper
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reputation Economics: IBM Indonesia Intern Project

Reputation Economics Outline

Initial Setup

Quick Start Guide - Running Personality Insight and Discovery.

Additional Notes

How to run Scraper

Driver for Scraper

Additional Information regarding the `translate_selenium.py` file

About

Releases

Packages

Contributors 2

Languages

grilhami/Reputation-Economics-IBM-Intern-Project

Folders and files

Latest commit

History

Repository files navigation

Reputation Economics: IBM Indonesia Intern Project

Reputation Economics Outline

Initial Setup

Quick Start Guide - Running Personality Insight and Discovery.

Additional Notes

How to run Scraper

Driver for Scraper

Additional Information regarding the translate_selenium.py file

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Additional Information regarding the `translate_selenium.py` file

Packages