Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VAN data extraction - PDF conversion #2

Open
vrwarner opened this issue Sep 18, 2020 · 0 comments
Open

VAN data extraction - PDF conversion #2

vrwarner opened this issue Sep 18, 2020 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@vrwarner
Copy link
Collaborator

vrwarner commented Sep 18, 2020

Motivation

To meet the need for extracting data from VAN in PDF format, we want to create a comprehensive tool that takes that information and converts it into a more consumable file (i.e. CSV, JSON) for analysis.

Workflow Example:

Connect to downloaded VAN PDF with a data table.
Transform the data table in the pdf into data frame.
Create a file in the user-preferred file format.

Proposal

Make a utility to extract data tables from PDFs into consumable formats (i.e. potentially build a database).
You could specify which format to look for, what output you want, and more.

@vrwarner vrwarner added the enhancement New feature or request label Sep 18, 2020
@vrwarner vrwarner self-assigned this Sep 18, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant