Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EES-4727: Tidying up the data_ingestion code by adding types and spli… #29

Merged
merged 5 commits into from
Dec 7, 2023

Conversation

benoutram
Copy link
Collaborator

This PR is an attempt to further tidy up the data ingestion module.

It splits up parsing content into multiple methods to make the code easier to read.

To make the code easier to read and allow for code analysis it adds explicit function parameter types, return types, and some argument name prefixes.

It simplifies utils.chunk_text to only chunk text.

It fixes a bug trying to extract content from content blocks and now only does it if they are of type 'HtmlBlock'.

It handles methodologies not being found by slug in EES which is occurring with the local EES data volume currently (cause still to be investigated).

@benoutram benoutram force-pushed the EES-4727/data-ingestion-tidy-up branch from a298841 to bc0a864 Compare December 7, 2023 10:33
@benoutram benoutram force-pushed the EES-4727/data-ingestion-tidy-up branch from bc0a864 to 9ee5a96 Compare December 7, 2023 12:07
data_ingestion/services/methodology_service.py Outdated Show resolved Hide resolved
data_ingestion/services/release_service.py Outdated Show resolved Hide resolved
@benoutram benoutram force-pushed the EES-4727/data-ingestion-tidy-up branch from b813ff1 to 8a2e518 Compare December 7, 2023 14:18
@benoutram benoutram merged commit 0686ca8 into main Dec 7, 2023
4 checks passed
@benoutram benoutram deleted the EES-4727/data-ingestion-tidy-up branch December 7, 2023 14:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants