-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
chore: notes for changes [2025-02-02]
- Loading branch information
1 parent
4400dd1
commit fb5790f
Showing
3 changed files
with
55 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -197,4 +197,3 @@ src/os_open_usrn_functions/explore_and_prep.py | |
.test_venv | ||
settings.json* | ||
main.py | ||
notes.md |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,54 @@ | ||
# Notes for thinking through problems | ||
|
||
## Breadcrumbs for next time... | ||
|
||
Reduce code duplication in each of the catalogues data loaders... | ||
|
||
So.. | ||
|
||
Implement a shared DataUploader trait/protocol for motherduck and aws s3 for all catalogues | ||
Implement a shared DataFrameLoader trait/protocol for all catalogues | ||
|
||
## Notes for implementing shared loaders behaviours | ||
|
||
Need to understand the structure of we pass to the data loaders of eacgh catalogues first | ||
|
||
## SSEN (Scottish and Southern Electricity Networks) Data | ||
**Return Type:** `List[List]` | ||
|
||
| Field | Data Structure | | ||
|-------|---------------| | ||
| Index 0 (Resource Name) | String | | ||
| Index 1 (Timestamp) | ISO 8601 DateTime String | | ||
| Index 2 (Format) | String | | ||
| Index 3 (URL) | String URL | | ||
|
||
## UK Power Networks Data | ||
**Return Type:** `List[Dict]` | ||
|
||
| Field | Data Structure | | ||
|-------|---------------| | ||
| download_url | String URL | | ||
| format | String | | ||
|
||
## Data.gouv.fr (French Government Data) | ||
**Return Type:** `List[Dict]` | ||
|
||
| Field | Data Structure | | ||
|-------|---------------| | ||
| dataset_id | String | | ||
| resource_created_at | ISO 8601 DateTime String with Timezone | | ||
| resource_extras | Dictionary/Object | | ||
| resource_format | String | | ||
| resource_frequency | Nullable | | ||
| resource_id | String | | ||
| resource_last_modified | ISO 8601 DateTime String with Timezone | | ||
| resource_latest | String URL | | ||
| resource_title | String | | ||
| resource_url | String URL | | ||
| slug | String | | ||
|
||
### Summary: | ||
- SSEN: Each list element contains 4 fields | ||
- UK Power Networks: Each dictionary contains 2 key-value pairs | ||
- Data.gouv.fr: Each dictionary contains 11 key-value pairs |