
Reduce memory usage #83

Open · wants to merge 4 commits into master

Conversation

@mem48 (Contributor) commented Sep 8, 2023

WIP so just a placeholder, but a few tweaks to reduce memory usage.

tweaks to reduce memory use, still WIP
@@ -12,13 +12,12 @@ batch_read = function(
cols_to_keep = c(
"name", # not used currently but could be handy
"distances",
"gradient_smooth",
Collaborator

We ultimately need that column. Fine if it works, but I'd be surprised if it does after this change.

@Robinlovelace (Collaborator) left a comment

Great to see this attempt to reduce memory usage. My thinking is that query could be used to ignore all the keys we don't use. Any idea of the impact on memory use after this change? A benchmark could help.
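For context, one way the memory impact could be checked is with bench::mark, which records allocations per expression. A minimal sketch, not from the PR; "routes_raw.csv" and the selected column are placeholders for whatever batch_read() actually reads:

# Sketch only: compare allocations of the old readr path against the new
# data.table path on the same file. "routes_raw.csv" is a placeholder name.
library(bench)

bm = bench::mark(
  readr = readr::read_csv("routes_raw.csv", show_col_types = FALSE),
  fread = data.table::fread("routes_raw.csv", select = "json"),
  check = FALSE,   # the two calls intentionally return different shapes
  iterations = 5
)
bm[, c("expression", "median", "mem_alloc")]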

res = readr::read_csv(file, show_col_types = FALSE)
n_char = nchar(res$json)

res = data.table::fread(file, select = "json")
Collaborator

I agree with this; readr, which uses vroom, seems to be unreliable.
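For what it's worth, the select argument is likely where most of the saving comes from, since fread then only materialises that one column. A hedged sketch, assuming a placeholder file name rather than the PR's actual input:

# Sketch only: reading every column vs. just the json column we parse later.
# "routes_raw.csv" is a placeholder for batch_read()'s input file.
all_cols  = data.table::fread("routes_raw.csv")                    # whole table in RAM
json_only = data.table::fread("routes_raw.csv", select = "json")   # one column only
lobstr::obj_size(all_cols)
lobstr::obj_size(json_only)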

@@ -90,17 +88,18 @@ json2sf_cs = function(
message(results_error$Freq[msgs],'x messages: "',results_error$results_error[msgs],'"\n')
}
}

results = RcppSimdJson::fparse(results_raw, query = "/marker", query_error_ok = TRUE, always_list = TRUE)
Collaborator

Is there a way to reduce what is read in here with a different query?
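One possible narrowing, purely as a sketch: fparse queries use JSON Pointer syntax, so pointing deeper into the document should keep unused branches from being materialised. The path below is an assumed structure, not verified against the actual CycleStreets response:

# Sketch only: a deeper JSON Pointer means less of each document is parsed
# into R objects. "/marker/0/@attributes" is an assumed path, not taken
# from the PR or the CycleStreets docs.
segment_attributes = RcppSimdJson::fparse(
  results_raw,
  query = "/marker/0/@attributes",
  query_error_ok = TRUE,
  always_list = TRUE
)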

@@ -147,6 +146,7 @@ cleanup_results <- function(x, cols_to_keep){
x = add_columns(x)
x = sf::st_as_sf(x)
x$SPECIALIDFORINTERNAL2 <- NULL
cols = cols_to_keep %in% names(x)
x[cols_to_keep]
cols_to_keep3 = unique(c(cols_to_keep,"gradient_segment","elevation_change","gradient_smooth"))
Collaborator

What if any of these are not needed? For NPT, gradient_smooth is the only one we need.
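One way to make that optional, sketched here as a suggestion rather than taken from the PR: intersect the requested columns with what is actually present, so a caller that only needs gradient_smooth can pass a shorter cols_to_keep without the other gradient columns being forced back in.

# Sketch only: keep just the requested columns that exist in x, rather than
# always re-adding gradient_segment / elevation_change / gradient_smooth.
cols_present = intersect(cols_to_keep, names(x))
x = x[cols_present]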

@@ -44,10 +44,10 @@ route_rolling_average = function(x, n = 3) {


get_values = function(v, fun) {
sapply(v, function(x) fun(as.numeric(x)))
vapply(v, function(x) fun(as.numeric(x)), 1)
Collaborator

Same outcome, what's the advantage?
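For reference, the usual argument for vapply over sapply (illustrative, not from the PR): the FUN.VALUE template (the 1 here, i.e. a length-one numeric) pins the return type, so a malformed element errors immediately instead of silently turning the result into a list, and the output vector can be preallocated.

# Sketch: both return the same numeric vector on clean input, but only
# vapply() guarantees a numeric result and fails fast otherwise.
v = list(c("1", "2", "3"), c("4", "5"))
sapply(v, function(x) max(as.numeric(x)))              # type is guessed
vapply(v, function(x) max(as.numeric(x)), numeric(1))  # type is enforced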

}

extract_values = function(x) stringr::str_split(x, pattern = ",")
extract_values = function(x) stringi::stri_split_fixed(x, pattern = ",")
Collaborator

Sounds reasonable; what's the thinking behind this?
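Presumably the point, sketched here as an assumption rather than from the PR description: stri_split_fixed treats the pattern as a literal string, skipping the regex engine and the stringr wrapper layer, which can matter when splitting many coordinate strings.

# Sketch: identical output on this input; the fixed-pattern split avoids
# compiling "," as a regular expression for every call.
x = c("1.2,3.4,5.6", "7.8,9.0")
stringr::str_split(x, pattern = ",")
stringi::stri_split_fixed(x, pattern = ",")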

@Robinlovelace (Collaborator)

Can you also try to make the GitHub Actions checks happy, Malcolm?
