Unexpected: segments.py seems to have a default of 101 results per species. is that a default that can be overrriden with max_segments? #556

OConnorStP · 2025-01-23T23:56:47Z

I just ran segments against a directory that i was expecting to get as many as 2.5 million segments for a given species. running without the max_segments parm maxed out at 100 output files per species. i see a default in in the code for 100 so i'm planning to run again and override that with a really high max_segments. like maybe 3000000.

any thoughts about how things will go? basically i'm looking for a segment per ID, rather than a random sample of them. do you have a sense of a practical upper limit on the number of segments/species? i don't mind a run that takes a week or so, but i'm wondering if i'll just run out of RAM along the way.

i know -- silliy-big dataset. ridiculous to capture all the segments. but that's what i'd like to shoot for. any thoughts about a practical upper limit for max_segments?

thanks in advance,

mike o'connor

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unexpected: segments.py seems to have a default of 101 results per species. is that a default that can be overrriden with max_segments? #556

Unexpected: segments.py seems to have a default of 101 results per species. is that a default that can be overrriden with max_segments? #556

OConnorStP commented Jan 23, 2025

Unexpected: segments.py seems to have a default of 101 results per species. is that a default that can be overrriden with max_segments? #556

Unexpected: segments.py seems to have a default of 101 results per species. is that a default that can be overrriden with max_segments? #556

Comments

OConnorStP commented Jan 23, 2025