A port of the BIGSI database to the Seq language.
COVID sequence for benchmarking comes from https://www.ncbi.nlm.nih.gov/sra/ERX5420434[accn]
Team Members: Gabe Ramirez, Kaveri Nadhamuni, Charvi Gopal
- Clone the repo.
- Install Seq, ensuring that it is in your
$PATH
and that$LD_LIBRARY_PATH
contains{seq_installation_directory}/lib/seq
/bin/bash -c "$(curl -fsSL https://seq-lang.org/install.sh)"
- Install BIGSI's required libraries. See their docs.
- Compile the
bigseq
binary.
cd bigseq
seqc build -release bigseq.seq -o=bigseq
- Set
SEQ_PYTHON
to the output ofpython3 find.py
. - Run
bigseq
.
bloom <ctx_file> <out_file>
: constructs a bloom filterbuild -b <bloom_file> -s <sample_name> ...
: builds an indexsearch <seq>
: searches for a matching sample
See licenses
directory and report.