GitHub - lh3/tabtk: Toolkit for processing TAB-delimited format

Introduction

Tabtk is a fast and lightweight tool for processing TAB-delimited formats.

Basic Unix cut (duplicated columns ignored):
```
  tabtk cut -f 5,1-3,6,6- file.txt
```
Reorder columns:
```
  tabtk cut -rf 5,1-3,6 file.txt
```
Duplicate columns (duplicated columns not ignored with option -r):
```
  tabtk cut -rf 1,1,1 file.txt
```
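
The difference between the plain and `-r` modes described above can be sketched in a few lines of Python. This is only an illustration of the documented behavior, not tabtk's implementation; the helper names `parse_fields` and `cut_line` are hypothetical, and the handling of out-of-range columns is an assumption.

```python
def parse_fields(spec, ncols):
    """Expand a cut-style list such as "5,1-3,6-" into 0-based column indices."""
    cols = []
    for part in spec.split(","):
        if "-" in part:
            lo, hi = part.split("-", 1)
            start = int(lo) if lo else 1        # "-3" means columns 1..3
            end = int(hi) if hi else ncols      # "6-" means column 6 to the end
        else:
            start = end = int(part)
        # Assumption: columns beyond the end of the line are silently skipped.
        cols.extend(range(start - 1, min(end, ncols)))
    return cols


def cut_line(line, spec, reorder=False):
    """Select columns from one TAB-delimited line.

    reorder=False mimics Unix cut: file order, duplicates dropped.
    reorder=True mimics the -r behavior: requested order, duplicates kept.
    """
    fields = line.rstrip("\n").split("\t")
    cols = parse_fields(spec, len(fields))
    if not reorder:
        cols = sorted(set(cols))
    return "\t".join(fields[i] for i in cols)


row = "a\tb\tc\td\te\tf\tg"
print(cut_line(row, "5,1-3,6,6-"))             # a b c e f g (TAB-separated)
print(cut_line(row, "5,1-3,6", reorder=True))  # e a b c f
print(cut_line(row, "1,1,1", reorder=True))    # a a a
```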
Use both SPACE and TAB as the delimiter:
```
  tabtk cut -d space -f 1-3 file.txt
```
Cut a CSV file:
```
  tabtk cut -d csv -f 2-4 file.csv
```
Commas may appear inside double-quoted fields.
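
To see why quoting matters, here is a small Python sketch using the standard `csv` module as a stand-in for tabtk's CSV parser. The function name `cut_csv` and the sample data are made up for illustration; tabtk's handling of edge cases such as escaped quotes may differ.

```python
import csv
import io


def cut_csv(text, first, last):
    """Return columns first..last (1-based, inclusive) of each CSV record."""
    rows = csv.reader(io.StringIO(text))
    return [row[first - 1:last] for row in rows]


# The comma inside "Smith, John" belongs to the field and is not a separator.
sample = 'id,name,city,zip\n1,"Smith, John",Boston,02101\n'
print(cut_csv(sample, 2, 4))
# [['name', 'city', 'zip'], ['Smith, John', 'Boston', '02101']]
```

A naive `line.split(",")` would break the quoted name into two fields and shift every later column by one.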
Print lines in streamed.txt that match loaded.txt on the first column:
```
  tabtk isct loaded.txt streamed.txt
```
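
The file names suggest that the first file is loaded into memory while the second is streamed. A minimal Python sketch of that kind of key lookup follows; it is an assumption about the general approach, not tabtk's code, and the function name `isct` and the `key_cols` parameter are illustrative only.

```python
def isct(loaded_lines, streamed_lines, key_cols=(0,)):
    """Yield streamed lines whose key columns also occur in the loaded lines.

    key_cols holds 0-based column indices; (0,) compares the first column,
    (0, 1) compares the first two columns together.
    """
    def key(line):
        fields = line.rstrip("\n").split("\t")
        # Assumption: a missing column is treated as an empty string.
        return tuple(fields[i] if i < len(fields) else "" for i in key_cols)

    # Only the keys of the loaded file are kept in memory.
    keys = {key(line) for line in loaded_lines}
    for line in streamed_lines:
        if key(line) in keys:
            yield line


loaded = ["chr1\t100\n", "chr2\t200\n"]
streamed = ["chr1\t100\tA\n", "chr1\t999\tB\n", "chr3\t100\tC\n"]
print(list(isct(loaded, streamed)))          # both chr1 lines
print(list(isct(loaded, streamed, (0, 1))))  # only the chr1/100 line
```

With this design, memory use grows with the loaded file only, so the larger input is the natural choice for the streamed position.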

Print lines matching the first two columns:
```
  tabtk isct -1 1,2 loaded.txt streamed.txt
```

Fixed-width view of a TAB-delimited file, truncating long fields to 20 characters:
```
  tabtk view -l 20 tab.txt | less -S
```
By default, this loads 16MB of data into RAM rather than the whole file.
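
One plausible way to get a fixed-width view without reading the whole file is to compute column widths one batch at a time. The Python sketch below illustrates that idea under stated assumptions: the names `batches` and `fixed_width`, the two-space column gap, and per-batch width calculation are all hypothetical and not taken from tabtk's source.

```python
def batches(lines, batch_size=16 * 1024 * 1024):
    """Group lines into batches of roughly batch_size characters each."""
    batch, size = [], 0
    for line in lines:
        batch.append(line)
        size += len(line)  # character count, used here as a proxy for bytes
        if size >= batch_size:
            yield batch
            batch, size = [], 0
    if batch:
        yield batch


def fixed_width(lines, max_len=20):
    """Truncate fields to max_len and pad each column to its widest field."""
    rows = [[f[:max_len] for f in line.rstrip("\n").split("\t")] for line in lines]
    ncols = max((len(r) for r in rows), default=0)
    widths = [0] * ncols
    for r in rows:
        for i, f in enumerate(r):
            widths[i] = max(widths[i], len(f))
    return ["  ".join(f.ljust(widths[i]) for i, f in enumerate(r)).rstrip()
            for r in rows]


for batch in batches(["a\tbbb\n", "cccc\td\n"]):
    print("\n".join(fixed_width(batch)))
```

Because widths are computed per batch in this sketch, column alignment could shift between batches of a very large file.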
Grep a pattern in specified columns:
```
  tabtk grep -f 2 "^rs[0-9]+" file.vcf
```
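
Restricting the match to one column avoids false hits elsewhere on the line. The following Python sketch shows the idea; `grep_column` is a hypothetical name, and Python's `re` module stands in for tabtk's own regex engine, whose syntax may differ in places.

```python
import re


def grep_column(lines, pattern, col):
    """Yield lines whose 1-based column `col` matches the regular expression."""
    rx = re.compile(pattern)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        # Lines that are too short to have the column are skipped.
        if col <= len(fields) and rx.search(fields[col - 1]):
            yield line


lines = ["1\trs123\tA\n", "1\t.\trs5\n", "2\txrs9\tT\n"]
print(list(grep_column(lines, r"^rs[0-9]+", 2)))  # ['1\trs123\tA\n']
```

The second sample line contains `rs5`, but in column 3, so it is not reported; the third fails because of the `^` anchor.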
Compute the mean, min and max of a numeric column:
```
  tabtk num -c 2 file.txt
```
Compute the standard deviation and quartiles:
```
  tabtk num -Qc2 file.txt
```
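
For reference, the statistics named above can be reproduced with Python's standard library. This is a rough sketch, not tabtk's method: `column_stats` is a hypothetical helper, and the choice of sample standard deviation and the "inclusive" quartile definition are assumptions that may not match tabtk's output exactly.

```python
import statistics


def column_stats(lines, col):
    """Summarize the numeric values in 1-based column `col`."""
    values = []
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        try:
            values.append(float(fields[col - 1]))
        except (IndexError, ValueError):
            continue  # skip headers, short lines and non-numeric fields
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "min": min(values),
        "max": max(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "q1": q1,
        "median": median,
        "q3": q3,
    }


rows = ["name\tvalue\n"] + [f"r{i}\t{i}\n" for i in range(1, 6)]
print(column_stats(rows, 2))
```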
