Regroup consecutive two-field lines of a tab-separated file based on an ordered index.
Médéric Hurier 433a8c4ef6 Update 'LICENSE.txt' 2 months ago


Motivation

Coordinate files are convenient for orienting datasets, but they consume much more disk space than the original data.

The goal of this script is to fold coordinate values based on an ordered index, so that they take less space to store.

NOTE: this script requires Clojure Boot

Usage

cat coords.tsv | ./tsvfold.clj -k0 > folds.tsv 

Where -k is the index of the column that holds the ordered key (0 in the example above).
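
To illustrate the idea of folding, here is a minimal sketch in Clojure. It is not the code of tsvfold.clj: the function name fold-lines and the output layout (the key followed by all of its values on one tab-separated line) are assumptions made for illustration, and the real script may format its output differently.

```clojure
(require '[clojure.string :as str])

;; Hypothetical helper, not taken from tsvfold.clj.
;; Assumes every line has exactly two tab-separated fields and that
;; lines sharing a key are already consecutive (the index is ordered).
(defn fold-lines
  "Merges consecutive lines that share the same key (column k, 0 or 1)
  into one line: the key followed by each of its values."
  [k lines]
  (->> lines
       ;; limit -1 keeps trailing empty fields
       (map #(str/split % #"\t" -1))
       ;; group runs of consecutive rows with the same key
       (partition-by #(nth % k))
       (map (fn [group]
              (let [k-val  (nth (first group) k)
                    ;; with two fields, the value sits in the other column
                    values (map #(nth % (- 1 k)) group)]
                (str/join "\t" (cons k-val values)))))))

(fold-lines 0 ["a\t1" "a\t2" "b\t3"])
;; => ("a\t1\t2" "b\t3")
```

Under these assumptions, each key is written once per run instead of once per line, which is where the space saving would come from.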

Examples

You can find examples in the examples/ folder.