+ ACC=/export/www/biostar-central/export/media/projects/cookbook/data-275/Fish_marker_sequences + TAXDIR=/home/www/refs/taxonomy + TABLE=/home/www/refs/taxonomy/table.txt + NODES=/home/www/refs/taxonomy/nodes.dmp + NAMES=/home/www/refs/taxonomy/names.dmp + mkdir -p /home/www/refs/taxonomy + N=1000 + L=100 + mkdir -p input output code + FASTA=input/reference.fa + READS=input/reads.fq + TAXONOMY=input/taxonomy.txt + REPORT=output/centrifuge-report.txt + OUTPUT=output/centrifuge-output.txt + ACCURACY=output/centrifuge-accuracy.txt + cp /export/www/biostar-central/export/media/projects/cookbook/data-275/Fish_marker_sequences input/accessions.txt + epost -db nuccore -input /export/www/biostar-central/export/media/projects/cookbook/data-275/Fish_marker_sequences + efetch -format fasta + epost -db nuccore -input /export/www/biostar-central/export/media/projects/cookbook/data-275/Fish_marker_sequences + esummary + xtract -pattern DocumentSummary -element Caption,TaxId,Title + INDEX=index/db + mkdir -p index + centrifuge-build -p 2 --conversion-table /home/www/refs/taxonomy/table.txt --taxonomy-tree /home/www/refs/taxonomy/nodes.dmp --name-table /home/www/refs/taxonomy/names.dmp input/reference.fa index/db + pip install plac -q WARNING: pip is being invoked by an old script wrapper. This will fail in a future version of pip. Please see https://github.com/pypa/pip/issues/5599 for advice on fixing the underlying issue. To avoid this problem you can invoke Python with '-m pip' instead of running pip directly. + URL1=https://raw.githubusercontent.com/biostars/biocode/master/scripts/fasta/simulate.py + curl https://raw.githubusercontent.com/biostars/biocode/master/scripts/fasta/simulate.py % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 100 2369 100 2369 0 0 10255 0 --:--:-- --:--:-- --:--:-- 10255 + python code/simulate.py --fname input/reference.fa --count 1000 + centrifuge -x index/db -U input/reads.fq -S output/centrifuge-output.txt --report-file output/centrifuge-report.txt report file output/centrifuge-report.txt Number of iterations in EM algorithm: 3 Probability diff. (P - P_prev) in the last iteration: 6.68843e-11 Calculating abundance: 00:00:00 + centrifuge-kreport -x index/db output/centrifuge-output.txt Loading taxonomy ... Loading names file ... Loading nodes file ... + URL2=https://raw.githubusercontent.com/biostars/biocode/master/scripts/classify/validate.py + curl https://raw.githubusercontent.com/biostars/biocode/master/scripts/classify/validate.py % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 100 3027 100 3027 0 0 12456 0 --:--:-- --:--:-- --:--:-- 12456 + python code/validate.py -f output/centrifuge-output.txt -t input/taxonomy.txt -c 1000