This recipe performs data quality control with the tool cutadapt on microRNA FASTQ data downloaded from SRA.
It selects and downloads data deposited for an SRA BioProject, the value is: PRJNA272617
The recipe trims Illumina sequencing adapters from the end of sequences then performs a sliding window approach for correcting trailing low-quality sequences.
The recipe demonstrates other advanced techniques, for example selecting the smallest run from all the sequencing runs deposited for the project while also demonstrating the proper use of automation via loops constructed for parallel
.
Finally, quality control reports are generated with fastqc
.
For more information see the