DEV Community

Adriano De Marino, PhD
Adriano De Marino, PhD

Posted on

BCFtools

Since I have started to work in bioinformatics, I faced a lot of issues with computational performances and memory usage for the execution of some tasks. In bioinformatics, generally, people work with large amount of data and optimisation is often required.

In case of genomics data one of the most used format is the Variant Call Format (VCF) 

After year of working, with this type of file, one of the best software for manipulating and modification of this format is bcftools. You can do basically everything with this CLI software on a VCF file.
 
The main tips I recommend are:

  • Always convert your file from VCF to BCF format, this will increase at least x4 the execution time speed on all tasks. 
  • The flag —-threads will not speed up the process, The multiprocessing is only applied for compression and decompression.
  • When you have to execute multiple commands on your VCF file, such as: Extraction, Annotation and Normalisation, ensure that you are using bcftools with | (pipe) and -Ou in order to avoid to overwrite intermediate files that will slow down your process.

An example:
bcftools view -t chr2 input.vcf.gz -Ou |
bcftools annotate —-rename-chrs alias.txt -Ou |
bcftools norm -m - -Oz -o output.vcf.gz —-threads 8


This command:

  1. extract only chromosome 2 variants -t chr2
  2. Change chromosome prefix —rename-chrs
  3. Split multi-allelic sites into bi-allelic records and save the output in VCF compressed format -m - and -Oz.

Heroku

Deploy with ease. Manage efficiently. Scale faster.

Leave the infrastructure headaches to us, while you focus on pushing boundaries, realizing your vision, and making a lasting impression on your users.

Get Started

Top comments (0)

Playwright CLI Flags Tutorial

5 Playwright CLI Flags That Will Transform Your Testing Workflow

  • 0:56 --last-failed: Zero in on just the tests that failed in your previous run
  • 2:34 --only-changed: Test only the spec files you've modified in git
  • 4:27 --repeat-each: Run tests multiple times to catch flaky behavior before it reaches production
  • 5:15 --forbid-only: Prevent accidental test.only commits from breaking your CI pipeline
  • 5:51 --ui --headed --workers 1: Debug visually with browser windows and sequential test execution

Learn how these powerful command-line options can save you time, strengthen your test suite, and streamline your Playwright testing experience. Click on any timestamp above to jump directly to that section in the tutorial!

Watch Full Video 📹️

👋 Kindness is contagious

If you found this article helpful, please give a ❤️ or share a friendly comment!

Got it