HaploGrep Rest API

Simple Rest API is up and working again. Please check this article for further details.
Best.
Sebastian

 

HaploGrep2 Update 2.1.1

2 haplogoups were labeled wrong and are fixed now – thanks Rocio for pointing out!

previously until 2.1.0
correct in 2.1.1
R0+16189 R+16189
H+16311 HV+16311

 

 

Combine multiple VCF files

HaploGrep2 currently accepts only single file uploads. To merge several VCF files to one, several tools exist:

CombineVariants
vcfcombine
vcf-merge

An example can be found here

Hope this helps!

 

HaploGrep2 Update 2.1.0

We just updated HaploGrep2, with the following minor points:

  • the Export of the Extended Report (Export / Haplogroup Extendet (txt)) got extended with the “Found_Polys” – see Describtion below:
  • The Report for the potential Phantom Mutations got corrected, so that positions with bases according the rCRS reference are not listed anymore.
  • Report of the possible Recombinations based now on Hamming Distance instead of Kulczynksi-Distance.
  • 6 haplogoups were labeled wrong with H2 instead of H:
previously until 2.0.3
correct in 2.1.0
H2+195 H+195
H2+195+146 H+195+146
H2+152 H+152
H2+16129 H+16129
H2+16291 H+16291
H2+13708 H+13708

Currently, the following columns are included:

SampleID – the identifier of the sample

Range – Sequenced / Genotyped positions on the mitochondrial genome

Haplogroup – resulting Haplogroup

Cluster – if first hit is ambiguous, the result of the cluster is listed in this column

Overall_Rank – the haplogrouping score (from 0.5 to 1) where 0.5 is indicates no SNPs found, and 1 is a perfect match. Now always with “.” as decimal separator

Not_found_Polysfalse negatives – mutations expected in this haplogroup but not found

Found_Polystrue positives – mutations found for the resulting haplogroup. Backmutations are considered as well, indicated by ! (see 182T! or 195C! in Sample Africa01)

Remaining_Polys – Variants not being used for this haplogrouping classification – indicates: a) possibly new haplogroup, or b) possible sample admixture, or phantom mutation (false positives). Listed here are hotspot mutations as well as local private mutations (found in at least one different haplogroup) or global private mutation (unknown in the current phylogeny), as well as heteroplasmic mutations or reference identical positions (the latter is often the case for MicroArray based data).

AAC_Remaining – the remaining Variants in the previous column are checked – and marked as such if involved in an Amino Acid Change.

Input_Sample – the profile used for the classification

 

HaploGrep2 publication

Happy to announce the publication of the HaploGrep2 paper in this years Web Server Issue 2016 in Nucleic Acids Research. For data generated with massive parallel sequencing devices in form of fastq or bam files, the mtDNA-Server paper also published in this years issue, gives some details there.

 

haplogrep2mtdna-server

 

HaploGrep 2 Stand-alone Version

With some delay, we finally updated the stand-alone Version of HaploGrep to the latest version 2. This includes the latest Phylotree 17 (Forensic Science International: Genetics Supplement Series, from December 2015), finescaling the human phylogeny even further. This version has no file-upload limit as  currently applied on the web service (file size of 5MB and max. 3,000 samples – you can use compressed files in zip format tough). We also provide a command-line version of haplogrep2, which makes it straight-forward to integrate it in your workflows or pipelines directly. Or you can use the Rest-API for doing so.

Here’s the direct link to the Download Page – enjoy – and don’t hesitate to contact us in case of questions, suggestions, or any kind of problems.

And here’s the evolution of HaploGrep’s sessions per month from Google analytics, with the release of the Phylotree Versions:

haplogrep_phylotree

 

 

New HaploGrep Exports in Detail

Here’s the overview of the new export options, HaploGrep 2 offers, by clicking on the small arrow next to Export:

haplogrep_export

Missing a specific export format? Feel free to contact us!

 

Phylotree 17 available

After exactly 2 years, Phylotree, the “database” behind HaploGrep got updated by Mannis van Oven. Here’s the accompanying publication on Phylotree 17 . The mtDNA tree has now 5,437 haplogroups, which is a growth of over 13% to the previous version. Find out how Phylotree 17 differs for your dataset, by using the updated HaploGrep 2 Version, with the latest mtDNA tree build 17.

 

 

 

 

HaploGrep’s RestAPI now available!

Hi all,
due to many requests we are happy to announce that we provide Haplogrep’s REST API to the public! This will allow everyone to determine haplogroups in a very convenient way. In the following snippets we show how a simple call works (a) from the UNIX command line using curl and (b) from Java. If you have an example running for other languages, please let us know and we will add it here.

Happy haplogrouping & happy new year!
Sebastian

1) Unix Command Line: This call uploads the file myfile.vcf and returns a JSON String including the tags “id” and “haplogroup”:

curl -i -X POST -H "Content-Type: multipart/form-data" -F "importfile=@myfile.vcf" https://haplogrep.uibk.ac.at/haplogrep-ws

2) Here you can see the exactly same call with Java:

import org.restlet.resource.*;
import org.restlet.ext.html.*;
import org.restlet.data.MediaType;
import org.restlet.data.Status;
import org.restlet.representation.*;
import org.json.*;

import java.io.File;
import java.io.IOException;

public class SampleClient {
public static void main(String[] args) throws IOException {
// change location here
File file = new File("/home/seb/samplefile.hsd");
//POST file
ClientResource cr = new ClientResource("https://haplogrep.uibk.ac.at/haplogrep-ws");
final FormDataSet fds = new FormDataSet();
fds.setMultipart(true);
final FormData fileRep = new FormData("importfile", new FileRepresentation(file, MediaType.APPLICATION_ALL));
fds.getEntries().add(fileRep);
cr.post(fds);

//Response
JSONArray jsonArray = new JSONArray(cr.getResponse().getEntityAsText());
for (int i = 0; i < jsonArray.length(); i++) {
JSONObject object = jsonArray.getJSONObject(i);
String id = (String) object.get("id");
String hg = (String) object.get("haplogroup");
Status status = cr.getResponse().getStatus();
}
}
}

 

1000G Phase 3 mtDNA data available

The 1000 Genome Consortium recently released the Phase 3 mitochondrial DNA data of over 2,500 samples for download, besides their papers in Nature, stating:

Mitochondrial chromosome variants are now available for the Phase 3 individuals from our FTP site

Since in the current Version HaploGrep 2 can deal with the VCF file (unzipped), all 2500 samples can be analyzed instantly