Required libraries¶
Note
This step is not needed when using the Docker version or running on the cloud.
Halvade works with several libraries, the Halvade jar and these libraries needs to be available for Spark.
1 2 3 4 5 6 7 8 9 | HALVADE_VERSION=2.0.4 wget https://bitbucket.org/dries_decap/halvadeforspark/downloads/halvade-assembly-${HALVADE_VERSION}.jar HADOOP_BAM_VERSION=7.10.0 wget https://repo1.maven.org/maven2/org/seqdoop/hadoop-bam/${HADOOP_BAM_VERSION}/hadoop-bam-${HADOOP_BAM_VERSION}.jar HTSJDK_VERSION=2.11.0 wget https://repo1.maven.org/maven2/com/github/samtools/htsjdk/${HTSJDK_VERSION}/htsjdk-${HTSJDK_VERSION}.jar .. note:: For Amazon AWS or Google Cloud Engine please use the libraries provided in `halvade-cloud-bundle.tar.gz<https://bitbucket.org/dries_decap/halvadeforspark/downloads/halvade-cloud-bundle.tar.gz>`_ as a bug fix is required in HadoopBAM to run without errors. |
Build from source¶
The Halvade jar can also be obtained by building from the source files. To do this, clone the git repository and built the package with sbt as follows:
1 2 3 | git clone https://dries_decap@bitbucket.org/dries_decap/halvadeforspark.git cd halvadeforspark sbt assembly |
This will build the jar file in the target/scala-2.12/ directory.