combinations detected: IGHD3-22*01 for Sample 1
and IGHD1-26*01 for Sample 2.
In order to validate our identified main clone se-
quences, highlighted with a rectangular box in Figure
2 A and B, we verify if our V, D and J rearranged
alleles were the same as those obtained by insert-
ing the main clone PCR sequence in different online
tools. We perfomed this comparison against four free
available tools: IMGT/JunctionAnalysis (Monod,
2004), JOINSOLVER (Souto-Carneriro, 2004), VDJ-
solver (Ohm-Laursen, 2006) and SoDA(Volpe, 2006).
As it is possible to note from Table 1, despite
methods and algorithms implemented by the differ-
ent tools are different, all of them agree about the V,
D and J allele assignments of the sequence obtained
in laboratory via PCR. The main clone detected by
the above mentioned tools is for both the Samples the
same we identified applying our pipeline.
5 CONCLUSIONS
The arising interest in understanding the correlation
between a specific pathology and the detection of ab-
normal antibodies with the main purpose of promote
the engineering of therapeutic antibodies conducted
us to develop a new approach to the analysis of the
V(D)J junction of mature B cells. Differently from
the other available tools our pipeline aims at identify
the recombinant V, D and J alleles by starting from
a set of RNA-Seq paired-end reads rather than from
a single sequence. The results obtained on two Sam-
ples of MCL, confirmed by several available tools on
the main clone PCR validated sequence, conducted
us to affirm that the implemented pipeline is capable
to manage the typical sequence features characteriz-
ing V, D and J alleles in other words homologies and
polymorphisms.
Future works will aim at validate the developed
flow on other neoplastic datasets and than at identify
the specific main clone sequence by considering all
the enzymatic processes above mentioned acting dur-
ing the VDJ recombination. We also intend to opti-
mize the proposed algorithm in order to identify and
characterize subclones or divergent clones in a neo-
plastic population and follow them up over time since
it is worth noting that during lymphoma development
the B cell repertoire can evolve.
REFERENCES
Abate, F. (2012). Bellerophontes: A rna-seq data analysis
framework for chimeric transcripts discovery based on
accurate fusion model. Bioinformatics.
Alt, F. W. (1982). Joining of immunoglobulin heavy chain
gene segments: implications from a chromosome with
evidence of three d-jh fusions. Proceedings of the Na-
tional Academy of Sciences.
Altschul, S. F. (1990). Basic local alignment search tool.
Journal of Molecular Biology.
Bassing, C. H. (2002). The mechanism and regulation of
chromosomal v(d)j recombination. Cell.
Fraser, N. L. (2003). The vh gene repertoire of splenic b
cells and somatic hypermutation is systemic lupus ery-
thematosus. Arthritis Res Ther.
Gaeta, B. A. (2007). ihmmune-align: hidden markov
model-based alignment and idedntification of
germline genes in rearranged immunoglobulin gene
sequences. Bioinformatics.
Giudicelli, V. (2004). Imgt/v-quest, an integrated software
program for immunoglobulin and t cell receptor v-j
and v-d-j rearrangement analysis. Nucleic Acids Re-
search.
Huang, S. (1998). Vh usage and somatic hypermutation in
peripheral blood b cells of patients with rheumatoid
arthritis. Clin Exp Immunol.
Hueber, W. (2002). Autoantibody profiling for the study
and treatment of autoimmune disease. Arthritis Res
2002.
Jackson, K. J. L. (2012). Divergent human populations
show extensive shared igk rearrangements in periph-
eral blood b cells. Immunogenetics.
Jung, D. (2004). Unraveling v(d)j recombination; insights
into gene regulation. Cell.
Langmead, B. (2009). Ultrafast and memory-efficient align-
ment of short dna sequences to the human genome.
Genome Biology.
Li (2002). Genetic diversity og the immunoglobulin heavy
chain vk region. Immunology Review.
Monod, M. Y. (2004). Imgt/junctionanalysis: the first tool
for the analysis of the immunoglobulin and t cell re-
ceptor complex v-j and v-d-j junctions. Bioinformat-
ics.
Ohm-Laursen (2006). No evidence for the use of dir, d-d
fusions, chromosome 15 open reading frames or vh
replacement in the peripheral repertoire was found on
application of an improved algorithm, jointml, to 6329
human immunoglobulin h rearrangements. Immunol-
ogy.
Prabakaran, P. (2011). Expressed antibody repertoires
in human cord blood cells: 454 sequencing and
imgt/high v-quest analysis of germiline gene usage,
junctional diversity, and somatic mutations. Immuno-
genetics.
Souto-Carneriro, M. M. (2004). Characterization of the hu-
man ig heavy chain antigen binding complementarity
determining region 3 using a newly developed soft-
ware algorithm, joinsolver. The Journal of Immunol-
ogy.
Stephen, M. (2009). Shrimp: Accurate mapping of short
color-space reads. PLoS Computational Biology.
Volpe, M. J. (2006). Soda: implementation of a 3d align-
ment algorithm for inference of antigen receptor re-
combinations. Bioinformatics.
ANovelPipelineforV(D)JJunctionIdentificationusingRNA-SeqPaired-endReads
189