likely causes for a failed LIMS and share our experi-
ence for the implementation of a successful LIMS.
The rest of the paper is structured as follows: Sec-
tion 3 introduces the design of LIMS, while Section 4
describes the implementation of LIMS, and Section 5
presents the integration of LIMS with existing infor-
mation systems. Section 6 discusses the implemen-
tation of a successful LIMS. Section 7 concludes the
paper and outlines future work.
2 RELATED WORKS
Some commercial and open source LIMSs (Bath
et al., 2011; PerkinElmer, 2016; Grimes and Ji, 2014;
Progeny, 2016; Illumina, 2016) are available but typ-
ically require extensive modification and extension to
address the specific needs of NGS genomics labs. We
therefore developed a web-based LIMS, which is ro-
bust and flexible for managing the samples and the
NGS processes.
3 SYSTEM DESIGN
As described in Figure 1, the functionalities of our
LIMS can be grouped into the following categories:
• Enrollment of Sample Information. Enrollment
of sample information is not a simple form fill-
ing process but constituted by two or even three
steps, and supports multiple business models. For
instance, salesman who stationed at hospitals will
input the basic sample information (e.g., sam-
ple code, sample type, photo of sample sheet)
into system by mobile applications embedded in
Wechat. Then, when the sample along with the
sample sheet are transported to company, typists
in company will finish the full sample information
(in sample sheet) enrollment. Finally, the correct-
ness of sample information enrolled in LIMS will
be checked by another team in company.
• Sample Logistics Information Tracking. Samples
come from different cities distributed in the coun-
try, and will be transported to our head quarter to
be tested. Each sample logistics information will
be tracked by system. When samples are packed
and sent to our company, package logistics code
will be scanned into system. Then, the system
can get the newest logistics information from ex-
press companies through their open APIs. We
can check the package logistics information status
during the whole transportation process. If some-
thing unusual (e.g., delayed, destroyed) happened
in logistics process, the recipients and senders can
response quickly to take steps to minimize the
losses.
• Sample Assessment and Processing. Depends on
sample attribute, sample sheet, sample number,
test type or more factors, the samples arrived at
company can be divided to different processing
directions including rejection, resampling, stor-
age, or flow to the experiment steps. Sample re-
cipients, experiment operators and genomic ana-
lysts can choose the processing directions and op-
erate in the system.
• Experimental Management. Experimental man-
agement is the most important and complicated
part of LIMS. Based on sample type, test type,
previous processing result and operator’s subjec-
tive judgment, the samples will experience multi-
ple experimental steps such as plasma isolation,
nucleic acid extraction, molecular library con-
struction, molecular library quality control, and
etc. Our system is also flexible to support the con-
figuration of different experimental processes.
• Sequencing of Samples. After the experimental
steps, the samples are ready to be sequenced by
NGS instrument (e.g., Illumina HiSeq X Ten). To
minimize the cost of using sequencing reagents,
generally, one NGS instrument operation will be
required to sequencing as many samples as pos-
sible, therefore, samples belonging to different
test business line will be combined together in the
pooling process. After sequencing, the raw .bcl
data will be separated into deferent genomic anal-
ysis business processes. Abnormal result data will
be auto labeled by particular rules configured.
• Genomic Analysis. Before genomic analysis,
LIMS will prepare the raw .bcl data for the fil-
ter and quality control software to be processed.
The generated FASTQ data will be further pre-
pared for the genomic analysis pipelines operated
in the cluster computing environment. After ge-
nomic analysis, LIMS will retrieve the analysis
result set to system so as to wait for the genetic
interpretation.
• Genetic Interpretation. Sequencing and analysis
results are presented in a neat and orderly man-
ner in LIMS for the genetic interpretation scien-
tists. Additionally, LIMS can provide relevant
knowledge base for genetic interpretation such
as gene-disease associations from several public
data sources (e.g. AutDB (Mindspec, 2016), Dis-
GeNET (Pinero et al., 2015), OMIM (Hamosh
et al., 2015)) and the literatures.
Laboratory Information Management System for NGS Genomics Labs
327