Enhanced Graph Representations of Chromatin Interaction Networks

Edgars Celms

, Lelde Lace

, Gatis Melkus

, Peteris Rucevskis

, Sandra Silina

Andrejs Sizovs

and Juris Viksna

Institute of Mathematics and Computer Science, University of Latvia, Raina Bulvaris 29, Riga, Latvia

{edgars.celms, lelde.lace, gatis.melkus, peteris.rucevskis, sandra.silina, andrejs.sizovs, juris.viksna}@lumii.lv

Keywords:

Chromatin Interaction Networks, Graph Representations, Network Topology, Graph Patterns.

Abstract:

We present a novel extension of graph representations of chromatin interaction networks incorporating edge

directionality and vertex label assignments and focus on patterns deﬁned by different types of 3-cliques that can

occur under such assignments. 3-cliques are chosen for their simplicity and comparative ubiquity in chromatin

interaction graphs; also, our previous work indicates a certain level of biological signiﬁcance that can be

assigned to them. Here we explore statistical distributions of different types of directionality- and strand-

based 3-cliques patterns in two well-curated promoter capture Hi-C data sets and observe that the pattern

distributions strongly deviate from random, if they are considered in the context of a number of additional

features. These observations provide a good justiﬁcation for further exploration of chromatin interaction data

sets using network representations that include edge directionality and node label assignments and indicate a

possibility that these annotation features could be related to some speciﬁc underlying biological mechanisms.

1 INTRODUCTION

In this work, we explore the extension of graph rep-

resentations of chromatin interaction networks with

edge directionality and vertex label assignments. The

work was motivated by our previous studies, in which

we have shown the beneﬁts of analysing chromatin in-

teraction networks in terms of topological features of

network representations by undirected graphs (Lace

et al., 2020; Viksna et al., 2020) as well as indica-

tions of the usefulness of enriching these representa-

tions by additional vertex and edge labelling and edge

directionality. The intention is to include in graph rep-

resentations to the maximum extent the available in-

formation about DNA spatial structure, with the pos-

sibility of 3D genome reconstruction from chromoso-

mal contact maps already been (at least, in principle)

demonstrated (Morlot et al., 2016).

The signiﬁcance of DNA 3D structure became

well-acknowledged in 80s as an ’unanticipated dis-

covery’ (Schleif, 1992), backed by evidence of mi-

https://orcid.org/0000-0001-9608-3792

https://orcid.org/0000-0001-7650-2355

https://orcid.org/0000-0002-3077-6809

https://orcid.org/0009-0006-6189-2008

https://orcid.org/0009-0000-3917-9026

https://orcid.org/0009-0004-6958-9965

https://orcid.org/0000-0003-2283-2978

croscopy and targeted wet-lab experiment data. As

signiﬁcant structural features were acknowledged

loops — free segments of DNA lacking interaction

with other DNA parts. The other types of well-

deﬁned DNA structural features, however, were dis-

covered (or proposed) only after advances in chro-

matin conformation capture techniques. The ﬁrst

comprehensive analysis of chromatin interactions was

presented in (Lieberman-Aiden et al., 2009); the au-

thors introduce the notion of compartments – parti-

tion of each chromosome in 2 loci (arbitrarily named

A and B) such that contacts within each set are en-

riched and contacts between them are depleted. The

provided compartment deﬁnition is not strictly for-

malised, though, and the notion of compartments later

seems to be used by other researchers in similar, but

not necessarily exactly the same, contexts.

Already well-deﬁned and well-recognised fea-

tures are TADs (Topologically Associated Domains),

which were initially proposed in (Dixon et al., 2012)

with more precise deﬁnition and detection techniques

developed in (Rao et al., 2014). TADs can be con-

sidered as closely interacting continuous segments of

DNA and are ’well deﬁned’ in the sense that they cor-

respond to obvious and usually well-separated activ-

ity spots on diagonals of heatmaps of Hi-C data. In

(Rao et al., 2014), a robust procedure for automated

loop identiﬁcation from heatmaps is proposed by us-

Celms, E., Lace, L., Melkus, G., Rucevskis, P., Silina, S., Sizovs, A. and Viksna, J.

Enhanced Graph Representations of Chromatin Interaction Networks.

DOI: 10.5220/0013173100003911

Paper published under CC license (CC BY-NC-ND 4.0)

In Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2025) - Volume 1, pages 579-585

ISBN: 978-989-758-731-3; ISSN: 2184-4305

579

ing a number of clustering methods. Potential biolog-

ical mechanisms for forming TADs are well described

in (Matharu and Ahituv, 2015). In (Grubert et al.,

2020) is provided a very recent statistical analysis of

loop and TAD cell type speciﬁcity for 24 human cell

types using datasets from ENCODE portal.

Another interesting (and not too widely studied)

features of interaction data are FIREs (Frequently

Interacting Regions) introduced in (Schmitt et al.,

2016). FIREs can be considered as small sub-

segments of TADs actively interacting with other

regions within them. The authors also propose a

method for FIRE identiﬁcation and show that they

have strong tissue-speciﬁcity (on the basis of 21 tissue

data). A recent review of structural features of chro-

matin interactions (that partially also acknowledges

our still limited understanding of them) was presented

in (Eagen, 2018) and discusses all four structural fea-

tures mentioned above: loops, compartments, TADs,

and FIREs. The authors also provide a computation-

ally more robust deﬁnition of compartments as clus-

ters of two or more strongly interacting TADs.

Regarding the biological role of DNA structural

features, whilst in general it is already well acknowl-

edged (Schleif, 1992), the opinion of the importance

or role of particular structural features still somewhat

varies (that partially might be explained by the use of

different experimental techniques or data processing),

albeit in general consensus that all these structures

have important biological roles seems to be reached.

A number of potential biological mechanisms for this

have also been proposed (Catarino and Stark, 2018;

Mora et al., 2016; Schoenfelder et al., 2010).

Whilst processed chromatin interaction datasets

can naturally be considered as large weighted net-

works, there have been comparatively few studies that

have treated the data explicitly as such network. Most

often, one or several different heatmap (also known as

Hi-C interaction map) representations are used, which

provide a ’general picture’ but are certainly lacking

in detail. One of the ﬁrst studies that tries to anal-

yse Hi-C data as the network is (Siahpirani et al.,

2016); the authors use a number of known cluster-

ing methods (as well as propose their own) for Hi-C

data analysis; the results allow us to differentiate be-

tween four (two human, two mice) cell lines. In (Thi-

bodeau et al., 2017) ML-based network analysis was

applied to data from 3 cell lines, and the authors have

identiﬁed several tens of small subgraphs (’graphlets’

with 2-5 vertices) that were over-represented in the

interaction networks; some of these were shown to be

cell type-speciﬁc, and some were not. Also (Javierre

et al., 2016) provides examples of a few small sub-

graphs of interaction networks, together with possible

explanations of their biological role. The most ex-

plicit network-oriented approach was probably used

by (Schulz et al., 2018), where the authors present an

algorithm for ﬁnding common single linkage clusters

(called δ-teams) in a set of interaction graphs. The

method was also applied to 3 interaction networks

generated by (Dixon et al., 2012), although without

strong conclusions being drawn.

One of the difﬁculties in analysing networks at the

graph topology level is the very wide range of den-

sities of graphs obtained from measured interaction

data. Thus, although the feasibility of reconstruction

of DNA 3D structure from chromatin contact maps

has been demonstrated (Morlot et al., 2016) in princi-

ple, the amount of information about 3D structure that

can be extracted from experimentally obtained Hi-C

datasets is a completely different matter.

Nevertheless, there is steadily increasing interest

in graph-based chromatin structure models (Pancaldi,

2023), including proposals of some very high abstrac-

tion level representations (Dotson et al., 2022; Tan

et al., 2018). Our previous results (Lace et al., 2020;

Viksna et al., 2020) have also shown that topologi-

cal properties of chromatin interaction graphs can be

successful for differentiation between different tissue

and cell types and that certain small network substruc-

tures can be well associated with groups of genes with

similar activity modes. In the background context, we

expect that the proposed extensions could broaden the

classes of networks to which the signiﬁcance of spe-

ciﬁc topological features can be applied.

2 MATERIALS AND METHODS

2.1 Topological Patterns of 3-Cliques

The use of edge directionality is particularly appropri-

ate for promoter capture Hi-C (PCHi-C) data, where

it is already implicitly deﬁned by interactions be-

ing measured between ’baits’ (promoters) and ’other

ends’ (some of which can also be promoters). Thus,

PCHi-C data sets can inherently be viewed as directed

graphs, the challenge is to gain some understanding

whether and/or how this directionality can be associ-

ated with biologically interesting features captured by

these data sets.

In our previous work we have successfully

demonstrated biological signiﬁcance of a number of

speciﬁc topological features in PCHi-C graphs (and

also Hi-C graphs in general). The most of these topo-

logical features were deﬁned by the presence of small

subnetworks, which we refer to as ’patterns’ (’mo-

tifs’ is another commonly used term in computational

BIOINFORMATICS 2025 - 16th International Conference on Bioinformatics Models, Methods and Algorithms

580

biology for such subnetworks, although it tend to im-

ply assignment of also some biological meaning; a

particularly elaborate approach of using subnetworks

in biological network analysis has been developed in

(Yaveroglu et al., 2015; Sarajlic et al., 2016; Przulj

and Malod-Dognin, 2016) by a research team who re-

fer to such substructures as ’graphlets’).

One of the simplest types of such identiﬁed topo-

logical features were k-cliques – fully connected sub-

graphs of k vertices. k-cliques provided good discrim-

inatory power between chromatin interaction graphs

between different tissues and cell types (Lace et al.,

2020), and clique-rich regions can also be well asso-

ciated with increased transcriptional activity (Melkus

et al., 2023). Due to complexity of k-clique ﬁnding

growing exponentially with k, we have limited explo-

ration of k-cliques for values of k = 3...5. Although,

when present, 4- and 5-cliques exhibited particularly

good discriminatory power, with the analysed Hi-C

data sets deﬁning comparatively sparse graphs their

numbers were quite limited, and the most abundant

and particularly signiﬁcant turned out to be such sim-

ple topological structures as 3-cliques. The pres-

ence and biological signiﬁcance of speciﬁc types of

3-cliques has also been justiﬁed by the proposed as

well as experimentally observed transitivity of CTCF-

mediated chromatin loops (Wang et al., 2021).

Due to this it also appears to be very natural to

base the initial assessment of edge directionally and

labelling on the analysis of different types of 3-cliques

that can be obtained by such assignments. We call

these clique types patterns and consider here patterns

that are deﬁned by edge directionality (patterns A, B,

C and D) and by strand assignments to edge endpoints

(patterns X, Y and Z).

In representations of PCHi-C data, 3-clique pat-

terns are deﬁned by the order of their vertices on chro-

mosomes and the direction of edges between them –

up to symmetries, this leads to 4 different types of pat-

terns A, B, C and D (Figure 1). The underlying data

do not imply that A, B, C and D patterns can be di-

rectly associated with different 3D conformations of

chromatin structure. However, they affect the rela-

tive locations of genes and their promoting regions on

DNA loops formed by 3-cliques. Patterns of types A,

B and C are all deﬁned by two promoters, and one can

expect that they will occur with similar frequencies.

Type D patterns, however, are deﬁned by three pro-

moters. Assuming that for all the edges the probabil-

ities for their ’other ends’ to happen to be also ’baits’

are completely random, the probability of pattern to

have three promoters by chance would be p = N

with N

being the total number of promoters and N

being the total number of edge endpoints for a par-

ticular chromosome. Such assumption would lead to

the expected frequencies of patterns A, B, and C be-

ing 1/(p + 3) and the frequency of pattern D being

p/(p + 3). In practice, however, the probabilities p

are not distributed to edges completely randomly, but

depend on multiple other factors (e.g. node degree

distribution), some of which are data set speciﬁc. Due

to this it is difﬁcult to assign meaningful exact expec-

tation frequencies of patterns, but we can assume that

the types A, B and C should be equally frequent, and

the type D is expected to occur less frequently.

Another natural way to partition 3-cliques into

subtypes is by deﬁning patterns according to the

strand speciﬁcity of interacting regions associated

with endpoints of edges. Strand-speciﬁcity of inter-

acting regions likely could be of relevance for vari-

ous types of chromosome conformation capture tech-

niques, not only PCHi-C, provided that such infor-

mation can be extracted from measurement data. As-

suming that the endpoint regions of an observed chro-

matin interaction can be reliably assigned to speciﬁc

chromosome strands, we, up to symmetries, obtain

three types of patterns: X, Y and Z (Figure 1). In

this case, different patterns can already be directly

related to different folding of DNA in 3D space, al-

though the signiﬁcance of this might start to manifest

only for short-range interactions. Clique node allo-

cation to strands is characterised by node sequence

a → b → c, with a, b, c denoting one of the two strands

A and B, which provides 8 possible allocations. Two

of these correspond to type X patterns (A → A → A

and B → B → B), two correspond to type Y patterns

(A → B → A and B → A → B), and four to type Z

patterns (A → A → B, A → B → B, B → B → A and

B → A → A). Thus, with 3-clique vertices being ran-

domly located on strands, patterns of types X and Y

are expected to occur with the probabilities 1/4 each

and pattern of type Z with the probability 1/2.

Figure 1: Four types of patterns of 3-cliques of chromatin

interactions deﬁned by edge directionality: A, B, C and D

(left); and three types of 3-clique patterns deﬁned by gene

locations on strands: X, Y and Z (right).

Enhanced Graph Representations of Chromatin Interaction Networks

581

2.2 Data Sets Used

We have analysed statistical properties of distribu-

tions of these edge directionality and strand location

patterns for two datasets of promoter capture Hi-C

(PCHi-C) data.

One of the best-suited datasets available for such a

purpose is the data set of long-range interactions be-

tween promoters and other regulatory elements that

were generated by The Babraham Institute and Uni-

versity of Cambridge (Javierre et al., 2016). This

data set is still largely unique because it contains

genome-wide data covering a representative subset

of the entire haematopoietic lineage collected using

a uniﬁed protocol. The data was obtained by pro-

moter capture Hi-C (PCHi-C) in 17 human primary

haematopoietic cell types, and from 31253 identiﬁed

promoter interaction regions across all chromosomes,

a subset of high-conﬁdence PCHi-C interactions have

been selected using CHiCAGO pipeline (Cairns et al.,

2016). The graphs deﬁning PCHi-C networks were

constructed using the same signiﬁcance score of de-

tected interaction that was proposed in the dataset

analysed here. We use the same threshold that was

proposed in (Javierre et al., 2016) – i.e., interactions

with a signiﬁcance score 5 or above were selected

for deﬁning graph edges. Depending on the chromo-

some, the number of graph vertices ranges between

3000 and 23000 and the number of edges ranges be-

tween 8000 and 66000. We refer to this dataset as

Haema17.

The other well-curated promoter capture Hi-C

dataset (Jung et al., 2019) that we have used contains

data of 28 human tissue samples with an average of

9000 nodes and 17000 interactions per chromosome’s

tissue sample (when ﬁltered by the suggested p-value

threshold p ≥ 0.7). Due to longer-range interactions

being considered less reliable, the authors have in-

cluded in their data only interactions with lengths not

exceeding 2Mb (which, unfortunately, is a limitation

from our perspective). We refer to this dataset as Tis-

sue28.

The average graph densities are similar for both

datasets and range between 2 and 3 depending on

the chromosome if cell type or tissue speciﬁcity of

the edges are not taken into account. The number of

edges shared by several cell lines or tissues, however,

is slightly higher in Haema17 dataset; this well cor-

responds to the fact that the Haema17 contains data

for more closely related cell types in comparison to

Tissue28.

Only interactions within the same chromosomes

are included in graphs deﬁned by these datasets (thus,

each chromosome is represented separately by its spe-

ciﬁc graph). Chromosomes X and Y are not included

in the analysis. The edge directionally used to deﬁne

A, B, C and D types of 3-clique patterns are directly

based on ’bait’ and ’other end’ assignments as given

in PCHi-C data sets. Strand assignments for deﬁning

X, Y and Z types of patterns were derived from strand

assignments of promoter regions according to the En-

sembl reference genome (GRCh38). Not all strand

assignments could be derived in such a way, and a

number of derived assignments were ambiguous. The

corresponding chromatin interactions in these cases

were excluded from datasets, resulting in the exclu-

sion of approximately 50% of interactions.

3 RESULTS

We have analysed statistical properties of distribu-

tions of edge directionality and strand location pat-

terns for Haema17 and Tissue28 data sets. Figure 2

shows the distribution of pattern A, B, C and D fre-

quencies (as counted in thousands) for each of the

chromosomes and also their average lengths (in kilo-

bases) for Haema17 data set. As it was anticipated,

there are no signiﬁcant distinctions of pattern A, B

and C frequencies (466119 type A, 457752 type B

and 500289 type C patterns across the all chromo-

somes). Also, as it was anticipated, pattern D is less

abundant (81287 type D patterns across all the chro-

mosomes), although there is a notable variation in its

relative frequency by chromosomes. Very noticeably,

however, span lengths for type D patterns are signif-

icantly larger, indicating that their occurrence is not

distributed randomly – the average span lengths of

type A, B and C patterns being around 80Mb, and the

average span length of type D patterns around 205Mb.

Also worth noticing is the fact that the average length

increase for type D patterns is less pronounced for

chromosomes in which they occur more frequently

(e.g., for chromosomes 6 and 19, which have, corre-

spondingly, the average lengths around 30Mb and 130

Mb for the A, B and C types and the average lengths

around 50Mb and 150 Mb for the D type).

Unfortunately, Tissue28 data set does not allow to

replicate or validate the observation of substantially

increased span lengths of type D patterns since longer

range interactions have been explicitly excluded from

it.

At the present stage we can not provide a plausi-

ble biological explanation for increased span lengths

of type D patterns, although the observed increase of

lengths is statistically signiﬁcant and likely must have

underlying reasons that merit further exploration.

A property that is shared by both datasets is the

BIOINFORMATICS 2025 - 16th International Conference on Bioinformatics Models, Methods and Algorithms

582

Figure 2: Counts (in thousands) of A, B, C and D type pat-

terns (above), and their average span lengths (in kb) (below)

for Haema17 dataset.

reduction of span lengths for all types of patterns

that are present in a larger number of tissues or cell

types. Figure 3 shows a comparison of average pat-

tern lengths in Tissue28 data set for unrestricted pat-

terns and patterns that occur in at least 5 different tis-

sue types – for the latter, the average span lengths drop

more than twice. Very similar approximately twofold

size reduction can be observed for Haema17 data set

(for cut-off using the same requirement, that patterns

must be shared by at least 5 different cell types), al-

though in numerical terms, the average span lengths

for this data set are larger. It should be noted that

this reduction of span lengths can not be explained by

reduced span lengths of single interactions shared by

multiple tissue or cell types, average lengths of which

depend little on the number of tissue or cell types in

which a particular interaction is observed.

The frequencies of X, Y and Z type patterns ob-

served in the whole data sets generally correspond to

random expectations, with similar counts of types X

and Y and type Z patterns being approximately twice

as frequent. For Haema17 data set, their frequency

distribution is shown in Figure 4 (above). (For eas-

ier comparison, the counts of Z types of patterns are

divided by 2.) The frequency distribution, however,

noticeably changes when we restrict our attention to

patterns with limited span lengths. In this case, a no-

ticeable increase in pattern X frequencies, as well as

a noticeable decrease in pattern Y frequencies, can be

observed. The results for Haema17 data set using 200

kilobase length cut-off is shown in Figure 4 (below).

Very similar results of X, Y and Z type pattern distri-

bution can be observed also for Tissue28 data set.

Figure 3: Average span lengths (in kb) of A, B, C and D

type patterns without tissue type speciﬁcity (above), and for

patterns that occur in at least 5 different tissues (below) for

Tissue28 dataset.

Figure 4: Counts (in thousands) of X, Y and Z type patterns

with unrestricted lengths (above), and with lengths up to

200kb (below) for Haema17 data set. For easier comparison

the counts of Z type patterns are divided by 2.

4 DISCUSSION

We have proposed an extension of graph represen-

tations of chromatin interaction networks by incor-

porating edge directionality and vertex label assign-

ments and have analysed statistical properties of dis-

tributions of patterns deﬁned by edge directional-

ity (types A, B, C and D) and strand assignments

(types X, Y and Z) in well-curated two promoter cap-

ture Hi-C data sets. The results conﬁrm that both

directionality- and strand-based patterns are not ran-

domly distributed, and there should be underlying bi-

ological reasons for the observed deviations.

One of the observed deviations is signiﬁcantly in-

Enhanced Graph Representations of Chromatin Interaction Networks

583

creased span lengths of type D patterns, which are de-

ﬁned by three promoters (in comparison, A, B and

C types of patterns involve only two promoters). A

possible explanation for this might be that, compared

to other types, the observed type D patterns can be

more likely associated with functioning gene regula-

tory feedback loops. Such a hypothesis could be val-

idated from gene expression measurements. Unfortu-

nately, no appropriate gene expression data with good

genome-wide coverage are available for cell types

from Haema17 data set. Similar validation would

be much easier to perform for Tissue28 data set us-

ing, e.g., gene expression data from GTEx consortium

(Consortium, 2020). However, the long-span chro-

matin interactions for which to perform such valida-

tions have not been included in the underlying PCHi-

C data. Nevertheless, there is a good potential to val-

idate or refute such a hypothesis when new, better-

suited experimental data sets become available. Pro-

vided that appropriate datasets for such type of analy-

sis is at hand, an interesting and promising challenge

in this research direction would be integrated analy-

sis of chromatin interaction and gene expression net-

works using a uniﬁed approach already well estab-

lished for gene expression networks, such as (Song

and Zhang, 2015).

The observation that for all types of patterns, their

average span lengths are reduced for patterns that are

present in a larger number of tissues or cell types is

consistent and complements a previously known fact

of tissue speciﬁcity of 3-cliques. The underlying rea-

sons for this merit additional exploration; however,

any pattern-type speciﬁcity for this property is lack-

ing.

Of notable interest might be the observed bias of

X, Y and Z pattern distributions for shorter-range in-

teractions. The most abundant type X patterns involve

all promoters positioned on the same strand, and the

least frequent type Y involves all adjacent promoters

lying on alternate strands. Thus, a plausible explana-

tion for such a bias could be related to local spatial

constraints on chromosome 3D structure, although a

much more detailed and comprehensive study would

be needed to assess this.

The observed statistical biases of 3-clique pattern

distribution are based on analysis of two PCHi-C data

sets and it remains an open question of how general-

izable or data set-speciﬁc these statistical deviations

could be. We also do not anticipate that any particular

type of the proposed patterns can be closely related

to some very speciﬁc biological role. Nevertheless,

the analysis gives a good justiﬁcation for a further,

more comprehensive exploration of chromatin inter-

action data sets using network representations that in-

clude edge directionality and strand-based node la-

bel assignments, if these can be assigned on the basis

of the available data, and indicates a possibility that

these features might be related to some underlying bi-

ological mechanisms.

ACKNOWLEDGEMENTS

The research was supported by Latvian Council of

Science project lzp-2021/1-0236.

REFERENCES

Cairns, J.and Freire-Pritchett, P. et al. (2016). CHiCAGO:

robust detection of DNA looping interactions in cap-

ture Hi-C data. Genome Biology, 17:127.

Catarino, R. and Stark, A. (2018). Assessing sufﬁciency

and necessity of enhancer activities for gene expres-

sion and the mechanisms of transcription activation.

Genes & Development, 32(3-4):202–223.

Consortium, G. (2020). The GTEx Consortium atlas of ge-

netic regulatory effects across human tissues. Science,

369(6509):1318–1330.

Dixon, J., Selvaraj, S., et al. (2012). Topological domains

in mammalian genomes identiﬁed by analysis of chro-

matin interactions. Nature, 485:376–380.

Dotson, G., Chen, C., et al. (2022). Deciphering multi-way

interactions in the human genome. Nature Communi-

cations, 13:5498.

Eagen, K. (2018). Principles of chromosome architecture

revealed by Hi-C. Trends in Biochemical Sciences,

43(6):469–478.

Grubert, F., Srivas, R., et al. (2020). Landscape of cohesin-

mediated chromatin loops in the human genome. Na-

ture, 583(7818):737–743.

Javierre, B. Burre, O. et al. (2016). Lineage-speciﬁc

genome architecture links enhancers and non-coding

disease variants to target gene promoters. Cell,

167(5):1369–1384.

Jung, I., Schmitt, A., et al. (2019). A compendium of

promoter-centered long-range chromatin interactions

in the human genome. Nature Genetics, 51(10):1442–

1449.

Lace, L., Melkus, G., et al. (2020). Characteristic topo-

logical features of promoter capture Hi-C interaction

networks. Communications in Computer and Infor-

mation Science, 1211:192–215.

Lieberman-Aiden, E., Van Berkum, N., et al. (2009). Com-

prehensive mapping of long-range interactions reveals

folding principles of the human genome. Science,

326(5950):289–293.

Matharu, N. and Ahituv, N. (2015). Minor loops

in major folds: Enhancer–promoter looping, chro-

matin restructuring, and their association with tran-

scriptional regulation and disease. PLOS Genetics,

11(12):e1005640:476–486.

BIOINFORMATICS 2025 - 16th International Conference on Bioinformatics Models, Methods and Algorithms

584

Melkus, G., Silina, S., et al. (2023). Clique-based topo-

logical characterization of chromatin interaction hubs.

Lecture Notes in Computer Science, 14248:476–486.

Mora, A., Sandve, G., et al. (2016). In the loop: promoter-

enhancer interactions and bioinformatics. Brieﬁngs in

Bioinformatics, 17(6):980–995.

Morlot, J., Mozziconacci, J., et al. (2016). Network con-

cepts for analyzing 3D genome structure from chro-

mosomal contact maps. PJ Nonlinear Biomedical

Physics, 4:2.

Pancaldi, V. (2023). Network models of chromatin struc-

tures. Current Opinion in Genetics & Development,

80:102051.

Przulj, N. and Malod-Dognin, N. (2016). Network analytics

in the age of big data. Science, 353(6295):123–124.

Rao, S., Huntley, M., et al. (2014). A 3D map of the hu-

man genome at kilobase resolution reveals principles

of chromatin looping. Cell, 162(3):687–688.

Sarajlic, A., Malod-Dognin, N., et al. (2016). Graphlet-

based characterization of directed networks. Scientiﬁc

Reports, 6(35098).

Schleif, R. (1992). DNA looping. Annual Review of Bio-

chemistry, 61:199–223.

Schmitt, A., Hu, M., et al. (2016). A compendium of chro-

matin contact maps reveal spatially active regions in

the human genome. Cell Reports, 17(8):2042–2059.

Schoenfelder, S., Clay, I., et al. (2010). The transcriptional

interactome: gene expression in 3D. Current Opinion

in Genetics & Development, 20(2):127–133.

Schulz, T., Stoye, J., et al. (2018). GraphTeams. a method

for discovering spatial gene clusters in hi-c sequenc-

ing data. BMC Genomics, 19(S5):308.

Siahpirani, A., Ferhat, A., et al. (2016). A multi-task

graph-clustering approach for chromosome conforma-

tion capture data sets identiﬁes conserved modules of

chromosomal interactions. Genome Biology, 17:114.

Song, W. and Zhang, B. (2015). Multiscale embedded gene

co-expression network analysis. PLoS Computational

Biology, 11(11):e1004574.

Tan, Z., Guarnera, E., et al. (2018). Exploring chromatin

hierarchical organization via Markov State Modelling.

PLOS Computational Biology, 14(12):e1006686.

Thibodeau, A., Marquez, E., et al. (2017). Chromatin inter-

action networks revealed unique connectivity patterns

of broad H3K4me3 domains and super enhancers in

3D chromatin. Scientiﬁc Reports, 7(1):14466.

Viksna, J., Melkus, G., et al. (2020). Topological struc-

ture analysis of chromatin interaction networks. BMC

Bioinformatics, 20(S23):618.

Wang, W., Gao, L., et al. (2021). Ccip: predicting ctcf-

mediated chromatin loops with transitivity. Bioinfor-

matics, 37(24):4635–4642.

Yaveroglu, O., Milenkovic, T., and Przulj, N. (2015).

Proper evaluation of alignment-free network compar-

ison methods. Bioinformatics, 31(16):2697–2704.

Enhanced Graph Representations of Chromatin Interaction Networks

585