InteRRact is a web interface for exploring human cis and trans RNA–RNA interactions derived from RNA duplex probing and proximity ligation experiments (e.g., SPLASH, PARIS, LIGR-seq, RIC-seq). InteRRact provides an interactive interface to explore published human datasets, filter duplex groups using confidence annotations (replicate support, read count, predicted hybridisation energy and p-values), visualise trans interactions as gene-level networks, inspect cis contacts in a linear track-like view, and compare interaction landscapes across datasets.
Use the buttons below to learn about InteRRact capabilities, or use the navigation bar to start exploring.
We consider evidence for RNA-RNA interactions in terms of experimental duplex probing data. These duplex data are created in RNA proximity ligation experiments - along with the experimental methods that rely on utilising RBPs - that aim to identify the loci of an RNA-RNA interaction. Each RNA-RNA duplex corresponds to interaction within one RNA or to an interaction between two RNAs. Information on an RNA-RNA duplex is typically stored in terms of pairs of coordinates corresponding to genomic or transcriptomic locations, supplemented with various metadata. For details about the RNA-RNA interaction probing experiments and bioinformatics analysis, click source data & processing button above.
Transcriptome-wide RNA–RNA interaction datasets provide a unique view into RNA structures and trans RNA-RNA interactions in cells. These datasets, however, are often sparse, protocol-dependent, and lack a ground truth. As a result, the systematic exploration, comparison, and functional interpretation of RNA–RNA interaction networks remains challenging and relies on quantitative meaning methods for assessing their reliability which InteRRact incorporates in terms of level of experimantal replication, estimated p-values and other means.
InteRRact provides an interactive framework to explore, filter, compare, and interpret RNA–RNA interaction datasets in a biologically meaningful and statistically meaningful way.
InteRRact provides access to carefully pre-proccesed data from RNA duplex probing experiments and provides three main means to explore it: InteRRact contains three main of modules:
For details and a short tutorial, please click the respective button above
Traditional linear genome views obscure the RNA-RNA contact network organisation, while pure network representations hide positional and structural details. InteRRact visualises RNA-RNA interaction data in terms of a network representation as well as in terms of a linear genomic view that is capable of focusing on one or two genomic locations. In this manner, more global network information can be explored alongside a detailed view of genomic locations for a particular RNA-RNA interactions.
Raw datasets of RNA–RNA interactions are inherently sparse and prone to experimental biases as well as artefacts from computational post-processing of the raw data. Only a small fraction of sequencing reads are actually chimeric, crosslinking and ligation efficiencies vary, and bioinformatic processing may introduce false positives. To overcome this challenge, all raw RNA-RNA interaction datasets have been uniformly post-processed with DuplexDiscover, which is able to derive p-values as well as other quantitative features to reliably estimate the reliability of RNA-RNA interactions. Using InteRRact, you can readily filter the data via these pre-calculated features: number of supported reads, hybridisation energy, p-vales or other features. It thus easy to filter for reliability and to explore your own hypotheses regarding the data.
In addition to the user-controlled filtering options described above, the datasets featured in InteRRact are pre-filtered by default to exclude low-complexity and other problematic regions, as well as RNA-RNA interactions mapping to intergenic loci. Please refer to the Dataset Statistics tab to see what is filtered out. If you still want to browse interactions originating from problematic regions, you can disable pre-filtering in the Advanced options sidebar pane (Visualise single dataset and Pairwise comparison modules).
InteRRact provides a structured framework to compare two datasets within a unified annotation scheme. The union graph of two RNA-RNA interaction networks encodes shared as well as dataset-specific nodes and edges and supports clustering of pooled duplex groups, enabling assessment of reproducibility, condition-specific interactions, and protocol-dependent differences.
In addition to the pairwise comparison with an intreraction graph, InteRRact allows comparison across multiple datasets by computing the their intersections.
If you encounter a problem with the InteRRact server, please contact yegor.smb@gmail.com
This module enables interactive exploration of a single RNA–RNA interaction dataset derived from proximity ligation or duplex probing experiments. It is designed for hypothesis-driven inspection of one experiment or condition at a time. Trans interactions are visualised as a gene-level interaction network, while locus-level evidence can be inspected in a linear track view. You can dynamically filter interactions to focus on robust duplex groups and explore community structure within the network.
Dataset selection
Choose experiment / cell line / condition.
Minimum reads
Require a minimum number of supporting reads per duplex group.
Minimum replicates
Require detection in multiple replicates.
Minimum component size
Remove small disconnected subgraphs to focus on hubs and structured communities.
RRIs per gene pair
Require more than N duplex groups between two genes.
Hybridisation energy (ΔG)
Filter interactions by predicted duplex stability.
P-value threshold
Retain statistically supported interactions for the binominal random ligation test.
P-value threshold for the odds ratio test
Retain statistically supported interactions for the odds-ratio ‘interaction orrured by chance’ test.
Gene search / coordinate search
Focus on a gene of interest or a genomic locus.
Network physics & colouring
Adjust layout behaviour and colour nodes by community.
These controls expose the confidence metrics directly derived from the experimental library and downstream statistical processing.
This tutorial explains how to use InteRRact in the single-dataset mode using a biologically motivated example based on recently published work. Bergeron et al. analysed PARIS, LIGR-seq and SPLASH datasets and identified pletora of snoRNA-host transcript interactions. The central validated interaction in their analysis is snoRNA-mRNA interaction: SNORD2 <-> EIF4A2. SNORD2 resides on EIF4A2 and interacts with its host intron near exon 4 and regulates EIF4A2 splicing by sequestering the nucleotides at the branch point.
We will use InteRRact to locate the SNORD2-EIF4A2 interaction in the trans RNA-RNA interaction network, inspect its supporting duplex groups, and examine the corresponding locus in the linear genomic view. We then zoom out to broader outlook in order to demonstrate that the same workflow can be extended to many other snoRNAs in the dataset.
We will use the LIGR-seq dataset, as this LIGR-seq protocol produces the largest amount of small RNA interactions, including snoRNAs. For this example, we keep the default confidence filters unchanged. To make the graph easier to interpret and pinpoint the RNA biotypes, the colouring rule can be set to gene type.
To focus on the interaction of interest, we enter SNORD2 in the gene search field and request display of only n = 1 interacting neighbour in the graph. This reduces the network to the immediate local neighbourhood of SNORD2 and makes it easier to inspect the relevant interaction.
The resulting graph shows SNORD2 together with its interacting partner EIF4A2. Selecting the SNORD2 node fills the node information panel with the basic properties of the gene, including its identifier, type and number of interacting partners. Selecting the SNORD2-EIF4A2 edge fills the edge information panel and exposes the interaction as a set of underlying duplex groups.
Once the SNORD2-EIF4A2 edge is selected, the detailed edge view shows the number of duplex groups supporting the interaction together with their confidence-related measures. The full set of confidence features can also be downloaded from this panel.
Selecting an entry in the duplex groups table shows the schematic representation of the hybrid on the two duplex-group arms. The same selection can then be used to switch directly to the linear genomic view.
To load the linear representation, go to the corresponding panel and click load linear tracks. The IGV browser then loads the gene annotation and interaction tracks. Because a specific duplex group has already been selected in the edge view, the option focus both panels on selected duplex group can be used to centre the top and bottom browser panels on the two arms of that duplex group.
In this example, SNORD2 interacts with its host gene, so both browser panels point to the same EIF4A2 locus. Bergeron et al. proposed that SNORD2 may affect host-gene splicing before its own maturation by forming an intramolecular RNA structure involving the branch point. In the linear view, duplex group ID:11862 is visibly extended beyond the annotated genomic boundaries of SNORD2. This indiates that underlying chimeric reads map outside the snoRNA gene coordinates, supporting the interpretation that the detected interaction may act in cis, in agreement with the model discussed in the paper.
This step shows how InteRRact can be used to move from a network edge to the underlying duplex evidence and then to the locus-level representation of the same interaction.
The SNORD2-EIF4A2 interaction is an example of functional snoRNA-mRNA interaction which has been validated independently from the proximity-ligation data. However, many other intreactons of this type, that may have non-canonical biological function in trans could be observed in the LIGR-seq.
The same workflow can be applied more broadly to other snoRNAs present in the dataset. We remain in the LIGR-seq dataset and use snoRNA-centred search to browse for other snoRNA- interactions. Confidence filters can be tightened at this stage to retain only more strongly supported interactions.
With the graph coloured by gene type, snoRNA nodes can be located easily in the network represetation. For any interaction of interest, you can move from the network to the selected edge, inspect the associated duplex groups, and then immidiatly switch to the complementary linear genomic view .
In this way, InteRRact can be used not only to folow up on the investigated examples such as SNORD2-EIF4A2, but also to identify many additional interactions that may serve as candidates for further exploration.
This module allows the qualitative comparison of two RNA–RNA interaction datasets. It visualises the union of the two RNA-RNA interaction networks and encodes whether nodes and edges are shared or dataset-specific. The comparison mode addresses the lack of standardised approaches for comparing the outcome of two RNA–RNA interaction experiments.
Input filters for the Datasets A & B
Confidence filters are applied to each dataset prior to gathering the super-set of the RRI. They are similar to the filters used in single-dataset module.
Input filters for the combined RRI network
After two datasets are compared, the combined RRI set is created. It can be filtered in following ways:
Show dataset-specific communities You can either select the RRI specific to the dataset A or B, or both. Additionally, if the community consists of RRI from different datasets i.e same gene has different RRI, such community is labelled as mixed. Mixed communities also could be isolated and expored.
Minimum component size
Removes small disconnected subgraphs to focus on hubs and structured communities.
Minimum RRIs per gene pair
Require more than N duplex groups between two genes.
Whether to show the edges where distinct (unmatched) RRIs present between the gene in each dataset.
Minimum matched interaction threshold
Minimum % of matched RRI to label edge as matched
Define how many/which fraction of all edge RRI are required for an edge to be annotated as matched and drawn normally in green. I.e the edge between the two genes that is formed by four RRI, of which the half is present in both datsaets, and another half are distinct can be drawn as the conserved or mixed one. These options allow you to customise the rule for this.
Gene / coordinate search
Focus on specific genes within the union graph.
To showcase the pairwise comparison module, we will use H1 and hNPC cell lines from RIC-seq data. Our aim will be to the combined network to identify interactions that are specific to the pluripotent or neuronal progenitor state. We configure the data filtering parameters to moderately strict, requiesting only RRI which are observed in 3/4 replicates and use the odds-ratios test p-values.
We first select communities present only in H1. This allows us to focus on RNA interaction programs that are absent from, or rewired in, hNPC cells.
Several signal appear immediately: the graph contains well-known pluripotency-associated markers, including LIN28A. LIN28A is a well-known regulator of stem-cell identity. Another gene HMGA1 is a non-histone chromatin protein implicated in embryonic development and maintenance of stem-like cellular states.
In addition to these genes, MALAT1 and several ENSG-annotated loci, many of which correspond to lncRNAs or less well-characterized transcripts suggest that non-coding RNAs are also part of the H1-specific interaction landscape. This highlights an important feature of the pairwise comparison view: it captures not only known regulators, but also candidate non-coding components of the same cell-type-specific network.
Importantly, not all biologically relevant changes form large communities. Some of the most specific differences between the cell lines are represented by two-nodes, single-edge components, which we filtered out by restricting the analysis to components larger than n = 3.
We observe the same pattern in the opposite direction when selecting communities present only in hNPCs. In this case, the graph no longer highlights pluripotency-associated factors but instead suggests interactions related to neural progenitor identity and neuronal differentiation, consistent with the cell type. Several cliques containing genes associated with the hNPC phenotype are present. For example, QKI RBPs are known regulators of glial differentiation, and APP has established roles in neurodevelopment.
Together with the H1-specific graph, this comparison shows that the pairwise trans interaction view can recover biologically meaningful differences between cell states: H1-specific communities highlight pluripotency-associated programs, whereas hNPC-specific communities reveal neural progenitor-associated rewiring.
Finally, we swith to communities, which contain genes interating with different partners in NPCs and H1. Such mixed communities highlight a different type of relationship between the two cell types. Rather than being fully specific to either H1 or hNPC, these components contain a shared interactor* together with **cell-type-specific edges
In such communities, the same gene can participate in different interactions in the two datasets. This makes the pairwise trans interaction graph useful not only for identifying fully H1- or hNPC-specific programs, but also for detecting cases where a gene is present in both cell types while its interaction partners change between conditions.
Thus, mixed communities provide a complementary view to the exclusive ones: instead of showing fully gained or lost modules, they reveal shared RNA interaction neighborhoods that are rewired between H1 and hNPC.
This module allows you to see the results of multiple comparisons across more than two conditions or experiments.
It summarises overlaps between matched interactions across selected datasets and presents them as an interactive UpSet plot.
Additionally, you can include one RRI dataset passed from the visualise your data module.
Interactions contributing to each overlap category can be browsed, filtered by confidence measures and exported in bedpe format.
Because datasets may originate from different experimental protocols and may differ in the range and availability of confidence features, the initial superset is assembled without applying confidence thresholds.
The required inputs for the UpSet plot are:
Please note that rendering of the plot is handled by the client browser. If more than six datasets are included in the comparison, rendering of the plot may take up to 1-2 minutes.
Once the comparison plot is ready, you can click on the overlap category, which will display the tabular view and activate the data filters.
Tabular data include all RRIs in each dataset and can be ordered by a special variable Comp.Group that labels the overlapping RRIs.
You can upload your custom set of RRIs to the visualise your data module, which, once the import is successful and the module has been launched, can also be matched against datasets already hosted on InteRRact. Upon successful saving of the data, you will see a short message in the bottom-right corner (while in visualise your data). In the multiple comparison module, the filename of your upload should appear in the list of available data.
Computing the overlaps between your RRIs and datasets could take up to several minutes, depending on the size of your input. The current maximum allowed number of user-imported RRIs is 5000.
Note that user input is treated as a special case, as it is not included while the superset is being computed. Instead, user-provided RRIs are directly matched against the superset. The unique user RRIs are added to the superset as mismatching, while overlapping ones are recorded as matched. Importantly, only a single hit of a user-provided RRI per corresponding superset RRI is added to the comparison statistics. In other words, if your RRIs are redundant, i.e. you have many duplex groups where both arms overlap, only one overlap per such redundant set adds count to the comparison. You still retain all uploaded RRIs in the tabular browser.
Each selected dataset contributes its processed RNA-RNA interactions to a common superset of matched interactions. Overlaps are then computed across all selected datasets and shown in the UpSet plot. The following figure details the procedure.
This module enables visualisation of user-provided RNA–RNA interaction data using the same interactive framework as the built-in datasets. Uploaded interactions must be provided in standard BEDPE format. This allows external datasets to be explored, filtered, and interpreted consistently within InteRRact.
The module supports two ways of importing the RRI data.
.bedpe tab-formatted file with pre-defined columns that specify the gene names. Columns can be stored at any position after the 10th column and must have name.A and name.B. See the example file header and few lines below| chromA | startA | endA | chromB | startB | endB | name | score | strandA | strandB | name.A | name.B | n_reads | E | p_value | extra_one |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| chr6 | 85677790 | 85677887 | chr17 | 49339501 | 49339625 | demo_1 | 86.747 | - | - | ENSG00000271793 | ZNF652 | 575 | -47.6 | 0.1462 | 51.265 |
| chr6 | 34245525 | 34245630 | chr12 | 110429261 | 110429362 | demo_2 | 91.550 | + | - | HMGA1 | ENSG00000258210 | 216 | -78.8 | 0.0203 | 49.645 |
| chr9 | 19379527 | 19379620 | chr9 | 19379762 | 19379854 | demo_3 | 92.659 | - | - | RPS6 | RPS6 | 185 | -34.3 | 0.03044 | 53.002 |
| chr19 | 17862086 | 17862167 | chrX | 153662314 | 153662369 | demo_4 | 88.625 | + | + | RPL18A | RPL18AP16 | 11 | -32 | 0.724 | 49.1585 |
| chr17 | 21001950 | 21002037 | chr17 | 21002046 | 21002138 | demo_5 | 88.446 | - | - | USP22 | USP22 | 212 | -44.9 | 0.258 | 48.025 |
| chr8 | 39913918 | 39914010 | chr8 | 39914191 | 39914256 | demo_6 | 87.417 | + | + | IDO1 | IDO1 | 253 | -26.7 | 0.00756 | 53.000 |
It is important to select the correct option for recognition of the provided node names. Even if you inputs contains name.A and name.B, they won’t be parsed unless the Use provided node names is set to yes
.bedpe tab-formatted file without node (gene) names. In this case, the
genes are guessed my matching the coordinate to the Gencode v48 primary annotation.| chromA | startA | endA | chromB | startB | endB | name | score | strandA | strandB | n_reads | extra_two |
|---|---|---|---|---|---|---|---|---|---|---|---|
| chr6 | 85677790 | 85677887 | chr17 | 49339501 | 49339625 | demo2_1 | 86.747 | - | - | 575 | 0.888 |
| chr6 | 34245525 | 34245630 | chr12 | 110429261 | 110429362 | demo2_2 | 91.55 | + | - | 216 | 0.839 |
| chr9 | 19379527 | 19379620 | chr9 | 19379762 | 19379854 | demo2_3 | 92.659 | - | - | 185 | 1.055 |
| chr19 | 17862086 | 17862167 | chrX | 153662314 | 153662369 | demo2_4 | 88.625 | + | + | 11 | 0.806 |
| chr17 | 21001950 | 21002037 | chr17 | 21002046 | 21002138 | demo2_5 | 88.446 | - | - | 212 | 1.15 |
Following columns are recognised and could be used for filtering your data.
score is included in the BEDPE specificationn_readsEp_value<any name><any name>In case node names are not provided and for the cis RNA-RNA interactions, data are expected to be in GRCh38 (hg38) coordinates
The tab-menu on the left side displays the results of the import, indicating the sucess status and features that were recognised by the module.
Once uploaded, the same filtering and network controls as in other modules become available. Module starts when the Launch RRI browser button is hit
Once the import is sucessfull and the RRI browser has started, your input will be internally saved and could be passed to the multiple comparison module. Please not that uploaded files are not stored on the server and will be removed once the browser session has ended. The current maximum allowed amount of user-imported RRIs is 5000.
Two example files show-casing the are provided on the module page. Once they are re-uploaded, trans RRI network view and linear browser could be used similarly to the analyse single dataset module
RNA proximity ligation experiments, as well as the methods reliant on capturing disjoint RNA fragments bound by proteins yeild information on the RNA-RNA contacts. Contrary to the SHAPE- and DMS map RNA probing, these methods aim at identifying the interaction loci of single or multiple RNA rather than accessing the flexibility of RNA molecule though devising the reactivity scores. Very simplisticly, It coud be viewed as the RNA analog DNA-DNA contacts (Hi-C), although RNA duplex probing methods have considerably lower coverage and differ in most aspects from experiment to data handling and interepretation. For the more detailed information we refer the user to the technology review and here for the details on data processing method we use for the data hosted on InteRRact. The acession number and statistics on the filtered data could be found in the dataset statistics page.
Raw sequencing data are processed using a standardised bioinformatic workflow to detect, cluster, annotate and filter RNA–RNA duplex groups prior to visualisation.
Hosted datasets in InteRRact were collected from published RNA duplex probing experiments and pre-processed using a common workflow before loading into the server.
Raw sequencing reads were first cleaned with fastp to remove low-quality reads and adapters. For PARIS datasets, additional protocol-specific preprocessing was applied using the icSHAPE pipeline scripts, following the procedures used for the original protocol.
Reads were then aligned to the human reference genome (GRCh38) with STAR in chimeric alignment mode.
Following alignment settings were chosen to recover reags originating from RNA-RNA ligation.
--chimOutType Junctions
--chimOutJunctionFormat 1
--alignIntronMin 1
--alignIntronMax 10
--outSJfilterReads All
--chimSegmentMin 15
--chimMultimapNmax 10
--chimScoreDropMax 30
--chimScoreJunctionNonGTAG 0
The resulting Chimeric.out.Junction files were used as the input for downstream interaction calling.
Candidate interactions are clustered into duplex groups using DuplexDiscovereR. A duplex group represents a set of overlapping or closely spaced chimeric reads supporting the same RNA-RNA interaction.
This grouping reduces redundancy at the read level and provides a compact interaction unit for filtering, visualisation and comparison. For each duplex group, InteRRact stores a set of summary measures that can be used during downstream exploration, including:
For datasets with replicates, duplex groups detected in individual replicates of the same condition are further aggregated into condition-level duplex groups. This allows the server to display a more compact representation while preserving replicate support and the main confidence-related features.
InteRRact employs two statistical models for asessment the RRI confidence.
Detected duplex groups are annotated with:
Duplex groups mapping to simple repeats, low-complexity regions or high-mapping-signal regions were excluded from the main set hosted datasets. This filtering step reduces interactions that are more likely to reflect mapping artefacts or transcriptional noise than bona fide RNA-RNA duplexes. If you are interested in the RNA-RNA interactions that may occur with RNA transcribed from those problematic regions, you can disable pre-filtereing in the Advanced options sidebar pane (Visualise single dataset and Pairwise comparison modules).
Filtered duplex groups are aggregated at the gene level to construct trans RNA–RNA interaction networks.
Community detection is performed using a greedy modularity optimisation algorithm (Louvain). Gene Ontology (GO) over-representation analysis is performed per detected community.
For dataset comparison:
Community detection is performed on the pooled interaction graph.
RNA–RNA interaction datasets are inherently sparse and prone to experimental artefacts. Therefore, multiple orthogonal confidence metrics are retained and exposed in the interface:
These parameters allow users to apply principled filtering based on interaction robustness.
RNA–RNA interaction datasets are inherently sparse and prone to experimental and computational artefacts. Only a small fraction of sequencing reads are chimeric, crosslinking and ligation efficiencies vary, and bioinformatic processing can introduce false positives.
InteRRact exposes multiple confidence-related parameters to allow data filtering:
These metrics provide orthogonal evidence for duplex reliability and help assess robustness of detected interactions.
Citation is not avaliable yet
Legend explanation:
• Repeat Blacklist: Duplex groups mapping to the repetitive or low-complexity regions
• ENCODE Blacklist: Duplex groups mapping to the high-signal or problematic genomic regions defined by ENCODE.
• No Gene loci: Duplex groups not mapping to any annotated gene
• Pseudo trans repeat: Duplex groups with arms mapping to the identical repetitive element at different genomic loci. Mapping Artifact.
Coverage scores were computed with the following procedures.
• Coverage tracks were extracted inpedendently for each library and average value assigned duplex group
• Duplep group coverage within library was scaled by the size-factor calcualted with DeSeq2 to enhance compatibility when used in RRI comparisons.
• Coverage score was averaged across replicates when combining individual sample into per-condition datasets