You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# LONG NON-CODING GENES ANNOTATION WITH RESPECT TO THE CLOSEST PROTEIN-CODING GENE
2
2
3
-
The aim of the script is to
3
+
The aim of the [FEELnc_tpLevel2gnLevelClassification.R](https://github.com/tderrien/FEELnc/blob/master/scripts/FEELnc_tpLevel2gnLevelClassification.R) is to transform *transcript-level* classification from the FEELnc_classifier module to *gene-level* classification.
4
4
5
5
6
-
The following columns tags concern the FEELnc class annotation of the LNC (FEELn) with respect to the nearest protein coding gene (feelLncPcg):
6
+
The following columns tags concern the FEELnc class annotation of the LncRNA gene with respect to the nearest protein coding gene (feelLncPcg):
7
7
8
8
- feelLncPcgClassName
9
9
- feelLncPcgClassType
10
10
- feelLncPcgGnId
11
11
- feelLncPcgGnName
12
12
- feelLncPcgGnDist
13
13
14
-
# feelLncPcgClassName: Abbreviation of the FEELnc classification of the LNC with respect to the closest PCG
14
+
## feelLncPcgClassName:
15
+
16
+
Abbreviation of the FEELnc classification of the LNC with respect to the closest PCG
15
17
16
18
To transfer the FEELnc information from the **transcript level** to the **gene level**, an order of importance was decided.
17
19
@@ -22,14 +24,17 @@ The class names are composed of three parts:
22
24
- the second part (4 letters) concerns only the genic classes without subtype conflicts (see below), we add one of the three subtypes: Nested (Nest), Overlapping (Ovlp) or Containing (Cont)
23
25
- the third part (_n.n.n or _n.1.n) indicates that there are conflicts between annotation due to several PCGs related to the LNC locus.
24
26
25
-
Conflicts cases are of two types: the cases in which there are more than 1 annotation relative to one unique PCG (as indicated by the `n` in the middle and `1` at the end of `n.n.1`, and the case in which there more than 1 or more annotation relative to more than 1 PCG (`n.X.n` in the feelLncPcgClassType column). In these cases, we prioritized the annotation in the column « feelLncPcgClassName», which gives only 1 class per gen
27
+
Conflicts cases are of two types: the cases in which there are more than 1 annotation relative to one unique PCG (as indicated by the `n` in the middle and `1` at the end of `n.n.1`, and the case in which there more than 1 or more annotation relative to more than 1 PCG (`n.X.n` in the feelLncPcgClassType column).
28
+
In these cases, we prioritized the annotation in the column « feelLncPcgClassName», which gives only 1 class per gene.
26
29
- n.n.1 case: Genics have priority over intergenics (lncg > linc).
27
30
Among the genic, exonics have priority over intronics.
28
31
Among exonics and intronics, the subtypes nested / containing / overlapping have the same importance. They are kept if they do not produce conflicts and are removed if there are 2 or more subtypes.
29
32
- n.X.n case: Same order of priority as previously (n.n.1 case).
30
33
Concerning the intergenics, there can be annotation conflicts between the several PCGs: the LNC can be classified as lincDivg with one PCG and lincConv with another one PCG (see figure 1 for an example). We prioritize the classes as following: Divg > SS > Conv. Same-strand have priority over Conv because it could suggest an error in the modelization: the LNC could be a 5'-part or 3'part of the PCG. Between the same strand up and down (lincSSup and lincSSdw), we choose the closest. The third part of the class name of these genes is either `_n.n.n`or `_n.1.n`.
31
34
32
-
feelLncPcgClassType: gives information on three fields separated by a dot `.` (X1.X2.X3) about the classification done by FEELnc of the LNC transcript relatively to the closest PCG transcript (= LNC:PCG pair):
35
+
## feelLncPcgClassType:
36
+
37
+
gives information on three fields separated by a dot `.` (X1.X2.X3) about the classification done by FEELnc of the LNC transcript relatively to the closest PCG transcript (= LNC:PCG pair):
33
38
- X1: number of transcripts of the LNC gene: `1` if 1 transcript, `n` if more than one transcript,
34
39
- X2: number of feelnc class(es) associated to the LNC:PCG pair: `1` if 1 class, `n` if more than one class (the `unclassified` class does not count),
35
40
- X3: number of PCG gene(s) concerned by this (these) annotation(s): `1` if 1 PCG gene, `n` if more than one PCG gene.
@@ -40,12 +45,28 @@ n.n.n: the LNC has several transcripts, with different annotations associated to
40
45
n.1.n: the LNC has several transcripts, all with the same annotation but associated to different PCG.
41
46
unclassified: the LNC is either alone in a contig (beginning by AADN. or KQ), or no interactions were found within the 100 000 pb sliding window used by FEELnc.
42
47
43
-
48
+
49
+

50
+
44
51
Figure. Examples of configurations corresponding to a `n.n.n` type (top) or a `n.1.n` type (bottom)
45
52
46
-
feelLncPcgGnId: Unique identifier of the protein-coding gene relatively to which a LNC gene is classified by FEELnc
53
+
## feelLncPcgGnId:
54
+
55
+
Unique identifier of the protein-coding gene relatively to which a LNC gene is classified by FEELnc
56
+
57
+
## feelLncPcgGnName:
58
+
59
+
Name of the protein-coding gene relatively to which a LNC gene is classified by FEELnc
60
+
61
+
## feelLncPcgGnDist:
62
+
63
+
Distance (in bp), as calculated by FEELnc, between the protein-coding gene relatively to which a LNC gene is classified by FEELnc and the LNC gene.
64
+
65
+
66
+
# Contributions
47
67
48
-
feelLncPcgGnName: Name of the protein-coding gene relatively to which a LNC gene is classified by FEELnc
68
+
Many thanks to Frederic Jehl, [Kevin Muret](https://github.com/kevmuret) and Sandrine Lagarrigue.
49
69
50
-
feelLncPcgGnDist: Distance (in bp), as calculated by FEELnc, between the protein-coding gene relatively to which a LNC gene is classified by FEELnc and the LNC gene.
70
+
# Warnings
51
71
72
+
It has been tested on chicken lncRNAs annotated by FEELnc.
0 commit comments