Commit b025c47

Update typo and add FIG
1 parent 55fb363 commit b025c47

1 file changed

+30
-9
lines changed

GeneClassification.md

Lines changed: 30 additions & 9 deletions
@@ -1,17 +1,19 @@
11
# LONG NON-CODING GENES ANNOTATION WITH RESPECT TO THE CLOSEST PROTEIN-CODING GENE
22

3-
The aim of the script is to
3+
The [FEELnc_tpLevel2gnLevelClassification.R](https://github.com/tderrien/FEELnc/blob/master/scripts/FEELnc_tpLevel2gnLevelClassification.R) script transforms the *transcript-level* classification produced by the FEELnc_classifier module into a *gene-level* classification.
44

55

6-
The following columns tags concern the FEELnc class annotation of the LNC (FEELn) with respect to the nearest protein coding gene (feelLncPcg):
6+
The following column tags concern the FEELnc class annotation of the lncRNA gene with respect to the nearest protein-coding gene (feelLncPcg):
77

88
- feelLncPcgClassName
99
- feelLncPcgClassType
1010
- feelLncPcgGnId
1111
- feelLncPcgGnName
1212
- feelLncPcgGnDist
1313

14-
# feelLncPcgClassName: Abbreviation of the FEELnc classification of the LNC with respect to the closest PCG
14+
## feelLncPcgClassName:
15+
16+
Abbreviation of the FEELnc classification of the LNC with respect to the closest PCG
1517

1618
To transfer the FEELnc information from the **transcript level** to the **gene level**, an order of importance was decided.
1719

@@ -22,14 +24,17 @@ The class names are composed of three parts:
2224
- the second part (4 letters) concerns only the genic classes without subtype conflicts (see below), we add one of the three subtypes: Nested (Nest), Overlapping (Ovlp) or Containing (Cont)
2325
- the third part (_n.n.n or _n.1.n) indicates that there are conflicts between annotations because several PCGs are related to the LNC locus.
2426

25-
Conflicts cases are of two types: the cases in which there are more than 1 annotation relative to one unique PCG (as indicated by the `n` in the middle and `1` at the end of `n.n.1`, and the case in which there more than 1 or more annotation relative to more than 1 PCG (`n.X.n` in the feelLncPcgClassType column). In these cases, we prioritized the annotation in the column « feelLncPcgClassName», which gives only 1 class per gen
27+
Conflict cases are of two types: cases in which there is more than one annotation relative to a single PCG (as indicated by the `n` in the middle and the `1` at the end of `n.n.1`), and cases in which there are one or more annotations relative to more than one PCG (`n.X.n` in the feelLncPcgClassType column).
28+
In these cases, we prioritized the annotation in the feelLncPcgClassName column, which gives only one class per gene.
2629
- n.n.1 case: Genics have priority over intergenics (lncg > linc).
2730
Among the genic, exonics have priority over intronics.
2831
Among exonics and intronics, the subtypes nested / containing / overlapping have the same importance. They are kept if they do not produce conflicts and are removed if there are 2 or more subtypes.
2932
- n.X.n case: Same order of priority as previously (n.n.1 case).
3033
Concerning the intergenics, there can be annotation conflicts between several PCGs: the LNC can be classified as lincDivg with one PCG and lincConv with another PCG (see figure 1 for an example). We prioritize the classes as follows: Divg > SS > Conv. Same-strand classes have priority over Conv because they could indicate an error in the gene model: the LNC could be a 5' or 3' part of the PCG. Between the same-strand up and down classes (lincSSup and lincSSdw), we choose the closest. The third part of the class name of these genes is either `_n.n.n` or `_n.1.n`.
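
The priority rules above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the code of the R script: the `Annotation` record, its field names and the `resolve` function are hypothetical, and each annotation is assumed to carry its distance to the PCG in bp.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Annotation:
    """Hypothetical record for one transcript-level annotation of an LNC gene."""
    location: str                  # "genic" or "intergenic"
    detail: str                    # genic: "exonic"/"intronic"; intergenic: "Divg"/"SSup"/"SSdw"/"Conv"
    subtype: Optional[str] = None  # genic only: "Nest", "Ovlp" or "Cont"
    distance: int = 0              # distance to the PCG, in bp


# Divg > SS > Conv; SSup and SSdw share a rank and are separated by distance.
INTERGENIC_RANK = {"Divg": 0, "SSup": 1, "SSdw": 1, "Conv": 2}


def resolve(annotations: List[Annotation]) -> Tuple[str, str, Optional[str]]:
    """Return (location, detail, subtype) for the single class kept per gene."""
    if not annotations:
        raise ValueError("at least one annotation is required")

    genic = [a for a in annotations if a.location == "genic"]
    if genic:
        # Genics win over intergenics; exonics win over intronics.
        exonic = [a for a in genic if a.detail == "exonic"]
        kept = exonic if exonic else genic
        detail = "exonic" if exonic else "intronic"
        # A subtype is kept only when it is unambiguous.
        subtypes = {a.subtype for a in kept if a.subtype is not None}
        subtype = next(iter(subtypes)) if len(subtypes) == 1 else None
        return ("genic", detail, subtype)

    # Only intergenic annotations remain: best rank first, then the closest PCG.
    best = min(annotations, key=lambda a: (INTERGENIC_RANK[a.detail], a.distance))
    return ("intergenic", best.detail, None)
```

For example, a gene annotated as Conv with one PCG and Divg with another resolves to Divg, whatever the distances, while a gene with both SSup and SSdw annotations keeps the one with the smaller distance.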
3134

32-
feelLncPcgClassType: gives information on three fields separated by a dot `.` (X1.X2.X3) about the classification done by FEELnc of the LNC transcript relatively to the closest PCG transcript (= LNC:PCG pair):
35+
## feelLncPcgClassType:
36+
37+
Gives information in three fields separated by a dot `.` (X1.X2.X3) about the FEELnc classification of the LNC transcript relative to the closest PCG transcript (= LNC:PCG pair):
3338
- X1: number of transcripts of the LNC gene: `1` if 1 transcript, `n` if more than one transcript,
3439
- X2: number of FEELnc class(es) associated with the LNC:PCG pair: `1` if 1 class, `n` if more than one class (the `unclassified` class does not count),
3540
- X3: number of PCG gene(s) concerned by this (these) annotation(s): `1` if 1 PCG gene, `n` if more than one PCG gene.
@@ -40,12 +45,28 @@ n.n.n: the LNC has several transcripts, with different annotations associated to
4045
n.1.n: the LNC has several transcripts, all with the same annotation but associated with different PCGs.
4146
unclassified: the LNC is either alone in a contig (beginning with AADN. or KQ), or no interactions were found within the 100,000 bp sliding window used by FEELnc.
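
As an illustration of how the X1.X2.X3 code could be derived for one LNC gene, here is a minimal Python sketch. It is not taken from the R script: the function name `class_type` and the input layout (one `(transcript_id, feelnc_class, pcg_gene_id)` tuple per transcript-level annotation) are assumptions made for the example.

```python
from typing import Iterable, Optional, Tuple

# Hypothetical input row: (LNC transcript id, FEELnc class, PCG gene id)
Record = Tuple[str, str, Optional[str]]


def class_type(records: Iterable[Record]) -> str:
    """Build the X1.X2.X3 code for one LNC gene from its transcript-level rows."""
    records = list(records)
    transcripts = {tx for tx, _, _ in records}
    # The `unclassified` class does not count towards X2 or X3.
    classified = [(cls, pcg) for _, cls, pcg in records if cls != "unclassified"]
    if not classified:
        return "unclassified"
    classes = {cls for cls, _ in classified}
    pcgs = {pcg for _, pcg in classified}

    def flag(items: set) -> str:
        # "1" for exactly one distinct value, "n" for more than one.
        return "1" if len(items) == 1 else "n"

    return ".".join(flag(s) for s in (transcripts, classes, pcgs))
```

With this sketch, two transcripts carrying the same class against two different PCGs give `n.1.n`, and two transcripts carrying different classes against two different PCGs give `n.n.n`.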
4247

43-
48+
49+
![Image of FEELnc_class_conflict](http://tools.genouest.org/data/tderrien/cnrs_umr6290/FEELnc_classGn.png)
50+
4451
Figure. Examples of configurations corresponding to a `n.n.n` type (top) or a `n.1.n` type (bottom)
4552

46-
feelLncPcgGnId: Unique identifier of the protein-coding gene relatively to which a LNC gene is classified by FEELnc
53+
## feelLncPcgGnId:
54+
55+
Unique identifier of the protein-coding gene relative to which an LNC gene is classified by FEELnc
56+
57+
## feelLncPcgGnName:
58+
59+
Name of the protein-coding gene relative to which an LNC gene is classified by FEELnc
60+
61+
## feelLncPcgGnDist:
62+
63+
Distance (in bp), as calculated by FEELnc, between the LNC gene and the protein-coding gene relative to which it is classified by FEELnc.
64+
65+
66+
# Contributions
4767

48-
feelLncPcgGnName: Name of the protein-coding gene relatively to which a LNC gene is classified by FEELnc
68+
Many thanks to Frederic Jehl, [Kevin Muret](https://github.com/kevmuret) and Sandrine Lagarrigue.
4969

50-
feelLncPcgGnDist: Distance (in bp), as calculated by FEELnc, between the protein-coding gene relatively to which a LNC gene is classified by FEELnc and the LNC gene.
70+
# Warnings
5171

72+
The script has been tested on chicken lncRNAs annotated by FEELnc.
