-
Notifications
You must be signed in to change notification settings - Fork 70
Description
🐛 Bug
When loading my MIXCR .tsv files using read_repertoires() I keep getting the error
"Error in agg_receptors():
! Not all columns in the receptor schema present in the data: [cdr3]. Please double check and run again."
I have tried using cdr3_aa, v_call and Immunarch 0.9.1 names such as CDR3.aa and V.name. I always got the same error.
Upon trying with a MIXCR column name (e.g. CloneId) there are no errors.
Idata$annotations$ shows the names of MIXCR columns, not immunarch/Immundata columns.
When I use the repload -> from_imunarch() method I do not have this problem.
I would keep doing it this way, however when using this method my metadata is not integrated into idata$repertoire.
Code (basically the tutorial code)
md_file <- file.path(TSV_DIR, "metadata.txt")
md_table <- read_metadata(md_file)
tsv_files <- list.files(TSV_DIR, pattern = "\.tsv$", full.names = TRUE)
idata <- read_repertoires(path = tsv_files,
schema = c("cloneId"),
metadata = md_table,
preprocess = make_default_preprocessing("airr"),
repertoire_schema = "Donor")
Printing: idata$annotations
Results: annotations
A duckplyr data frame: 52 variables
cloneId readCount readFraction uniqueMoleculeCount uniqueMoleculeFraction
.......
47 more variables: targetSequences , targetQualities ,
allVHitsWithScore , allDHitsWithScore , allJHitsWithScore ,
allCHitsWithScore , allVAlignments , allDAlignments ,
allJAlignments , allCAlignments , nSeqFR1 ,
minQualFR1 , nSeqCDR1 , minQualCDR1 , nSeqFR2 ,
minQualFR2 , nSeqCDR2 , minQualCDR2 , nSeqFR3 , …
-> All of these are MIXCR column names
Thanks in advance!