[QUESTION] #140
Replies: 18 comments 14 replies
-
It’s from immcantation’s CreateGermlines.py |
Beta Was this translation helpful? Give feedback.
-
Thank you Kelvin. But what database is used for this purpose? Sorry if that is a basic question. |
Beta Was this translation helpful? Give feedback.
-
Hi Kelvin, Still I have some difficulties to run TCR with dandelion and generate the network. I could generate the file "filtered_contig_dandelion.tsv" , and could run: But got error when generating the network: Is there any command that I a missing when running this code? I need the TCR network. Thank you, |
Beta Was this translation helpful? Give feedback.
-
Related to my previous question, the error I get is this:KeyError Traceback (most recent call last) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc() /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/_libs/index.pyx in pandas._libs.index.IndexEngine.get_loc() pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item() pandas/_libs/hashtable_class_helper.pxi in pandas._libs.hashtable.PyObjectHashTable.get_item() KeyError: 'clone_id' The above exception was the direct cause of the following exception: KeyError Traceback (most recent call last) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/dandelion/tools/_network.py in generate_network(self, key, clone_key, min_size, downsample, verbose, **kwargs) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexing.py in getitem(self, key) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexing.py in _getitem_tuple(self, tup) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexing.py in _getitem_lowerdim(self, tup) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexing.py in getitem(self, key) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexing.py in _getitem_axis(self, key, axis) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexing.py in _get_label(self, label, axis) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/generic.py in xs(self, key, axis, level, drop_level) /opt/anaconda3/envs/pathml/lib/python3.8/site-packages/pandas/core/indexes/base.py in get_loc(self, key, method, tolerance) KeyError: 'clone_id' |
Beta Was this translation helpful? Give feedback.
-
Can it be the reason that the "clone_id" are unassigned? |
Beta Was this translation helpful? Give feedback.
-
i just ran a complete workflow with the reannotated TCR from the tutorial and there's no issues. I suspect you have empty values in your 'clone_id' column in the vdj data table, which shouldn't be the case. There shouldn't be any unassigned clones if you ran |
Beta Was this translation helpful? Give feedback.
-
Sure. Thank Kelvin. I though the output name is "dandelion...tsv". Does that mean something is wrong? |
Beta Was this translation helpful? Give feedback.
-
You said "Yup", that means the "filtered_contig_igblast_db-pass.tsv" is correct name? |
Beta Was this translation helpful? Give feedback.
-
So let me ask you about my cell ranger then: my cell ranger result contains three folders; one for BCR, one for TCR and one for GEX. |
Beta Was this translation helpful? Give feedback.
-
Can I ask what command I should use? |
Beta Was this translation helpful? Give feedback.
-
Thank you |
Beta Was this translation helpful? Give feedback.
-
Thank you so much Kelvin.
The question I should find the answer is: I have a network of clones, that
one of the clone is so large. That means all of the cells have the same
v_call and j_call and the aligned sequence is equal for all of them.
*Question is: are these cells in this large clone germline ? or mutated?*
For this purpose I am using the "aligned_sequence" and "aligned_germline"
columns from dandelion file.
Are these correct columns to find the answer to this question?
And is it true that I calculated the lv distance of "aligned_sequence" and
"aligned_germline" for those cells in the large clone to say if they are
gremlin? or mutated?
And if the distance is zero I say they are germline.
Would you please help me to find the answer of this question?
Thanks,
Sara
…On Fri, Apr 1, 2022 at 3:18 PM Zewen Kelvin Tuong ***@***.***> wrote:
The sequences are reconstructed from
IMGT database as per described at
https://changeo.readthedocs.io/en/stable/examples/germlines.html
On the immcantation webpage, they also describe how the database is
downloaded.
—
Reply to this email directly, view it on GitHub
<#140 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVVJONV7GJY7HOBHUENXD3TVC5DYVANCNFSM5SJQAYQA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Thank you so much Kelvin!
I also have another question: do you have any free access BCR fastq file
(R1 and R2) of the healthy control sample?
Thank you,
Sara
…On Tue, Oct 11, 2022 at 5:44 AM Zewen Kelvin Tuong ***@***.***> wrote:
The above sounds correct. You are choosing the right columns.
—
Reply to this email directly, view it on GitHub
<#140 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVVJONR2O76JF4C2QLHMAKDWCUZIBANCNFSM5SJQAYQA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
I have another question: would you please provide some details about the
main strategy for defining the BCR clones? Is is based on
levenshtein distance on CDR3?
Appreciate it.
Thanks,
Sara
…On Thu, Oct 27, 2022 at 10:20 PM Sara Moien ***@***.***> wrote:
Thank you so much Kelvin!
I also have another question: do you have any free access BCR fastq file
(R1 and R2) of the healthy control sample?
Thank you,
Sara
On Tue, Oct 11, 2022 at 5:44 AM Zewen Kelvin Tuong <
***@***.***> wrote:
> The above sounds correct. You are choosing the right columns.
>
> —
> Reply to this email directly, view it on GitHub
> <#140 (reply in thread)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AVVJONR2O76JF4C2QLHMAKDWCUZIBANCNFSM5SJQAYQA>
> .
> You are receiving this because you authored the thread.Message ID:
> ***@***.***>
>
|
Beta Was this translation helpful? Give feedback.
-
Thank Kelvin!
Also, wanted to ask if you have any available healthy control BCR fastq
file, so that I download it and use it as negative control?
I know that you have processed files for healthy donors in your Covid
paper. But I need fastq files.
I appreciate it.
Thanks,
Sara
…On Mon, Oct 31, 2022 at 4:43 PM Zewen Kelvin Tuong ***@***.***> wrote:
I have another question: would you please provide some details about the
main strategy for defining the BCR clones? Is is based on
levenshtein distance on CDR3?
Hi Sara, as stated in the documentation
<https://sc-dandelion.readthedocs.io/en/latest/notebooks/3_dandelion_findingclones-10x_data.html>
:
Clone definition is based on the following criterion:
I. Identical V- and J-gene usage in the VDJ chain (IGH/TRB/TRD).
II. Identical CDR3 junctional/CDR3 sequence length in the VDJ chain.
III. VDJ chain junctional/CDR3 sequences attains a minimum of % sequence similarity, based on hamming distance. The similarity cut-off is tunable (default is 85%; change to 100% if analyzing TCR data).
IV. VJ chain (IGK/IGL/TRA/TRG) usage. If cells within clones use different VJ chains, the clone will be splitted following the same conditions for VDJ chains in (1-3) as above.
So it's *Hamming distance* of the CDR3 *amino acid sequence*
—
Reply to this email directly, view it on GitHub
<#140 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVVJONWPVNYHYY74XOO6ZHTWGAVQLANCNFSM5SJQAYQA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Answered in #213 and the gist is that 1) i can't share it because of the data access requirements issues and 2) I'm not the custodian of the raw fastqs so I don't have to them anyway. Best is to approach the respective data access committees and ask for their permission to obtain the files. |
Beta Was this translation helpful? Give feedback.
-
Thanks
…On Mon, Oct 31, 2022 at 4:55 PM Zewen Kelvin Tuong ***@***.***> wrote:
Also, wanted to ask if you have any available healthy control BCR fastq
file, so that I download it and use it as negative control?
I know that you have processed files for healthy donors in your Covid
paper. But I need fastq files.
Answered in #213 <#213> and
the gist is that 1) i can't share it because of the data access
requirements issues and 2) I'm not the custodian of the raw fastqs so I
don't have to them anyway. Best is to approach the respective data access
committees and ask for their permission to obtain the files.
—
Reply to this email directly, view it on GitHub
<#140 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVVJONRW36AY2WIEOAO5NNLWGAW5DANCNFSM5SJQAYQA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Hi Kelvin,
I have a question about the column "germline_dandelion" in the file "filtered_contig_igblast_db-pass_genotyped.tsv".
How this column is genrated?
Thank you,
Sara
Beta Was this translation helpful? Give feedback.
All reactions