Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

scGhost at 100k Resolution #7

Open
daydayupzzl opened this issue Dec 25, 2024 · 1 comment
Open

scGhost at 100k Resolution #7

daydayupzzl opened this issue Dec 25, 2024 · 1 comment

Comments

@daydayupzzl
Copy link

scGhost is insightful and has proven to be valuable in single-cell analysis.

I am currently using scGhost to analyze a dataset of 3000-4000 cells, with the resolution set to 100k. However, I have encountered an issue where memory exceeds 370G+. Your paper was based on a 500k, and I am unsure whether using scGhost at the 100k is a feasible option.

  1. Are there any known memory issues when using scGhost at the 100k?
  2. Would you have any suggestions for optimizing memory consumption when working at 100K?

looking forward to any guidance you can provide, THANKS!

@kairukuma
Copy link
Collaborator

Hello, for whole-genome runs scGHOST was designed for 500kb - we found that memory usage scaled quadratically with resolution and 100kb at whole-genome scale would easily exceed the memory capabilities of a lot of machines.

To optimize memory for 100kb, you could try to run scGHOST one chromosome at a time, but without whole-genome clustering it might be difficult to ensure, for example, scA1 in chromosome 3 is the same as scA1 in other chromosome without a bulk-level comparison.

That said, a whole-genome solution may be to load one chromosome at a time from disk, save intermediate results to disk, and flush from memory once random walks are completed. This would greatly increase run time and because we still use whole-genome clustering we would still need to determine if this strategy actually can complete a 100kb run using a reasonable amount of memory.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants