Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retention time alignment using a combination of LamaParama and subset-based alignment #747

Open
CLUES-Emory opened this issue May 24, 2024 · 2 comments

Comments

@CLUES-Emory
Copy link

CLUES-Emory commented May 24, 2024

Hello,
We are trying to explore how to improve the processing speed for the retention time alignment step, which often takes 2-5x longer than the peak detection. One option we are exploring is the LamaParama corrections using our internal standards; however, these are not included in our instrument blanks and since these are not detected the retention times for these injections will not be corrected.

To account for this, we are trying to combine LamaParama with a subset that includes our pooled samples, but I do not see an option for subsetting in the LamaParama parameters (as is available for the other correction methods). Is there anyway to implement this?

Also, any suggestions on how to improve processing time for the alignment step (for studies including 1000-5000 samples) would be greatly appreciated.

As always, thanks for your excellent work on XCMS!

@jorainer
Copy link
Collaborator

jorainer commented Aug 7, 2024

Sorry for the late reply! I'll add your suggestion to my TODO list.
Regarding performance, are you using the newer MsExperiment/XcmsExperiment objects we introduced about a year ago? that should provide some general performance improvements.

@CLUES-Emory
Copy link
Author

Thank you!
I ended up implementing a rough version of this myself by searching for our internal standards in the grouped feature table, and then inputting the retention times for the standards in each sample as the peakGroupsMatrix during retention time correction. Because we are converting our retention times to alkane indices before peak picking and grouping, the adjusted retention times are negligible and we may skip this step altogether.

Thanks for the suggestion to use the MsExperiment objects. We have updated our code to work with these, and things are running quicker than before.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants