Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inconsistent Anomaly Scores for Single Data Points vs. Batch in COPOD #604

Open
patselle opened this issue Aug 7, 2024 · 0 comments
Open

Comments

@patselle
Copy link

patselle commented Aug 7, 2024

Hello,

I have observed an inconsistency in the anomaly scores produced by the COPOD algorithm when evaluating single data points versus a batch of identical data points. After fitting the model, the scores for a single data point differ from the scores when the same data point is part of a larger batch.
Problem Description

When using the decision_function method, the anomaly score for an individual data point is not consistent with the score obtained when the same data point is included in a batch. This discrepancy seems to arise from how the algorithm combines training and test data for score calculation.
Questions

  1. Intended Use: Is COPOD designed to handle individual data point evaluations consistently after fitting, or is it primarily intended for batch evaluations?
  2. Implementation Details: Are there recommended practices or modifications to ensure consistent anomaly scores regardless of the batch size?
  3. Suggested Fix: Would it be advisable to adjust the decision_function to avoid combining training and test data?

Your guidance on how to address this issue would be greatly appreciated.

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant