Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inquiry on the Implementation and Support for vlm_datasets Calibration Data Class #292

Open
aptsunny opened this issue Jan 13, 2025 · 1 comment

Comments

@aptsunny
Copy link

Hello,

I am currently working with the vlm_datasets calibration data class as outlined in the documentation here. I noticed that there seems to be some aspects of the implementation that might not be fully developed or documented.

Here are the details I've gathered from the YAML configuration:

model:
  type: Llava
  path: model path
  tokenizer_mode: slow
  torch_dtype: auto

calib:
  name: vlm_datasets
  type: img_txt
  download: False
  path: datasets path
  n_samples: 32
  bs: 1
  seq_len: 512
  preproc: vlm_general
  padding: True
  seed: *seed

I have a few questions regarding the vlm_datasets:

  1. Is the vlm_datasets calibration data class fully implemented and supported?
  2. Are there any known limitations or issues with the current implementation?
  3. If it's not fully implemented, is there a timeline for when these features might be completed?

I would appreciate any guidance or clarification on this matter. If there is additional documentation or resources available, please let me know.

Thank you for your time and support.

Best regards

@gushiqiao
Copy link
Contributor

@chengtao-lv

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants