CLS Token for ATS/EvoViT #13

JoakimHaurum · 2024-11-29T10:29:58Z

When you tested the ATS and EvoViT pruning methods, how exactly did you incorporate the CLS token?
As you mention, the CLS token is not "natural" for dense tasks, but since you use a DeiT backbone, one is available from there. Do you simply reuse the DeiT CLS token (even though it is not trained during the ViT-Adapter dense training), or do you initialize a new random token after the dense training?

kaikai23 · 2024-12-03T06:39:32Z

Hi, we initialize a random CLS token for these methods. Although there is no explicit supervision for the CLS token, we found that it can still effectively serve as the selector, as discussed in Appendix D of the paper.
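
For readers who want a concrete picture of a randomly initialized CLS token acting as a selector, here is a minimal sketch. It is not the code from this repository. It assumes a simplified setup in which the CLS token is used directly as the attention query (a real ViT block would first apply learned query/key projections), and it uses plain top-k selection on CLS attention as a stand-in for the more elaborate scoring and sampling used by ATS and EvoViT. The helper names (`init_cls_token`, `cls_attention_scores`, `select_tokens`) and the `std=0.02` default are hypothetical, chosen for illustration only.

```python
import math
import random


def init_cls_token(dim, std=0.02, seed=0):
    """Draw a random CLS token (hypothetical helper; std=0.02 is an assumed default)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, std) for _ in range(dim)]


def cls_attention_scores(cls_query, patch_keys):
    """Softmax over scaled dot products between the CLS query and each patch key."""
    scale = 1.0 / math.sqrt(len(cls_query))
    logits = [scale * sum(q * k for q, k in zip(cls_query, key)) for key in patch_keys]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def select_tokens(cls_query, patch_keys, keep):
    """Indices of the `keep` patches with the highest CLS attention, in original order."""
    scores = cls_attention_scores(cls_query, patch_keys)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:keep])


# Toy usage: 16 random patch keys of width 8, keep the 4 most attended patches.
rng = random.Random(1)
dim, num_patches = 8, 16
cls_token = init_cls_token(dim)
patch_keys = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_patches)]
kept = select_tokens(cls_token, patch_keys, keep=4)
```

The point of the sketch is only that the selection step needs an attention row from some query token; it says nothing about how well a random, unsupervised token ranks patches, which is the question Appendix D of the paper addresses.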
