When Models Examine Themselves: Vocabulary-Activation Correspondence in LLMs

(zenodo.org)

3 points | by patternmatcher 12 hours ago

1 comment