Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This project has been in the works for about a year. The initial commit to the public repo was not really closely related to this project, it was part of the release of the Transformer debugger, and the repo was just reused for this release.


ha thank you Leo; i myself felt uneasy pointing out commit date based evidence and you just proved why.

mild followup question: any alpha to be gained from training the same SAEs on two different generations of GPT4, eg GPT4 on march 2023 vs june 2023 vintage, whatever is most architecturally comparable, and diffing them. what would be your priors on what you’d find?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: