1. White House. Executive order on the safe, secure, and trustworthy development and use of artificial intelligence, 2023.
2. Anthropic. Third-party testing as a key ingredient of AI policy, 2024.
3. California Legislature. SB-1047 safe and secure innovation for frontier artificial intelligence models act.
4. LAB-Bench: Measuring capabilities of language models for biology research. arXiv preprint, 2024.
5. The WMDP benchmark: Measuring and reducing malicious use with unlearning. arXiv preprint, 2024.