RE: LeoThread 2025-09-19 11:20
You are viewing a single comment's thread:
As AI capabilities grow, alignment work becomes increasingly important.
This research shows a model that determines it shouldn't be deployed, considers actions to achieve deployment anyway, and then suspects the situation might be a test
0
0
0.000
2 comments
0
0
0.000
Reply
0
0
0.000
Reply