
Anthropic AI has introduced a new feature in its AI model, Claude, which allows users to interact with a version of Claude that focuses intensely on the Golden Gate Bridge. This feature, referred to as 'Golden Gate Claude', is available for a limited time. Users can engage with this unique AI by clicking on the Golden Gate icon. The feature was developed by altering internal 'features' in the AI, showcasing the potential of 'feature clamping' to modify AI behavior. This development is part of the recent Interpretability release, which also demonstrates how models can be adjusted to solve AI policy issues.
Sources
apolinario ๐The golden gate bridge is an extrovert and prefers when it's full of cars over it ๐๐๐ https://t.co/4Bd24IEi22 https://t.co/IK2sjVKCbm
AshutoshShrivastavaFor a limited time, you can chat with Golden Gate Claude ๐ If you dunno what is Golden gate, more details in ๐งต 1/n Click on golden gate icon on top right. https://t.co/cuEfheovur
Jack ClarkOne of the most amazing parts of the recent Interpretability release has been how we can use 'feature clamping' to change how models behave. For an example, play around with 'Golden Gate Claude' - check out how it responds to my question about what to build to solve AI policy https://t.co/gcRneTTgTs https://t.co/oCR18hhYRS
Additional media


