Andrea Siposova
GenAISafety
an archive of posts with this tag
Jul 14, 2025
Can reinforcement learning from human feedback be turned into an attack vector for AI?