Andrea Siposova

about
blog
publications
projects
repositories
cv
submenus
publications
projects
blog

GenAISafety

an archive of posts with this tag

Jul 14, 2025	Can reinforcement learning from human feedback be turned into an attack vector for AI?

© Copyright 2025 Andrea Siposova. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.