Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedbac…
페이지 정보
본문
Diffusion models have recently shown remarkable success in high-quality image generation. Sometimes, however, a pre-trained diffusion model exhibits partial misalignment in the sense that the model can generate good images, but it sometimes outputs undesirable images. If so, we simply need to prevent the generation of the bad images, and we call this task censoring. In this work, we present censored generation with a pre-trained diffusion model using a reward model trained on minimal human feedback. We show that censoring can be accomplished with extreme human feedback efficiency and that labels generated with a mere few minutes of human feedback are sufficient. Code available at: https://github.com/tetrzim/diffusion-human-feedback.
- 이전글Design of a Metal 3D Printed Double-Ridged Horn Antenna With Stable Gain and Symmetric Radiation Pattern Over a Wide Frequency Range 24.08.01
- 다음글Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation 24.08.01
댓글목록
등록된 댓글이 없습니다.