KaiqiangSong
KaiqiangSong
Home
Experience
Publications
Contact
Light
Dark
Automatic
rlhf
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4
Human preference judgments are pivotal in guiding large language models (LLMs) to produce outputs that align with human values. Human …
Yebowen Hu
,
Kaiqiang Song
,
Sangwoo Cho
,
Xiaoyang Wang
,
Hassan Foroosh
,
Fei Liu
PDF
Cite
Cite
×