Back to Browse

Stepwise Alignment for Constrained Language Model Policy Optimization

865 views
Dec 24, 2024
26:09

タイトル:Stepwise Alignment for Constrained Language Model Policy Optimization スピーカー: 和地瞭良 / Akifumi Wachi (LINEヤフー株式会社) トーク概要: https://nlp-colloquium-jp.github.io/schedule/2024-12-18_akifumi-wachi/ 関連論文: - https://arxiv.org/abs/2404.11049 NLPコロキウムについてはこちら: https://nlp-colloquium-jp.github.io/ 再生リスト (2024年): https://www.youtube.com/playlist?list=PLu1WBmyPcyUNraWKALWbCPdeFuab8msH2

Download

0 formats

No download links available.

Stepwise Alignment for Constrained Language Model Policy Optimization | NatokHD