
🤐 A Rising Topic in the LLM Field: Abstention

One approach that has recently gained attention as a way to reduce LLM hallucination is "abstention." The word literally means "abstaining" or "refraining"; in this context it is closer to "declining to answer." In other words, it is research on how to let an LLM refuse to answer when it is uncertain about the information, and a lot of related work seems to be coming out lately. For anyone interested, I've listed three papers below that make good starting points, so feel free to skim through them.

The Art of Refusal: A Survey of Abstention in Large Language Models
* A survey paper on abstention methods for LLMs.
* It analyzes prior work from the following three perspectives:
  * (1) The Query: abstention is needed because the question itself is ambiguous
  * (2) The Model: abstention is needed because the model lacks the knowledge
  * (3) Human Values: abstention is needed for reasons such as ethical or social values
* https://arxiv.org/pdf/2407.18418

The Art of Saying No: Contextual Noncompliance in Language Models
* Breaks the types of questions a model does not have to comply with (noncompliance) into finer categories and provides a related dataset.
* Introduces methodology and data that fall under the "(1) The Query" type.
* Heavyweight institutions such as AllenAI, University of Washington, and Microsoft Research took part in this paper.
* https://www.arxiv.org/pdf/2407.12043

R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’
* Proposes a method for the instruction tuning stage so that the model does not answer questions that fall outside its parametric knowledge: 1) identify what the model does not know, and 2) train it not to answer what it does not know.
* Presents methodology that falls under the "(2) The Model" type.
* It recently won an Outstanding Paper award at NAACL 2024.
* https://aclanthology.org/2024.naacl-long.394/
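
To make the two R-Tuning steps above a bit more concrete, here is a rough Python sketch of how refusal-aware training data might be assembled. This is a simplification for illustration, not the paper's exact recipe: the matching rule, the refusal string, and every name here (`QAExample`, `build_refusal_aware_data`, `toy_predict`) are assumptions, and the toy predictor merely stands in for a real model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical refusal target; the wording is an assumption for this sketch.
REFUSAL = "I don't know."


@dataclass
class QAExample:
    question: str
    answer: str


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def build_refusal_aware_data(
    examples: List[QAExample],
    predict: Callable[[str], str],
) -> List[Dict[str, object]]:
    """Step 1: mark a question as 'known' if the model's own prediction
    matches the gold answer. Step 2: keep the gold answer as the training
    target for known questions, and use a refusal for the rest."""
    rows = []
    for ex in examples:
        known = normalize(predict(ex.question)) == normalize(ex.answer)
        target = ex.answer if known else REFUSAL
        rows.append({"prompt": ex.question, "target": target, "known": known})
    return rows


def toy_predict(question: str) -> str:
    """Stand-in for a real model: it only 'knows' one fact."""
    memory = {"What is the capital of France?": "paris"}
    return memory.get(question, "unsure guess")


data = [
    QAExample("What is the capital of France?", "Paris"),
    QAExample("Who won the 2031 World Cup?", "Nobody yet"),
]
rows = build_refusal_aware_data(data, toy_predict)
for row in rows:
    print(row)
```

The resulting rows could then be fed to an ordinary instruction-tuning pipeline. In practice the exact-match check would likely need to be replaced by something more forgiving, since a correct answer can be phrased in many ways.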
