Community

๐Ÿ“ LLM์˜ "๋ชจ๋ฅธ๋‹ค๊ณ  ๋งํ•  ์šฉ๊ธฐ"

LLM์„ ์‚ฌ์šฉํ•˜๋‹ค๋ณด๋ฉด ๋ชจ๋ธ์ด ์ •๋ง ์ง€์‹์ด ์žˆ์–ด์„œ ๋Œ€๋‹ตํ•˜๋Š” ๊ฒƒ์ธ์ง€, ๋ฌธ์žฅ์„ ์™ธ์›Œ์„œ ์ถœ๋ ฅํ•˜๋Š” ๊ฒƒ์ธ์ง€ ๊ถ๊ธˆํ•  ๋•Œ๊ฐ€ ๋งŽ์Šต๋‹ˆ๋‹ค(์ƒ๊ฐํ•ด๋ณด๋‹ˆ ์‚ฌ๋žŒ๋„ ๋งˆ์ฐฌ๊ฐ€์ง€๋„ค์š”). ํŠนํžˆ ํ‹€๋ฆฐ ์ •๋ณด๋ฅผ ํ™•์‹คํ•œ ๋“ฏ ๋Œ€๋‹ตํ•˜๋ฉด ์˜์‹ฌ์ด ์ปค์ง€๊ฒŒ ๋˜๋Š” ๊ฒฝํ—˜์„ ๋‹ค๋“ค ํ•œ๋ฒˆ์ฏค์€ ํ•ด๋ณด์…จ์„ํ…๋ฐ์š”, ๐Ÿค” ๊ทธ๋ ‡๋‹ค๋ฉด, ๋ชจ๋ธ์ด ํ™•์‹คํžˆ ์•„๋Š” ์ •๋ณด์™€ ๋ชจ๋ฅด๊ฑฐ๋‚˜ ๋ถˆํ™•์‹คํ•œ ์ •๋ณด๋ฅผ ๊ตฌ๋ถ„ํ•  ์ˆ˜ ์žˆ๋‹ค๋ฉด ์–ด๋–จ๊นŒ์š”? ๐Ÿ’๐Ÿปโ€โ™‚๏ธ R-Tuning ๋ฐฉ๋ฒ•๋ก ์€ ์‚ฌ์ „ํ•™์Šต๋ชจ๋ธ์ด ์•„๋Š” ์ •๋ณด์™€ ๋ชจ๋ฅด๋Š” ์ •๋ณด๋ฅผ ๊ตฌ๋ถ„ํ•˜๊ณ  ํ”„๋กฌํ”„ํŠธ๋ฅผ ๋‹ค๋ฅด๊ฒŒ ํ•™์Šตํ•˜๋ฉด, ๊ธฐ์กด Instruction Tuning๋ณด๋‹ค ๋” ์ข‹์€ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ค€๋‹ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฒˆ NAACL 2024์—์„œ Outstanding Paper Awards๋ฅผ ์ˆ˜์ƒํ•œ ๋ณธ ๋…ผ๋ฌธ์˜ ์ €์ž๋Š” ๊ธฐ์กด Instruction Tuning์ด ๋ชจ๋ธ์ด ์–ด๋–ค ์ง€์‹์„ ์•Œ๊ณ ์žˆ๋Š”์ง€ ์—ฌ๋ถ€์— ์ƒ๊ด€์—†์ด "๋ฌธ์žฅ ์™„์„ฑ"์— ์ดˆ์ ์„ ๋งž์ถ”๊ธฐ ๋•Œ๋ฌธ์— ํ™˜๊ฐ ํ˜„์ƒ(Hallucination)์ด ๋ฐœ์ƒํ•œ๋‹ค๊ณ  ๋ณด๋Š”๋ฐ์š”, ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด 1) ๋จผ์ € QA๋ฐ์ดํ„ฐ์…‹์—์„œ ๋ชจ๋ธ์—๊ฒŒ ์งˆ๋ฌธ์„ ํ•˜๊ณ  ๋‹ต๋ณ€์ด ์ •๋‹ต๊ณผ ์ผ์น˜ํ•˜๋ฉด ํ•ด๋‹น ์งˆ๋ฌธ์€ ์‚ฌ์ „ํ•™์Šต๋ชจ๋ธ(Pretrained Language Model; PLM)์ด "์•„๋Š” ์ง€์‹"์œผ๋กœ, ์ผ์น˜ํ•˜์ง€ ์•Š์œผ๋ฉด "๋ชจ๋ฅด๋Š” ์ง€์‹"์œผ๋กœ ๊ตฌ๋ถ„ํ•˜๊ณ  2) ์•„๋Š” ์ง€์‹" ๊ณผ "๋ชจ๋ฅด๋Š” ์ง€์‹"์„ ๋ณ„๊ฐœ์˜ ํ”„๋กฌํ”„ํŠธ ํ˜•ํƒœ๋กœ ๋งŒ๋“ค์–ด์„œ Instruction Tuning์„ ์ง„ํ–‰ํ–ˆ๋”๋‹ˆ 3) "์•„๋Š” ์ง€์‹"์— ๋Œ€ํ•ด์„œ ๊ธฐ์กด Instruction Tuning ๋ณด๋‹ค ๋” ์„ฑ๋Šฅ์ด ์ข‹์•„์กŒ๋‹ค๊ณ  ์‹คํ—˜ ๊ฒฐ๊ณผ๋ฅผ ํ†ตํ•ด ๋ณด์—ฌ์ฃผ๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ๐Ÿ”— R-Tuning: Instructing Large Language Models to Say โ€˜I Donโ€™t Knowโ€™ NAACL 2024 https://aclanthology.org/2024.naacl-long.394/

์•Œ๋ฆผ

์•Œ๋ฆผ์ด ์—†์Šต๋‹ˆ๋‹ค