Community

๐ŸŒˆ ์˜์นด AIํŒ€์˜ Applied Research Scientist๋Š” ์–ด๋–ค ์ผ์„ ํ•˜๋‚˜์š”? ๐ŸŽ ์ด ๊ธ€์„ ์ถ”์ฒœํ•˜๋Š” ์ด์œ  - ๋ฐ์ดํ„ฐ, AI ์—…๊ณ„์˜ ์ง๊ตฐ๋“ค์€ ๋ชจ๋‘ ๋‹ค ๋‹ค๋ฅด๊ฒŒ ์ •์˜ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค - 3~4

๐ŸŒˆ ์˜์นด AIํŒ€์˜ Applied Research Scientist๋Š” ์–ด๋–ค ์ผ์„ ํ•˜๋‚˜์š”? ๐ŸŽ ์ด ๊ธ€์„ ์ถ”์ฒœํ•˜๋Š” ์ด์œ  - ๋ฐ์ดํ„ฐ, AI ์—…๊ณ„์˜ ์ง๊ตฐ๋“ค์€ ๋ชจ๋‘ ๋‹ค ๋‹ค๋ฅด๊ฒŒ ์ •์˜ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค - 3~4๋…„ ์ „์—” ๋ฐ์ดํ„ฐ ์‚ฌ์ด์–ธํ‹ฐ์ŠคํŠธ๋ผ๋Š” ์ง๊ตฐ์ด ์—ฐ๊ตฌํ•˜๋Š” ์—ญํ• ๋กœ ๋˜์—ˆ์œผ๋‚˜ ์š”์ƒˆ๋Š” Research Scientist๋ผ๋Š” ์ด๋ฆ„์œผ๋กœ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค - ์‹ค๋ฆฌ์ฝ˜๋ฐธ๋ฆฌ์˜ ํšŒ์‚ฌ๋ฅผ ๋ณด๋ฉด Applied Research Scientist๋ผ๋Š” ์ด๋ฆ„์œผ๋กœ ์ฑ„์šฉ ๊ณต๊ณ ๋ฅผ ๋‚ด๋Š” ๊ฒฝ์šฐ๋„ ์กด์žฌํ•ฉ๋‹ˆ๋‹ค - ์˜์นด์—์„œ๋Š” Applied Research Scientist ์ง๊ตฐ์„ ์–ด๋–ป๊ฒŒ ์ •์˜ํ•˜๋Š”์ง€ ์ •๋ฆฌํ•œ ๊ธ€์ž…๋‹ˆ๋‹ค - ๋ฏธ๋ฌ˜ํ•˜๊ฒŒ ๋‹ค๋ฅธ ๊ด€์ ์„ ๊ฐ€์ง€๊ณ  ์žˆ๋Š”๋ฐ, ์ด ๊ธ€์„ ํ†ตํ•ด ์ง๊ตฐ์— ๋Œ€ํ•œ ์ดํ•ด๋ฅผ ์ž˜ ํ•  ์ˆ˜ ์žˆ์œผ๋ฉด ์ข‹๊ฒ ๋„ค์š”-! โœ ๋‚ด์šฉ ์š”์•ฝ - AI์— ๊ด€๋ จ๋œ ์ง๋ฌด์—๋Š” ๋ฌด์—‡์ด ์žˆ์„๊นŒ? - Research Scientist - Applied Research Scientist - Machine Learning Engineer - Data Scientist Research Scientist: ์–ด๋–ป๊ฒŒ SOTA๋ฅผ ๋›ฐ์–ด๋„˜์„ ์ˆ˜ ์žˆ์„๊นŒ? - Research Scientist๋Š” AI์— ๊ด€๋ จ๋œ ์›์ฒœ ๊ธฐ์ˆ ์„ ์—ฐ๊ตฌํ•˜๋Š” ํฌ์ง€์…˜ - Research Scientist๋Š” Public Benchmark Dataset์—์„œ ์ด์ „ ์—ฐ๊ตฌ๊ฐ€ ๋‹ฌ์„ฑํ•œ ์ตœ๊ณ  ์„ฑ๋Šฅ (State-of-the-Art; SOTA)๋ฅผ ๋„˜์–ด์„œ๋Š” ๊ธฐ๋ฒ•์„ ์—ฐ๊ตฌํ•˜๊ณ , ์ด์ „ SOTA์˜ ํ•œ๊ณ„์ ์„ ๋ณด์™„ํ•˜๋Š” ๊ธฐ๋ฒ•์„ ์—ฐ๊ตฌ - Research Scientist์˜ Research Questions - Gradient Descent๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ Learning Objective๋ฅผ ๋‹ฌ์„ฑํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, ์ธ๊ฐ„์ฒ˜๋Ÿผ Reasoning์„ ํ•˜๋Š” AI๋ฅผ ๋งŒ๋“ค ์ˆ˜๋Š” ์—†์„๊นŒ? - ์ด๋ฏธ์ง€๋ฅผ ์ดํ•ดํ•˜๋Š” ์—ฌ๋Ÿฌ Neural Networks Architecture๊ฐ€ ์žˆ๋Š”๋ฐ, ํŠน์ •ํ•œ ํŒจํ„ด์— bias ๋˜์ง€ ์•Š๊ณ  ๋” ์ธ๊ฐ„์ฒ˜๋Ÿผ ์ด๋ฏธ์ง€๋ฅผ ์ดํ•ดํ•˜๋Š”(ํ˜น์€ ์ธ๊ฐ„๋ณด๋‹ค ๋” ๋›ฐ์–ด๋‚˜๊ฒŒ) ๊ตฌ์กฐ๋Š” ์—†์„๊นŒ? - ์ตœ๊ทผ์— ์ œ์•ˆ๋œ Language Model (BERT, RoBERTa, S-BERT ๋“ฑ)๋ณด๋‹ค ๋” ์ธ๊ฐ„์ฒ˜๋Ÿผ (ํ˜น์€ ์ธ๊ฐ„๋ณด๋‹ค ๋” ๋›ฐ์–ด๋‚˜๊ฒŒ) ์ง€์‹์„ ์ดํ•ดํ•˜๋Š” ๋ชจ๋ธ์€ ์—†์„๊นŒ? Applied Research Scientist: ์šฐ๋ฆฌ ๋น„์ฆˆ๋‹ˆ์Šค ๋„๋ฉ”์ธ์˜ ๋ฌธ์ œ๋ฅผ ์–ด๋–ป๊ฒŒ ํ’€ ์ˆ˜ ์žˆ์„๊นŒ? - Applied Research Scientist๋Š” ํŠน์ • ๋น„์ฆˆ๋‹ˆ์Šค ๋„๋ฉ”์ธ์˜ ๋ฌธ์ œ๋ฅผ ํ’€ ์ˆ˜ ์žˆ๋Š” AI๋ฅผ ์—ฐ๊ตฌํ•˜๊ณ , ์—ฐ๊ตฌ๋œ ๋ชจ๋ธ์„ ๋ฐฐํฌํ•˜๋Š” ์ผ์„ ์ˆ˜ํ–‰ํ•˜๋Š” ํฌ์ง€์…˜ - Applied Research Scientist๋Š” Public Benchmark์™€ Real-world์˜ ์ฐจ์ด๋ฅผ ๊ณ ๋ฏผํ•˜๋ฉด์„œ, SOTA ๊ธฐ๋ฒ•์ด ์šฐ๋ฆฌ ๋„๋ฉ”์ธ์—์„œ ์™œ ์•ˆ๋˜๋Š”์ง€ (ํ˜น์€ ์™œ ์ž˜ ๋˜๋Š”์ง€)๋ฅผ ํŒŒ์•…ํ•˜๊ณ , ์ œ์•ˆ๋œ ์—ฌ๋Ÿฌ ๊ธฐ๋ฒ•๋“ค์„ ์ตœ์ ํ™”ํ•˜๊ฑฐ๋‚˜ ์ƒˆ๋กœ์šด ๊ธฐ๋ฒ•์„ ๋””์ž์ธํ•˜๊ธฐ๋„ ํ•ฉ๋‹ˆ๋‹ค. - Applied Research Scientist์˜ Research Questions - ๋…ผ๋ฌธ A๋Š” ImageNet, SUN, Place 365์—์„œ ๋†’์€ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ๋Š”๋ฐ, ์šฐ๋ฆฌ ๋„๋ฉ”์ธ์—์„œ๋Š” ์„ฑ๋Šฅ์ด ๋†’์ง€ ์•Š์€๋ฐ, ๊ทธ ์ด์œ ๊ฐ€ ๋ญ˜์ง€? ์šฐ๋ฆฌ ๋ฐ์ดํ„ฐ์™€ Public Benchmark์—๋Š” ์–ด๋–ค ์ฐจ์ด๊ฐ€ ์žˆ์–ด์„œ ๊ทธ๋Ÿด๊นŒ? - ์šฐ๋ฆฌ ๋„๋ฉ”์ธ์—์„œ ๋‹ค๋ฃจ๋Š” ๋ฐ์ดํ„ฐ๋Š” Public Benchmark๋“ค๊ณผ๋Š” ๋„ˆ๋ฌด ๋‹ค๋ฅธ๋ฐ, ์šฐ๋ฆฌ ๋„๋ฉ”์ธ์—์„œ ์ž˜ ๋™์ž‘ํ•˜๋Š” ์ƒˆ๋กœ์šด Neural Architecture๋ฅผ ๋””์ž์ธํ•ด ๋ณผ๊นŒ? - ๋ชจ๋ธ B๊ฐ€ ๋ฐฐํฌ๋˜์—ˆ์„ ๋•Œ ๋‚ฎ์€ Overhead๋ฅผ ๋‹ฌ์„ฑํ•˜๋ ค๋ฉด ์ฝ”๋“œ๋ฅผ ์–ด๋–ป๊ฒŒ ๋ฆฌํŒฉํ† ๋ง ํ•ด์•ผ ํ• ๊นŒ? ๋ชจ๋ธ์— ๋“ค์–ด๊ฐ€๋Š” Input์€ ์–ด๋–ป๊ฒŒ ์„ค๊ณ„ํ•˜๊ณ , Inference ๊ฒฐ๊ณผ๋Š” ์–ด๋–ค ํ…Œ์ด๋ธ”์— ์–ด๋–ป๊ฒŒ ์ ์žฌํ•˜์ง€? Machine Learning Engineer: AI ๋ชจ๋ธ์„ ์–ด๋–ป๊ฒŒ ํšจ๊ณผ์ ์œผ๋กœ ๊ตฌํ˜„ํ•˜๊ณ  ์„œ๋น„์Šคํ™” ์‹œํ‚ฌ๊นŒ? - Machine Learning Engineer๋Š” AI ๋ชจ๋ธ์˜ ๊ฐœ๋ฐœ๊ณผ ์„œ๋น„์Šค์— ๋” ๋ฌด๊ฒŒ๋ฅผ ๋‘๊ณ  ์žˆ๋Š” ํฌ์ง€์…˜ - Machine Learning Engineer์˜ Research Questions - ๋งค ์‹คํ—˜์— ์‚ฌ์šฉ๋œ ๋ฐ์ดํ„ฐ ์…‹๊ณผ ๋ชจ๋ธ์˜ ์•„ํ‚คํ…์ฒ˜, Weight ํŒŒ์ผ๋“ค์ด ๊ด€๋ฆฌ๊ฐ€ ์–ด๋ ค์šด๋ฐ, ์ด๋ฅผ ์ข€ ํšจ๊ณผ์ ์œผ๋กœ ๊ด€๋ฆฌํ•  ์ˆ˜ ์žˆ๋Š” ๋ฐฉ๋ฒ•์ด ์—†์„๊นŒ? - Pytorch๋กœ ์ž‘์„ฑ๋œ ๋ชจ๋ธ์ด ๋น„ํšจ์œจ์ ์ธ ๊ฒƒ ๊ฐ™์•„. ํ”„๋กœ๋•์…˜์— ๋“ค์–ด๊ฐ€๋ ค๋ฉด ๋” Overhead๋ฅผ ๋‚ฎ์ถฐ์•ผ ํ•  ๊ฒƒ ๊ฐ™์€๋ฐ, Tensorflow๋กœ ์ด๋ฅผ ๋ณ€ํ™˜ํ•ด ๋ณผ ์ˆ˜ ์žˆ์„๊นŒ? - GPU์˜ ๊ฐœ์ˆ˜๋Š” ๋งŽ์€๋ฐ ๊ทธ ์„ฑ๋Šฅ์„ 100% ์‚ฌ์šฉํ•˜์ง€๋Š” ๋ชปํ•˜๋„ค. ์ตœ๋Œ€ํ•œ ํšจ์œจ์ ์œผ๋กœ GPU ์ž์›์„ ์‚ฌ์šฉํ•  ์ˆ˜๋Š” ์—†์„๊นŒ? Data Scientist: ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์–ด๋–ค Action์„ ํ•  ์ˆ˜ ์žˆ์„๊นŒ? - Data Scientist๋Š” ๋น„์ฆˆ๋‹ˆ์Šค ๋„๋ฉ”์ธ์—์„œ ๋ฐœ์ƒํ•œ ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„ํ•˜๋Š” ํฌ์ง€์…˜ - Data Scientist์˜ Request Questions - (Business) ์ด๋ฒˆ ์ฃผ๋ง์— ๊ฐ•๋‚จ์—ญ 10๋ฒˆ ์ถœ๊ตฌ ์˜์นด ์กด์˜ ์˜ˆ์•ฝ ๊ฑด์€ ์–ผ๋งˆ๋‚˜ ๋ ๊นŒ? - (Business) 2022๋…„์— ์„œ์šธ์‹œ ์€ํ‰๊ตฌ์— ๋ช‡ ๋Œ€์˜ ์ฐจ๋Ÿ‰์„ ๋ฐฐ์ฐจํ•˜๋ฉด ๋Œ€ ๋‹น ๋งค์ถœ์ด ์–ผ๋งˆ๋‚˜ ๋  ๊ฒƒ์œผ๋กœ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ์„๊นŒ? - (Business) ๊ฐ€์žฅ ์ ์€ ๋งค์ถœ์ด ๋‚˜์˜ฌ ์ง€์—ญ์„ ๋ฐ์ดํ„ฐ์— ๊ธฐ๋ฐ˜ํ•ด ์ฐพ์•„์ฃผ๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์–ด๋–ป๊ฒŒ ๋งŒ๋“ค ์ˆ˜ ์žˆ์„๊นŒ? - (Product) ์˜์นด์˜ Funnel ์ค‘ ๊ฐ€์žฅ ์ „ํ™˜์œจ์ด ๋‚ฎ์€ ๋ถ€๋ถ„์€ ์–ด๋””์ผ๊นŒ? ๊ทธ ๋ถ€๋ถ„์„ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์–ด๋–ค Action์„ ํ•  ์ˆ˜ ์žˆ์„๊นŒ? ์–ด๋–ค ์‹คํ—˜์„ ์ง„ํ–‰ํ•˜๋ฉด ์ด์— ๋Œ€ํ•œ ๊ฒฐ๋ก ์„ ์–ป์„ ์ˆ˜ ์žˆ์„๊นŒ? - (Product) ์ƒˆ๋กœ์šด ๊ธฐ๋Šฅ ๊ฐœ๋ฐœ์„ ์‹œ์ž‘ํ•˜๋ ค๊ณ  ํ•˜๋Š”๋ฐ, ์ด ๊ธฐ๋Šฅ ๊ฐœ๋ฐœ์ด ์„ฑ๊ณตํ–ˆ๋‹ค๊ณ  ๋ณด๋ ค๋ฉด ์–ด๋–ค Metric์„ ๊ฒฐ์ •ํ•ด์•ผ ํ• ๊นŒ? ๊ทธ Metric์„ ๋ณด๊ธฐ ์œ„ํ•ด ์–ด๋–ค ์•ฑ, ์›น ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๊น…ํ•ด์•ผํ• ๊นŒ ์ƒˆ๋กœ์šด ๊ธฐ๋Šฅ์„ AB Test ํ•˜๋ ค๊ณ  ํ•  ๊ฒฝ์šฐ, ์–ด๋–ค ๋ฐฉ๋ฒ•์œผ๋กœ ์„ค๊ณ„ํ•  ์ˆ˜ ์žˆ์„๊นŒ? - (Product) ์ƒˆ๋กœ์šด ๊ธฐ๋Šฅ์ด ์ถœ์‹œ๋œ ์ดํ›„์— ์„ฑ๊ณต์ ์ธ์ง€ ํ™•์ธํ•˜๊ธฐ ์œ„ํ•ด ๋Œ€์‹œ๋ณด๋“œ๋Š” ์–ด๋–ป๊ฒŒ ๊ตฌ์„ฑํ•ด์•ผ ํ• ๊นŒ? ์˜์นด AIํŒ€์ด ํ•˜๋Š” ์ผ - Vision Domain - ์‚ฌ์ „์— ๊ฒฝ์ฐจ๋‚˜ ์ค‘ํ˜• ์ฐจ์— ์†ํ•˜์ง€ ์•Š๋Š”๋‹ค๊ณ  ํŒ๋‹จํ•˜๋ฉด์„œ (Out-of-Distribution Detection), ๊ธฐ์กด ๋ถ„๋ฅ˜๊ธฐ์˜ ์„ฑ๋Šฅ์„ ์œ ์ง€ํ•  ์ˆ˜๋Š” ์—†์„๊นŒ์š”? (Open-Set Recognition) - ์ž˜๋ชป๋œ ์˜ˆ์ธก์„ ์ˆ˜ํ–‰ํ–ˆ์„ ๋•Œ๋Š” less-confident ํ•˜๊ฒŒ ํ‹€๋ฆฌ๊ณ , ์˜ณ์€ ์˜ˆ์ธก์— ๋Œ€ํ•ด์„œ๋Š” more confident ํ•˜๊ฒŒ ๋งž์ถ”๋„๋ก ํ•  ์ˆ˜๋Š” ์—†์„๊นŒ์š”? (Calibration) ์‹ค๋ฌด์—์„œ๋Š” ๋ชจ๋ธ์˜ ์˜ˆ์ธก ๊ฒฐ๊ณผ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ, ๋ชจ๋ธ์ด ํ™•์‹คํ•˜๊ฒŒ ์˜ˆ์ธกํ•œ ๊ฑด๋“ค์„ ๋จผ์ € ๊ฒ€ํ† ํ•˜๊ณ ์ž ํ•˜๋Š”๋ฐ, ์ด ํ™•์‹ ์˜ ์ •๋„๋ฅผ ์–ด๋–ป๊ฒŒ ์ž˜ ์ธก์ •ํ•  ์ˆ˜ ์žˆ์„๊นŒ์š”? - NLP Domain - ๊ณ ๊ฐ์ด ํ•„์š”๋กœ ํ•˜๋Š” ์†”๋ฃจ์…˜์ด ๊ฐ๊ธฐ ๋‹ค๋ฅธ๋ฐ, ์ด ๋ฌธ์˜๋“ค์„ ํ•˜๋‚˜์˜ Intent๋กœ ๋ฌถ์„ ์ˆ˜ ์žˆ์„๊นŒ์š”? ํ˜น์€ ํ•œ ๋ฌธ์žฅ์— ์—ฌ๋Ÿฌ ๊ฐ€์ง€ ๋ฌธ์ œ๊ฐ€ ์„ž์—ฌ์žˆ์„ ๋•Œ๋Š” ์–ด๋–ป๊ฒŒ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์„๊นŒ์š”? (Multi-Labeled Sample) - ์‚ฌ์ „์— ์ •์˜ํ•ด๋‘” Intent์—์„œ ๋ฒ—์–ด๋‚œ ๋ฌธ์˜๋Š” ์–ด๋–ป๊ฒŒ ์‘๋‹ตํ•ด์•ผ ํ• ๊นŒ์š”? (Unknown Intent Detection) Vision ๋„๋ฉ”์ธ์—์„œ์™€ ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ, ๊ณ ๊ฐ์˜ ๋ฌธ์˜์— ๋Œ€ํ•ด ์˜ˆ์ธกํ•œ Intent์— ๋Œ€ํ•œ Confidence๋ฅผ ์–ด๋–ป๊ฒŒ ์ธก์ •ํ•  ์ˆ˜ ์žˆ์„๊นŒ์š”?

์•Œ๋ฆผ

์•Œ๋ฆผ์ด ์—†์Šต๋‹ˆ๋‹ค