10. Finetuning for LLMs

Summary

Finetuning for LLMs is the process of adapting a pretrained model to a specific task or domain. It improves the model's performance on that task and helps raise its accuracy. Finetuning can follow several training approaches, including supervised, unsupervised, and instruction-based learning. The techniques for updating the model's weights also vary: full finetuning, adapter-based tuning, and parameter-efficient finetuning.
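To make the three training approaches concrete, the sketch below shows what one record of each data type might look like, and how an instruction record could be turned into a prompt/target pair. The field names, the `PROMPT_TEMPLATE` string, and the `to_training_pair` helper are all hypothetical and chosen for illustration; real datasets and training libraries define their own formats.

```python
# Hypothetical prompt template (an assumption, not a standard format).
PROMPT_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n"

# Supervised finetuning: every input comes with a label.
supervised_data = [
    {"text": "The battery died after one day.", "label": "negative"},
]

# Unsupervised finetuning: raw domain text with no labels.
unsupervised_data = [
    "Lithium-ion cells degrade faster at high temperatures.",
]

# Instruction finetuning: an instruction paired with the desired response.
instruction_data = [
    {"instruction": "Translate 'hello' to French.", "response": "bonjour"},
]


def to_training_pair(example):
    """Format an instruction record as a (prompt, target) pair.

    The model would be trained to produce `target` when given `prompt`.
    """
    prompt = PROMPT_TEMPLATE.format(instruction=example["instruction"])
    return prompt, example["response"]


prompt, target = to_training_pair(instruction_data[0])
print(prompt + target)
```

Keeping the template in a single constant means the same formatting can be reused at inference time, which matters because a model finetuned on one prompt layout generally expects that layout later.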

Key Concepts

  • Finetuning : The process of adapting a pretrained model to a specific task or domain.

  • Supervised Fine-Tuning : Training the model on labeled data.

  • Unsupervised Fine-Tuning : Training the model on unlabeled data.

  • Instruction Fine-Tuning : Training the model on instructions, usually as pairs of an instruction and the desired response.

  • Full Fine-Tuning : Updating all of the model's weights.

  • Adapter-Based Tuning : Updating only part of the model's weights. Typically, small adapter modules are added to the model and trained while the pretrained weights stay frozen.

  • Parameter-Efficient Fine-Tuning : Updating only a small subset of weights, existing or newly added, while the rest stay frozen, which reduces compute and memory cost. Adapter-based tuning is one such approach.
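The difference between full finetuning and the parameter-efficient variants comes down to how many weights are trainable. The following is a minimal sketch in plain Python, assuming a single linear layer and a low-rank adapter in the style of LoRA. LoRA is used here only as one familiar example of a parameter-efficient method and is not named in the text above. The class name, dimensions, and rank are illustrative assumptions.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class LowRankAdaptedLinear:
    """A frozen weight W plus a trainable low-rank update B @ A."""

    def __init__(self, d_in, d_out, rank, seed=0):
        rng = random.Random(seed)
        # Stand-in for the pretrained weight; frozen under adapter tuning.
        self.W = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Adapter factors: A starts small and random, B starts at zero,
        # so the adapted layer initially behaves exactly like the original.
        self.A = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]

    def forward(self, x):
        base = matvec(self.W, x)                     # frozen path
        update = matvec(self.B, matvec(self.A, x))   # trainable path
        return [b + u for b, u in zip(base, update)]

    def full_finetuning_params(self):
        # Full finetuning would train every entry of W.
        return len(self.W) * len(self.W[0])

    def adapter_params(self):
        # Adapter tuning trains only A and B.
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])


layer = LowRankAdaptedLinear(d_in=256, d_out=256, rank=4)
full = layer.full_finetuning_params()   # 256 * 256 = 65536
adapter = layer.adapter_params()        # 4 * 256 + 256 * 4 = 2048
print(full, adapter, adapter / full)
```

With these illustrative sizes the adapter trains about 3% of the weights that full finetuning would update. The ratio shrinks further as the layer grows, because the adapter cost scales with `rank * (d_in + d_out)` rather than `d_in * d_out`.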
