6. Deploymentยถ

Summaryยถ

LLM์˜ ๋ฐฐํฌ๋Š” ๋‹จ์ˆœํžˆ ๊ฐ•๋ ฅํ•œ ์–ธ์–ด ๋ชจ๋ธ์„ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์— ํ†ตํ•ฉํ•˜๋Š” ๊ฒƒ ์ด์ƒ์˜ ๋ณต์žกํ•œ ํ”„๋กœ์„ธ์Šค์ž…๋‹ˆ๋‹ค. ์ด๋Š” ๋‹ค์–‘ํ•œ ์‹œ์Šคํ…œ๊ณผ ๊ตฌ์„ฑ ์š”์†Œ๋ฅผ ์กฐ์œจํ•˜๋Š” ๊ฒƒ์„ ํฌํ•จํ•˜๋ฉฐ, ๊ฐ ๋ถ€๋ถ„์ด ์ค‘์š”ํ•œ ์—ญํ• ์„ ํ•ฉ๋‹ˆ๋‹ค. LLM ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์˜ ์•„ํ‚คํ…์ฒ˜๋Š” ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค, ํ”„๋กฌํ”„ํŠธ ํ…œํ”Œ๋ฆฟ, ์˜ค์ผ€์ŠคํŠธ๋ ˆ์ด์…˜ ๋ฐ ์›Œํฌํ”Œ๋กœ์šฐ ๊ด€๋ฆฌ, ์ธํ”„๋ผ ๋ฐ ํ™•์žฅ์„ฑ, ๋ชจ๋‹ˆํ„ฐ๋ง ๋ฐ ๋กœ๊น…, ๋ณด์•ˆ ๋ฐ ๊ทœ์ • ์ค€์ˆ˜, ๊ธฐ์กด ์‹œ์Šคํ…œ๊ณผ์˜ ํ†ตํ•ฉ ๋“ฑ ์—ฌ๋Ÿฌ ํ•ต์‹ฌ ์š”์†Œ๋กœ ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค.

Key Conceptsยถ

  • ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค : LLM์ด ์ƒ์„ฑํ•˜๋Š” ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ๋ฅผ ํšจ์œจ์ ์œผ๋กœ ์ €์žฅํ•˜๊ณ  ๊ฒ€์ƒ‰ํ•˜๋Š” ๋ฐ ํ•„์ˆ˜์ ์ธ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์ž…๋‹ˆ๋‹ค. ์ด๋Š” ์˜๋ฏธ ๊ฒ€์ƒ‰, ์ถ”์ฒœ ์‹œ์Šคํ…œ, ๊ฐœ์ธํ™”๋œ ์‚ฌ์šฉ์ž ๊ฒฝํ—˜ ๋“ฑ์—ไธๅฏๆฌ ํ•ฉ๋‹ˆ๋‹ค.

  • ํ”„๋กฌํ”„ํŠธ ํ…œํ”Œ๋ฆฟ : LLM๊ณผ์˜ ์ƒํ˜ธ์ž‘์šฉ์„ ํ‘œ์ค€ํ™”ํ•˜๋Š” ์‚ฌ์ „ ์ •์˜๋œ ๊ตฌ์กฐ๋กœ, ๋ชจ๋ธ์˜ ์‘๋‹ต์˜ ์ผ๊ด€์„ฑ๊ณผ ์‹ ๋ขฐ์„ฑ์„ ๋ณด์žฅํ•ฉ๋‹ˆ๋‹ค.

  • ์˜ค์ผ€์ŠคํŠธ๋ ˆ์ด์…˜ ๋ฐ ์›Œํฌํ”Œ๋กœ์šฐ ๊ด€๋ฆฌ : ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ, ๋ชจ๋ธ ์ถ”๋ก , ํ›„์ฒ˜๋ฆฌ ๋“ฑ ๋‹ค์–‘ํ•œ ์ž‘์—…์„ ์ž๋™ํ™”ํ•˜๊ณ  ์ŠคํŠธ๋ฆฌ๋ฐํ•˜๋Š” ๋„๊ตฌ์™€ ํ”„๋ ˆ์ž„์›Œํฌ์ž…๋‹ˆ๋‹ค. Apache Airflow๋‚˜ Kubernetes์™€ ๊ฐ™์€ ๋„๊ตฌ๊ฐ€ ์ด๋ฅผ ์ง€์›ํ•ฉ๋‹ˆ๋‹ค.

  • ์ธํ”„๋ผ ๋ฐ ํ™•์žฅ์„ฑ : LLM ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์„ ์ง€์›ํ•˜๋Š” ์ธํ”„๋ผ๊ฐ€ ๊ฐ•๋ ฅํ•˜๊ณ  ํ™•์žฅ ๊ฐ€๋Šฅํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. ํด๋ผ์šฐ๋“œ ์„œ๋น„์Šค, ํ•˜๋“œ์›จ์–ด ๊ฐ€์†๊ธฐ(GPU, TPU), ๋„คํŠธ์›Œํ‚น ๊ธฐ๋Šฅ ๋“ฑ์ด ํฌํ•จ๋ฉ๋‹ˆ๋‹ค.

  • ๋ชจ๋‹ˆํ„ฐ๋ง ๋ฐ ๋กœ๊น… : ์‹œ์Šคํ…œ ์„ฑ๋Šฅ, ์‚ฌ์šฉ ํŒจํ„ด, ์ž ์žฌ์ ์ธ ๋ฌธ์ œ์— ๋Œ€ํ•œ ์‹ค์‹œ๊ฐ„ ์ •๋ณด๋ฅผ ์ œ๊ณตํ•˜๋Š” ๋ชจ๋‹ˆํ„ฐ๋ง ๋„๊ตฌ์™€ ๋กœ๊น… ๋ฉ”์ปค๋‹ˆ์ฆ˜์ž…๋‹ˆ๋‹ค.

  • ๋ณด์•ˆ ๋ฐ ๊ทœ์ • ์ค€์ˆ˜ : LLM ๋ฐฐํฌ์—๋Š” ๋ฏผ๊ฐํ•œ ๋ฐ์ดํ„ฐ ๋ณดํ˜ธ, ์ ‘๊ทผ ์ œ์–ด, GDPR ๋˜๋Š” HIPAA์™€ ๊ฐ™์€ ๊ด€๋ จ ๊ทœ์ • ์ค€์ˆ˜๋ฅผ ํฌํ•จํ•˜๋Š” ๋ณด์•ˆ ์š”๊ตฌ ์‚ฌํ•ญ์ด ์žˆ์Šต๋‹ˆ๋‹ค.

  • ๊ธฐ์กด ์‹œ์Šคํ…œ๊ณผ์˜ ํ†ตํ•ฉ : LLM ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์ด ๊ธฐ์กด ์‹œ์Šคํ…œ๊ณผ ์›Œํฌํ”Œ๋กœ์šฐ์™€ ์›ํ™œํ•˜๊ฒŒ ํ†ตํ•ฉ๋˜์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

Referencesยถ

URL ์ด๋ฆ„

URL

DataCamp - Deploying LLM Applications with LangServe

https://www.datacamp.com/tutorial/deploying-llm-applications-with-langserve

Lakera - The Ultimate Guide to Deploying Large Language Models Safely

https://www.lakera.ai/blog/how-to-deploy-an-llm

Reddit - Tools for LLM deployment and distribution

https://www.reddit.com/r/mlops/comments/18p19lq/tools_for_llm_deployment_and_distribution/

HatchWorks - How to Deploy an LLM: More Control, Better Outputs

https://hatchworks.com/blog/gen-ai/how-to-deploy-llm/

Reddit - Building and Deploying LLM apps to production

https://www.reddit.com/r/LLMDevs/comments/137g88l/question_building_and_deploying_llm_apps_to/