Skip to content

๐ŸŒƒGenerative AI Assistant service for k-comics(webtoon) artists๐ŸŒ†

Notifications You must be signed in to change notification settings

jh7316/AI_Comics_Assistant

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

์›นํˆฐ ์ž‘๊ฐ€๋“ค์„ ์œ„ํ•œ ๊ทธ๋ฆผ ์ƒ์„ฑ ์„œ๋น„์Šค 'Oh My Assistant'

Oh My Assistant๋Š” ์›นํˆฐ ๋ฐ ์ผ๋Ÿฌ์ŠคํŠธ ์ž‘๊ฐ€๋“ค์„ ๋•๊ธฐ ์œ„ํ•œ ์ƒ์„ฑํ˜• AI ์„œ๋น„์Šค๋กœ, ์ž‘๊ฐ€ ๊ฐœ๊ฐœ์ธ์˜ ๊ทธ๋ฆผ์ฒด๋ฅผ ํ•™์Šตํ•ด ์‹ค์‚ฌ ์ด๋ฏธ์ง€๋ฅผ ํ•ด๋‹น ๊ทธ๋ฆผ์ฒด๋กœ ๋ณ€ํ™˜ํ•˜๊ณ  ์›นํˆฐ ๋‚ด ์บ๋ฆญํ„ฐ์˜ ํฌ์ฆˆ๋ฅผ ๋ณ€๊ฒฝํ•ด์ฃผ๋Š” ์„œ๋น„์Šค๋ฅผ ์ œ๊ณตํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

Table of content

Live Demo

  • ๋‹ค์Œ ๋งํฌ์—์„œ ์ง์ ‘ ์‹คํ–‰ํ•ด๋ณด์‹ค ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Background Image Generator

background.mp4

Pose Image Generator

pose.mp4

Member

๊น€์ฐฌ์šฐ ๋‚จํ˜„์šฐ ๋ฅ˜๊ฒฝ์—ฝ ์ด๊ทœ์„ญ ์ดํ˜„์ง€ ํ•œ์ฃผํฌ
Modeling Modeling Backend Backend Modeling Frontend
Background
Image
Generate
Background
Image
Generate
... Implement BE Pose
Image
Generate
UI/UX Design
Implement FE
detail detail detail detail detail detail
  • ํ”„๋กœํ•„ ์‚ฌ์ง„์„ ๋ˆ„๋ฅด๋ฉด ๊ฐœ์ธ Github ํ”„๋กœํ•„๋กœ ๋„˜์–ด๊ฐ‘๋‹ˆ๋‹ค.
  • detail ํŽ˜์ด์ง€์—์„œ ๊ฐœ์ธ์ด ๊ณตํ—Œํ•œ ๋‚ด์šฉ์— ๋Œ€ํ•ด ๋” ์ž์„ธํ•œ ์ •๋ณด๋ฅผ ์—ด๋žŒํ•˜์‹ค ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Project Timeline

Project Background

๊ธฐํš ์˜๋„ ๋ฐ ๊ธฐ๋Œ€ํšจ๊ณผ

  • ๋ฐฐ๊ฒฝ: ๊ตญ๋‚ด ์›นํˆฐ์ž‘๊ฐ€๋“ค์˜ ์—ด์•…ํ•œ ์ž‘์—… ํ™˜๊ฒฝ์€ ์›นํˆฐ ์—…๊ณ„์—์„œ ๊ณ ์งˆ์ ์ธ ๋ฌธ์ œ๋กœ ์ด์–ด์ ธ ์™”์Šต๋‹ˆ๋‹ค. ํ• ๋‹น๋œ ์ž‘์—…๋Ÿ‰์— ๋น„ํ•ด ์ด‰๋ฐ•ํ•œ ์ž‘์—… ๊ธฐ๊ฐ„, ๊ทธ๋ฆฌ๊ณ  ์ปท ์ˆ˜ ์กฐ์ •์ด๋‚˜ ํœด์žฌ๊ถŒ ๋ณด์žฅ์ด ์ œ๋Œ€๋กœ ์ด๋ฃจ์–ด์ง€์ง€ ์•Š๋Š” ํ™˜๊ฒฝ๋•Œ๋ฌธ์— ๋งŽ์€ ์ž‘๊ฐ€๋“ค์ด ์ •์‹ ์ , ์‹ ์ฒด์  ๊ฑด๊ฐ• ์•…ํ™”๋กœ ํ”ผํ•ด๋ฅผ ๋ฐ›๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.
  • ๋ชฉ์ : ๋ฐ˜๋ณต์ ์ด์ง€๋งŒ ์‹œ๊ฐ„์„ ๋งŽ์ด ์†Œ์š”ํ•˜๋Š” ๋ฐฐ๊ฒฝ ์ƒ์„ฑ, ์บ๋ฆญํ„ฐ ํฌ์ฆˆ ๋ณ€๊ฒฝ ๋“ฑ์˜ ์ž‘์—…๋“ค์„ ์ƒ์„ฑํ˜• AI๋ฅผ ํ†ตํ•ด ํ•ด๊ฒฐํ•ฉ๋‹ˆ๋‹ค. ํŠนํžˆ ๋ฐฐ๊ฒฝ ์ƒ์„ฑ์˜ ๊ฒฝ์šฐ, ์ž‘๊ฐ€ ๊ฐœ๊ฐœ์ธ์˜ ๊ทธ๋ฆผ์ฒด๋ฅผ ํ•™์Šตํ•˜์—ฌ ์ž‘๊ฐ€ ๋งž์ถคํ˜• AI ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•ด ๋ณด๋‹ค ๋” ์ž์—ฐ์Šค๋Ÿฌ์šด ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ํ•ด๋‹น ์„œ๋น„์Šค๋ฅผ ํ†ตํ•ด ๊ฒฐ๊ณผ๋ฌผ์˜ ํ€„๋ฆฌํ‹ฐ๋Š” ๋” ๋†’์ด๋ฉด์„œ ์ž‘์—…์‹œ๊ฐ„์„ ๋‹จ์ถ•์‹œ์ผœ์ค๋‹ˆ๋‹ค.

Service Architecture

  • ์„œ๋น„์Šค ๊ฐ„์˜ ์ƒํ˜ธ ์˜์กด๋„๋ฅผ ๋‚ฎ์ถ”๊ธฐ ์œ„ํ•ด ์›น ์„œ๋ฒ„์™€ ๋ชจ๋ธ ์„œ๋ฒ„๋ฅผ๋ถ„๋ฆฌํ•˜์˜€์Šต๋‹ˆ๋‹ค.
  • ์›น ์„œ๋น™์„ ๋‹ด๋‹นํ•˜๋Š” ์›น ํ”„๋ก ํŠธ์—”๋“œ ์„œ๋ฒ„, ๋ฐฑ์—”๋“œ ์„œ๋ฒ„ ๋ฐ db ์„œ๋ฒ„๋Š” ์ „๋ถ€ aws ec2 ์„œ๋ฒ„์—์„œ ํ˜ธ์ŠคํŒ…๋˜๊ณ , ๋ชจ๋ธ ์„œ๋ฒ„๋Š” NCP v100์„œ๋ฒ„์—์„œ ํ˜ธ์ŠคํŒ…๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค.
  • ๋ฐฑ์—”๋“œ ์„œ๋ฒ„๋Š” ํ”„๋ก ํŠธ์—์„œ ๋ฐ›์€ ์ธํ’‹ ์ด๋ฏธ์ง€๋‚˜ ํ”„๋กฌํ”„ํŠธ๋ฅผ ๋ชจ๋ธ ์„œ๋ฒ„๋กœ ๋ณด๋‚ด๊ฒŒ ๋˜๊ณ , ๋ชจ๋ธ ์„œ๋ฒ„๋Š” ์ด์— ๋Œ€ํ•œ ์‘๋‹ต์œผ๋กœ ํ•ด๋‹น ์ธํ’‹์— ๋Œ€ํ•œ inference ๊ฒฐ๊ณผ๋ฅผ ์ „์†กํ•ฉ๋‹ˆ๋‹ค.
  • ๋ฐฑ์—”๋“œ ์„œ๋ฒ„์—์„œ ํ•ด๋‹น ๊ฒฐ๊ณผ๋ฅผ ๋‹ค์‹œ ํ”„๋ก ํŠธ๋กœ ๋ณด๋‚ด์„œ ์œ ์ €์—๊ฒŒ ๋ณด์—ฌ์ฃผ๊ณ , ์ตœ์ข…์ ์œผ๋กœ ์œ ์ €๊ฐ€ ํ•ด๋‹น ๊ฒฐ๊ณผ๋ฅผ ์ €์žฅํ•˜๊ณ  ์‹ถ์œผ๋ฉด ํ”„๋ก ํŠธ์—์„œ ๊ด€๋ จ ์š”์ฒญ์„ ๋ฐฑ์—”๋“œ ์„œ๋ฒ„๋กœ ๋ณด๋‚ด aws s3์ €์žฅ์†Œ์— ๊ด€๋ จ ์‚ฌ์ง„๋“ค์„ ์ €์žฅํ•ฉ๋‹ˆ๋‹ค.

Modeling - Background

Inference

๋ฐฐ๊ฒฝ ์ƒ์„ฑ ์„œ๋น„์Šค๋Š” ์›๋ณธ ์ด๋ฏธ์ง€๊ฐ€ ์ฃผ์–ด์ง€๋Š”์ง€ ์—ฌ๋ถ€์— ๋”ฐ๋ผ์„œ Stable Diffusion์˜ Img2Img ๋ชจ๋ธ๊ณผ Txt2Img ๋ชจ๋ธ ์ค‘ ํ•˜๋‚˜๋ฅผ ์„ ํƒํ•˜๊ณ , ๋‹ค์Œ์˜ ๊ณผ์ •์„ ๊ฑฐ์ณ ์›นํˆฐ ์Šคํƒ€์ผ์˜ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.

  1. Noise Initialization
    • ์›๋ณธ ์ด๋ฏธ์ง€๊ฐ€ ์ฃผ์–ด์ง„ ๊ฒฝ์šฐ ์›๋ณธ ์ด๋ฏธ์ง€์— ๋…ธ์ด์ฆˆ๋ฅผ ๋‹จ๊ณ„์ ์œผ๋กœ ์ถ”๊ฐ€ํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด๋•Œ Strength ํŒŒ๋ผ๋ฏธํ„ฐ์˜ ๊ฐ’์— ๋”ฐ๋ผ ์ถ”๊ฐ€๋˜๋Š” ๋…ธ์ด์ฆˆ์˜ ์–‘์ด ๊ฒฐ์ •๋ฉ๋‹ˆ๋‹ค. 0์œผ๋กœ ์„ค์ •ํ•  ๊ฒฝ์šฐ ๋…ธ์ด์ฆˆ๊ฐ€ ์ถ”๊ฐ€๋˜์ง€ ์•Š์œผ๋ฉฐ, 1์ผ ๊ฒฝ์šฐ ๋…ธ์ด์ฆˆ๊ฐ€ ์ตœ๋Œ€์น˜๋กœ ์ถ”๊ฐ€๋˜์–ด ๋ณ€ํ˜•๋œ ์ด๋ฏธ์ง€๊ฐ€ ์™„์ „ ๋žœ๋คํ•œ ํ…์„œ(random tensor)๊ฐ€ ๋ฉ๋‹ˆ๋‹ค. ์ผ๋ฐ˜์ ์œผ๋กœ 0์— ๊ฐ€๊นŒ์šธ์ˆ˜๋ก ์›๋ณธ ์ด๋ฏธ์ง€๋ฅผ ์œ ์ง€ํ•˜๋ฉฐ 1์— ๊ฐ€๊นŒ์šธ์ˆ˜๋ก ์ œ์•ฝ์—์„œ ๋ฒ—์–ด๋‚˜ ์ž์œ ๋กญ๊ฒŒ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.
    • ์›๋ณธ ์ด๋ฏธ์ง€๊ฐ€ ์ฃผ์–ด์ง€์ง€ ์•Š์€ ๊ฒฝ์šฐ ์™„์ „ ๋žœ๋คํ•œ ํ…์„œ๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.
  2. Inject Condition
    • ์ด๋ฏธ์ง€๋‚˜ ํ…์ŠคํŠธ ๋“ฑ ๋‹ค์–‘ํ•œ ํ”ผ์ณ๋ฅผ ๋…ธ์ด์ฆˆ๋ฅผ ์ œ๊ฑฐํ•˜๋Š” ๊ณผ์ •์— ํ™œ์šฉํ•˜์—ฌ ์‚ฌ์šฉ์ž๊ฐ€ ์š”๊ตฌํ•˜๋Š” ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์šฐ๋ฆฌ์˜ ๋ชจ๋ธ์€ ํ”„๋กฌํ”„ํŠธ๋ฅผ ์ž…๋ ฅ์œผ๋กœ ๋ฐ›์•„ CLIP ๋ชจ๋ธ์˜ ํ…์ŠคํŠธ ์ธ์ฝ”๋”๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ž„๋ฒ ๋”ฉ ๋ฒกํ„ฐ๋ฅผ ์ถ”์ถœํ•ฉ๋‹ˆ๋‹ค. ์ด ๋•Œ ํŠธ๋ฆฌ๊ฑฐ ๋‹จ์–ด๋ผ๋Š”, ์ผ๋ฐ˜์ ์ธ ๋‹จ์–ด๊ฐ€ ์•„๋‹Œ ํ•™์Šต์— ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ์˜ ํŠน์ง•์„ ์ž์œจ์ ์œผ๋กœ ํ•™์Šตํ•  ์ˆ˜ ์žˆ๋Š” ๋‹จ์–ด๋ฅผ ํ™œ์šฉํ•˜์—ฌ(Text Inversion) ๋” ์ž์„ธํ•˜๊ณ  ๋†’์€ ํ’ˆ์งˆ์˜ ํŠน์ง•์„ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  3. Denoising Process
    • ์•ž์—์„œ ์ƒ์„ฑํ•œ ๋…ธ์ด์ฆˆ๋ฅผ ์ž…๋ ฅ์œผ๋กœ ๋ฐ›์•„ ๋‹จ๊ณ„๋ณ„๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ œ๊ฑฐํ•ด ๋‚˜๊ฐ€๋ฉฐ ๊ณ  ํ•ด์ƒ๋„์˜ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ํฌ๋กœ์Šค ์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ํ™œ์šฉํ•˜์—ฌ ์กฐ๊ฑด์œผ๋กœ ์ฃผ์–ด์ง€๋Š” ํ”ผ์ณ๊ฐ€ ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ๊ณผ์ •์„ ์œ ๋„ํ•ฉ๋‹ˆ๋‹ค. ์œ ๋„ํ•˜๋Š” ์ •๋„๋ฅผ guidance_score ํŒŒ๋ผ๋ฏธํ„ฐ๋กœ ์กฐ์ ˆํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ์ผ๋ฐ˜์ ์œผ๋กœ 7.5๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’์„ ํ‚ค์šธ์ˆ˜๋ก ๋…ธ์ด์ฆˆ๋ฅผ ์ œ๊ฑฐํ•˜๋Š” ๊ณผ์ •์—์„œ ์กฐ๊ฑด์œผ๋กœ ์ฃผ์–ด์ง€๋Š” ํ”ผ์ณ๋ฅผ ๋” ํฌ๊ฒŒ ๋ฐ˜์˜ํ•ฉ๋‹ˆ๋‹ค.

์ •๋ฆฌํ•˜๋ฉด, ์›นํˆฐ ์Šคํƒ€์ผ์„ ํ•™์Šตํ•œ ๋ชจ๋ธ์— ์›๋ณธ ์ด๋ฏธ์ง€๋‚˜ ํ”„๋กฌํ”„ํŠธ๋ฅผ ์ž…๋ ฅํ•˜๋ฉด ์ด๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ƒ์„ฑํ•˜๊ณ  ์ œ๊ฑฐํ•˜๋Š” ๊ณผ์ •์„ ์œ ๋„ํ•จ์œผ๋กœ์จ ์ž‘๊ฐ€๊ฐ€ ์š”๊ตฌํ•˜๋Š” ๋ฐฐ๊ฒฝ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.

Train

๋ณด๋‹ค ์™„์„ฑ๋„ ๋†’์€ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜๊ธฐ ์œ„ํ•ด ๋ฐฐ๊ฒฝ ์ƒ์„ฑ ๋ชจ๋ธ์„ ํŒŒ์ธํŠœ๋‹ํ•˜๋Š” ๊ณผ์ •์„ ์ถ”๊ฐ€ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ์ตœ๋Œ€ ํ•™์Šต ์‹œ๊ฐ„ 20๋ถ„, ์ƒ์„ฑ ์‹œ๊ฐ„์€ 1๋ถ„ ๋‚ด๋กœ ์ง„ํ–‰๋˜๋Š” ๋™์‹œ์— ์›นํˆฐ์˜ ์Šคํƒ€์ผ์„ ์ž˜ ์‚ด๋ฆฌ๋Š” ๋ชจ๋ธ์„ ๋ฆฌ์„œ์น˜ํ•˜์—ฌ ๋‹ค์Œ์˜ ๋‘ ๋ชจ๋ธ์„ ์„ ์ •ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

  • LoRA
    • ํ•œ ์žฅ ์ด์ƒ์˜ ์ด๋ฏธ์ง€๋กœ ํ•™์Šตํ•˜์—ฌ ์ฒ˜์Œ ๋ณด๋Š” ์ด๋ฏธ์ง€์— ๋Œ€ํ•ด์„œ๋„ ์ž˜ ์ƒ์„ฑํ•˜๋ฉฐ ์›๋ณธ ์ด๋ฏธ์ง€๋ฅผ ์ž˜ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ, ๋ฐ˜๋ณต์ ์œผ๋กœ ์‹คํ—˜ํ–ˆ์„ ๋•Œ ์•ˆ์ •์ ์ธ ์„ฑ๋Šฅ์„ ๋ณด์ž…๋‹ˆ๋‹ค.
    • ์ „์ฒด๋ฅผ ํŒŒ์ธํŠœ๋‹ํ•˜๋Š” ๊ฒƒ์— ๋น„ํ•ด ์ˆ˜๋ฐฑ๋ถ„์˜ ์ผ ๊ณ„์‚ฐ ๋น„์šฉ์œผ๋กœ ๋น„์Šทํ•œ ์„ฑ๋Šฅ์„ ๋‚ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
    • ํ•™์Šต๋œ ๊ฒฐ๊ณผ๊ฐ€ 10MB ์ •๋„๋กœ ๊ฐ€๋ณ๊ณ  ๊ฐ„๋‹จํ•˜๊ฒŒ ๊ธฐ์กด ๋ชจ๋ธ์— ๋”ํ•ด์„œ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์—ฌ๋Ÿฌ ํŒŒ์ธ ํŠœ๋‹๋œ ๋ชจ๋ธ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ๋„ ์‰ฝ๊ฒŒ ๊ตฌํ˜„ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • DreamStyler
    • ํ•œ ์žฅ์˜ ํ•™์Šต ์‚ฌ์ง„๋งŒ ์‚ฌ์šฉํ•˜๊ธฐ ๋•Œ๋ฌธ์— ์„ฑ๋Šฅ์˜ ์•ˆ์ •์„ฑ์ด ์กฐ๊ธˆ ๋–จ์–ด์ง€์ง€๋งŒ LoRA ๋ชจ๋ธ๋ณด๋‹ค ํ”„๋กฌํ”„ํŠธ๋ฅผ ์ž˜ ๋ฐ˜์˜ํ•˜๊ณ  ์›นํˆฐ์˜ ์Šคํƒ€์ผ์„ ์ž˜ ์‚ด๋ฆฌ๋Š” ์„ฑ๋Šฅ์„ ๋ณด์ž…๋‹ˆ๋‹ค.
  • ๋‘ ๊ฐœ์˜ ํ•™์Šตํ•œ ๋ชจ๋ธ์€ ์„œ๋กœ ๋‹ค๋ฅธ ๊ฐ•์ ์„ ๊ฐ€์ง€๊ธฐ ๋•Œ๋ฌธ์— inference ๋‹จ๊ณ„์—์„œ ์‚ฌ์šฉ์ž๊ฐ€ ์›ํ•˜๋Š” ๋ชจ๋ธ์„ ์„ ํƒํ•˜์—ฌ ๋ฐฐ๊ฒฝ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

Result

image image

Modeling - Pose

์บ๋ฆญํ„ฐ ํฌ์ฆˆ ๋ณ€๊ฒฝ์€ ํฌ๊ฒŒ Pose Estimation, Pose Transfer ๋‘ ๋‹จ๊ณ„๋กœ ์ง„ํ–‰๋ฉ๋‹ˆ๋‹ค.

Pose Estimation

Pose Estimation ๋ชจ๋ธ์€ DWPose๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. DWPose๋Š” ์ž…๋ ฅ ์ด๋ฏธ์ง€์—์„œ ์‹ ์ฒ˜๋ฅผ ์ฐพ๋Š” Detector์™€ ์‹ ์ฒด keypoint๋ฅผ ๋ถ„๋ฅ˜ํ•˜๋Š” Classifier๋ฅผ ๊ฑฐ์ณ ์–ผ๊ตด, ์†, ํŒ”, ๋‹ค๋ฆฌ์— ๋Œ€ํ•ด ์ „์ฒด 133๊ฐœ์˜ keypoint(COCO Whole Body)๋ฅผ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค.

Pose Transfer

Pose Transfer๋Š” Diffusion ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ AnimateAnyone ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ด์ „ Estimation์—์„œ ์ถ”์ถœ๋œ Target Pose ์ด๋ฏธ์ง€์™€ ์‚ฌ์šฉ์ž๊ฐ€ ์ž…๋ ฅํ•œ ์บ๋ฆญํ„ฐ ์ด๋ฏธ์ง€๊ฐ€ ์ž…๋ ฅ๋˜์–ด VAE, CLIP Encoder๋กœ ์ž„๋ฒ ๋”ฉ๋ฉ๋‹ˆ๋‹ค. Noise๋กœ๋ถ€ํ„ฐ ํฌ์ฆˆ๊ฐ€ ๋ณ€๊ฒฝ๋œ ์บ๋ฆญํ„ฐ ์ด๋ฏธ์ง€๊ฐ€ ์ƒ์„ฑ๋˜๋„๋ก Denoising UNet๊ณผ ReferenceNet์„ ์‚ฌ์šฉํ•˜๋ฉฐ VAE Decoder๋กœ ์ด๋ฏธ์ง€๋ฅผ ๋””์ฝ”๋”ฉํ•ด ๊ฒฐ๊ณผ ์ด๋ฏธ์ง€๋กœ ์ถœ๋ ฅํ•ฉ๋‹ˆ๋‹ค.

Project Roadmap

  • ํ•™์Šต ์ด๋ฏธ์ง€์˜ ์ˆ˜์— ๋”ฐ๋ผ ์Šคํƒ€์ผ ๊ฐ•๋„ ์กฐ์ ˆํ•˜๊ธฐ
  • ํ•œ๊ธ€ ํ”„๋กฌํ”„ํŠธ ์ฒ˜๋ฆฌํ•˜๊ธฐ
  • ๋ฐฐ๊ฒฝ์ด๋ฏธ์ง€๋กœ ํ•™์Šตํ•œ ๊ฐ€์ค‘์น˜๋ฅผ ํ™œ์šฉํ•ด ์ธ๋ฌผ ์ƒ์„ฑ ์ œ์–ดํ•˜๊ธฐ
  • ๋‹ค๋ฅธ ์˜ต์…˜์˜ ๋ชจ๋ธ ์ถ”๊ฐ€
  • ๊ฒฝ๋Ÿ‰ํ™” ๋œ ๋ชจ๋ธ ์ถ”๊ฐ€

Directory

Links

  • ๋ฐœํ‘œPPT
  • Wrapup Reports

About

๐ŸŒƒGenerative AI Assistant service for k-comics(webtoon) artists๐ŸŒ†

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 75.3%
  • JavaScript 23.7%
  • Other 1.0%