專案名稱 : 宅經濟 - 虛擬偶像辨識

(中 Chinese/ 英 English)

專案名稱 : 宅經濟 - 虛擬偶像辨識

簡報PDF: https://drive.google.com/drive/u/0/folders/1JBadWmJ16yvmhH8hIDYWHGj-sUE45_WS
簡報: https://www.youtube.com/watch?v=cYjT20G_zG4

專案目的 :

讓使用者透過 AI 模型, 與展覽場地中的人形立牌或二次創作的商品, 進行互動

技術概要 :

1. 前端 : LineBOT, 網頁, APP(Android only)
2. 後端 : 地端server
3. AI model : CNN ResNet 101 v2
              YOLO v8

AI model 訓練過程

第一次模型建立

蒐集資料 :
- 使用技術 : OpenCV (使用OpenCV 擷取Youtube上的直播影像)
- Vtuber : 2 位
模型 : 仿製少層的 VGG16
問題點 :
- 背景因素影響過大 -> AI model 以背景為判斷基準, 非人物

第二次模型建立

優化想法 :
- 擷取的圖片, 需要更集中在人物面部特徵
- 完全去除背景
蒐集資料 :
- 使用技術 : OpenCV, SAM (SAM是Meta團隊於 2023.4月發表的新技術 : https://segment-anything.com/)
- Vtuber : 6 位
模型 :
- VGG16
問題點 :
- AI model 訓練完成, 但是, 用於辨識人形立牌或二次創作的商品時, 效果不好
  (a. 訓練資料與真實場景, 差異過大)
- Vtuber數量增加, 特徵容易重複, 需要截取更多特徵
  (a. 增加模型深度 b. 更換深度更深的模型)

第三次模型建立

優化想法 :
- 在訓練資料中, 直接加入人形立牌或二次創作的商品等圖片
- 使用 ResNet 101 v2 模型
蒐集資料 :
- 使用技術 : OpenCV, SAM, 現實照片
- Vtuber : 6 位
模型 :
- ResNet 50 v2
- ResNet 101 v2

CNN 成果

第三次模型, 於訓練過程真實場景辨識效果良好, 將此模型部署
為了進一步提升互動性 -> 額外增加另一種模型

AI model - YOLO v8

YOLO v8 : https://github.com/ultralytics/ultralytics
CNN 於辨識單一目標的圖像時, 效果良好。
但使用者如同時拍攝多的人形立牌, 多個商品, CNN 較不適用 ; 且 CNN 無法用於影像識別。
所以額外學習 PyTorch 以了解 YOLO v8 如何建立與運作。

蒐集資料 : 直接使用人形立牌或二次創作的商品等圖片
標記資料 :
- API : https://hub.ultralytics.com/
模型成果影片 :
- https://www.youtube.com/playlist?list=PLjW4Ibuk2-7BTD_uz0qU2fADAdJYOdUiN

YOLO 成果

相較於YOLO v7, YOLO v8 在多目標圖像辨識, 即時影像辨識都取得較好的成果。
但受限於專案時間, 沒有部署YOLO系列模型, 僅展示成果影片。

後記

CNN 模型反應時間, 會影響使用者體驗
深度學習領域 PyTorch 使用者較多 -> 加強PyTorch 並練習建立模型
多目標任務 -> YOLO 表現良好
PyTorch 有其他類型的深度學習模型可額外加強

Project name : Vtuber Identify AI

Purpose :

Allow user interacts with Vtuber's life-sized cardboardcut-out or Vtuber's product.

Tech :

1. front-end : LineBOT, 網頁, APP(Android only)
2. back-end : local server
3. AI model : CNN ResNet 101 v2
              YOLO v8

AI model training record

First time

collecting data :
- tech : OpenCV (collecting images from Vtuber's stream on youtube)
- Vtuber : 2
model :
- VGG16
result (problems) :
- AI model focus on background too much

Second time

how to optimize :
- OpenCV : when collecting images, focus on Vtuber's facial features (less background)
- SAM : Image matting (Meta issue this API in 2023.4 https://segment-anything.com/)
collecting data :
- tech : OpenCV, SAM
- Vtuber : 6
model :
- VGG16
result (problems) :
- AI model can identify training data ; can not identify Vtuber's life-sized cardboardcut-out or Vtuber's product.
  (a. huge different between training data and real target)
- Vtuber increase from 2 to 6, AI need to capture more features (a. increase more layer)
  (b. use another model)

Third time

how to optimize :
- training data : put some Vtuber's life-sized cardboardcut-out or Vtuber's product into training data
- model : ResNet
collecting data :
- tech : OpenCV, SAM
- Vtuber : 6
model :
- ResNet 50 v2
- ResNet 101 v2

CNN result

ResNet 101 v2 has better result and accuracy.
But, if we want to achieve : multiple targets identify or real time identify, we need other model.

AI model - YOLO v8

YOLO v8 : https://github.com/ultralytics/ultralytics
Since YOLO v8 was based on PyTorch, self learn some PyTorch for better understanding YOLO v8.

collecting data : Vtuber's life-sized cardboardcut-out or Vtuber's product into training data
labeling data :
- API : https://hub.ultralytics.com/
YOLO v8 result :
- https://www.youtube.com/playlist?list=PLjW4Ibuk2-7BTD_uz0qU2fADAdJYOdUiN

YOLO result

v8 has better result than v7, but project time limit was close, we did not deploy YOLO.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Code		Code
.gitignore		.gitignore
README.md		README.md
Target_Vtuber.txt		Target_Vtuber.txt
YOLO_v8_version1_from重誼.ipynb		YOLO_v8_version1_from重誼.ipynb
第一次.png		第一次.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

專案名稱 : 宅經濟 - 虛擬偶像辨識

專案目的 :

技術概要 :

AI model 訓練過程

第一次模型建立

第二次模型建立

第三次模型建立

CNN 成果

AI model - YOLO v8

YOLO 成果

後記

Project name : Vtuber Identify AI

Purpose :

Tech :

AI model training record

First time

Second time

Third time

CNN result

AI model - YOLO v8

YOLO result

About

Releases

Packages

Languages

h0806449f/Project_Tibame

Folders and files

Latest commit

History

Repository files navigation

專案名稱 : 宅經濟 - 虛擬偶像辨識

專案目的 :

技術概要 :

AI model 訓練過程

第一次模型建立

第二次模型建立

第三次模型建立

CNN 成果

AI model - YOLO v8

YOLO 成果

後記

Project name : Vtuber Identify AI

Purpose :

Tech :

AI model training record

First time

Second time

Third time

CNN result

AI model - YOLO v8

YOLO result

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages