Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Add Digital Human Example #1208

Open
lvliang-intel opened this issue Nov 28, 2024 · 0 comments
Open

[Feature] Add Digital Human Example #1208

lvliang-intel opened this issue Nov 28, 2024 · 0 comments
Assignees
Labels
feature New feature or request

Comments

@lvliang-intel
Copy link
Collaborator

Priority

P2-High

OS type

Ubuntu

Hardware type

Gaudi2

Running nodes

Single Node

Description

Feature: Add Digital Human Example

Description:
This feature involves creating a Digital Human example that showcases real-time interaction capabilities using LLM, TTS, ASR, and video generation microservices. The Digital Human will be designed to respond dynamically and naturally to user inputs, enabling use cases such as livestream e-commerce and spoken language practice.

The implementation will include versions optimized for Intel Xeon and Gaudi hardware, leveraging their unique capabilities to achieve high-performance inference and seamless integration of the microservices. This example will demonstrate the potential of combining AI technologies to create engaging and interactive user experiences.

Tasks:
Design the Digital Human Workflow

Define the interaction flow between LLM, TTS, ASR, and video generation microservices.
Ensure low-latency, real-time processing for a smooth user experience.

Develop the Microservice Integration

Set up and configure microservices for LLM, TTS, ASR, and video generation.
Design APIs and communication protocols to enable seamless data exchange between services.

Optimize for Xeon and Gaudi Hardware

Leverage Xeon CPUs for general-purpose workloads and Gaudi accelerators.
Optimize model inference and data processing pipelines for each hardware platform.

Create Livestream and Language Practice Use Cases

Build a demo showcasing livestream e-commerce capabilities, such as responding to customer inquiries in real-time.
Develop a spoken language practice scenario with the Digital Human providing interactive feedback.

Testing and Validation

Conduct functional testing to ensure accurate responses and smooth interactions.
Benchmark performance on both Xeon and Gaudi hardware to validate optimization efforts.

Documentation and Deployment

Document the setup process, architecture, and usage of the Digital Human example.
Deploy the example in a demo environment for showcase and further testing.

Expected Outcomes:

Interactive Digital Human Prototype

A fully functional Digital Human example capable of real-time interactions.

Optimized Performance

Hardware-specific optimizations for Xeon and Gaudi ensuring high efficiency.

Real-World Use Case Demonstrations

Compelling examples of livestream e-commerce and spoken language practice.

@lvliang-intel lvliang-intel added the feature New feature or request label Nov 28, 2024
@lvliang-intel lvliang-intel added this to the v1.2 milestone Nov 28, 2024
@joshuayao joshuayao moved this to In progress in OPEA Nov 28, 2024
@joshuayao joshuayao removed this from the v1.2 milestone Dec 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or request
Projects
Status: In progress
Development

No branches or pull requests

3 participants