From bdb3c1e2db088dade047ba63b18320b3628f8dda Mon Sep 17 00:00:00 2001
From: poroblem <cheonyujun10@gmail.com>
Date: Sun, 11 Sep 2022 23:30:17 +0900
Subject: [PATCH 1/3] =?UTF-8?q?waveglow=EB=B2=88=EC=97=AD?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 nvidia_deeplearningexamples_waveglow.md | 45 +++++++++++++------------
 1 file changed, 23 insertions(+), 22 deletions(-)

diff --git a/nvidia_deeplearningexamples_waveglow.md b/nvidia_deeplearningexamples_waveglow.md
index 65d3bf0..08b5549 100644
--- a/nvidia_deeplearningexamples_waveglow.md
+++ b/nvidia_deeplearningexamples_waveglow.md
@@ -3,7 +3,7 @@ layout: hub_detail
 background-class: hub-background
 body-class: hub
 title: WaveGlow
-summary: WaveGlow model for generating speech from mel spectrograms (generated by Tacotron2)
+summary: 멜 스펙트로그램스(mel spectrograms)에서 발생시키기 위한 웨이브글로우(WaveGlow) 모델입니다 (타코트론2(Tacotron2) 모델에서 발생했다)
 category: researchers
 image: nvidia_logo.png
 author: NVIDIA
@@ -18,58 +18,59 @@ demo-model-link: https://huggingface.co/spaces/pytorch/WaveGlow
 ---
 
 
-### Model Description
+### 모델 설명
 
-The Tacotron 2 and WaveGlow model form a text-to-speech system that enables user to synthesise a natural sounding speech from raw transcripts without any additional prosody information. The Tacotron 2 model (also available via torch.hub) produces mel spectrograms from input text using encoder-decoder architecture. WaveGlow is a flow-based model that consumes the mel spectrograms to generate speech.
 
-### Example
+타코트론2 및 웨이브글로우 모델은 사용자가 추가 운율 정보 없이 원본 텍스트에서 자연스러운 음성을 합성할 수 있는 텍스트 음성 변환 시스템을 형성합니다. 트코트론 2 모델(torch.com를 통해서도 사용 가능)은 인코더-디코더 아키텍쳐를 사용하여 입력 텍스트로부터 멜 스텍트로그램스를 생성합니다. 웨이브글로우는 음성을 생성하기 위해 멜 스펙토그램스를 소비하는 흐름 기반 모델입니다.
 
-In the example below:
-- pretrained Tacotron2 and Waveglow models are loaded from torch.hub
-- Tacotron2 generates mel spectrogram given tensor represantation of an input text ("Hello world, I missed you so much")
-- Waveglow generates sound given the mel spectrogram
-- the output sound is saved in an 'audio.wav' file
+### 예제
 
-To run the example you need some extra python packages installed.
-These are needed for preprocessing the text and audio, as well as for display and input / output.
+아래의 예시에서 :
+- 사전 학습을 받은 타코트론2 및 웨이브글로우 모델들은 torch.hub에서 로드됩니다.
+- 타코트론2는 입력 텍스트의 텐서 표현("Hello world, I missed you so much")을 주어진 멜 스펙트로그램을 생성합니다.
+- 웨이브글로우는 멜 스펙트로그램이 준 소리를 발생시킵니다
+- 출력된 소리는 'audio.wav' 파일에 저장됩니다
+
+예제를 실행하려면 추가 파이썬 패키지가 설치되어 있어야 합니다.
+텍스트 및 오디오는 물론 디스플레이 및 입력/출력 전처리에 필요합니다.
 ```bash
 pip install numpy scipy librosa unidecode inflect librosa
 apt-get update
 apt-get install -y libsndfile1
 ```
 
-Load the WaveGlow model pre-trained on [LJ Speech dataset](https://keithito.com/LJ-Speech-Dataset/)
+[LJ Speech datase]에 대해 사전 학습을 받은 웨이브글로우 모델을 로드합니다(https://keithito.com/LJ-Speech-Dataset/)
 ```python
 import torch
 waveglow = torch.hub.load('NVIDIA/DeepLearningExamples:torchhub', 'nvidia_waveglow', model_math='fp32')
 ```
 
-Prepare the WaveGlow model for inference
+추론을 위해 웨이브글로우 모델을 준비합니다
 ```python
 waveglow = waveglow.remove_weightnorm(waveglow)
 waveglow = waveglow.to('cuda')
 waveglow.eval()
 ```
 
-Load a pretrained Tacotron2 model
+사전 학습을 받은 타코트론2 모델을 로드합니다
 ```python
 tacotron2 = torch.hub.load('NVIDIA/DeepLearningExamples:torchhub', 'nvidia_tacotron2', model_math='fp32')
 tacotron2 = tacotron2.to('cuda')
 tacotron2.eval()
 ```
 
-Now, let's make the model say:
+이제 모델에게 이렇게 말해봅니다:
 ```python
 text = "hello world, I missed you so much"
 ```
 
-Format the input using utility methods
+유용한 체계성을 사용하여 입력 형식을 지정합니다
 ```python
 utils = torch.hub.load('NVIDIA/DeepLearningExamples:torchhub', 'nvidia_tts_utils')
 sequences, lengths = utils.prepare_input_sequence([text])
 ```
 
-Run the chained models
+체인 모델을 실행합니다
 ```python
 with torch.no_grad():
     mel, _, _ = tacotron2.infer(sequences, lengths)
@@ -78,22 +79,22 @@ audio_numpy = audio[0].data.cpu().numpy()
 rate = 22050
 ```
 
-You can write it to a file and listen to it
+당신은 그것을 파일에 쓰고 들을 수 있습니다
 ```python
 from scipy.io.wavfile import write
 write("audio.wav", rate, audio_numpy)
 ```
 
-Alternatively, play it right away in a notebook with IPython widgets
+또는 아이파이썬(IPython) 위젯을 사용하여 노트북에서 바로 재생할 수 있습니다
 ```python
 from IPython.display import Audio
 Audio(audio_numpy, rate=rate)
 ```
 
-### Details
-For detailed information on model input and output, training recipies, inference and performance visit: [github](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2) and/or [NGC](https://ngc.nvidia.com/catalog/resources/nvidia:tacotron_2_and_waveglow_for_pytorch)
+### 세부사항
+모델 입력 및 출력, 교육 방안, 추론 및 성과에 대한 자세한 내용은 방문하십시오: [github](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2) 그리고/또는 [NGC](https://ngc.nvidia.com/catalog/resources/nvidia:tacotron_2_and_waveglow_for_pytorch)
 
-### References
+### 출처
 
  - [Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions](https://arxiv.org/abs/1712.05884)
  - [WaveGlow: A Flow-based Generative Network for Speech Synthesis](https://arxiv.org/abs/1811.00002)

From 9e00f712dce5f1def06ecaa5f0ab89d1c9e62fa9 Mon Sep 17 00:00:00 2001
From: poroblem <cheonyujun10@gmail.com>
Date: Wed, 14 Sep 2022 02:21:03 +0900
Subject: [PATCH 2/3] =?UTF-8?q?waveglow=EC=88=98=EC=A0=95=EB=B3=B8?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 nvidia_deeplearningexamples_waveglow.md | 30 ++++++++++++-------------
 1 file changed, 15 insertions(+), 15 deletions(-)

diff --git a/nvidia_deeplearningexamples_waveglow.md b/nvidia_deeplearningexamples_waveglow.md
index 08b5549..01a5bf8 100644
--- a/nvidia_deeplearningexamples_waveglow.md
+++ b/nvidia_deeplearningexamples_waveglow.md
@@ -3,7 +3,7 @@ layout: hub_detail
 background-class: hub-background
 body-class: hub
 title: WaveGlow
-summary: 멜 스펙트로그램스(mel spectrograms)에서 발생시키기 위한 웨이브글로우(WaveGlow) 모델입니다 (타코트론2(Tacotron2) 모델에서 발생했다)
+summary: mel spectrograms에서 발생시키기 위한 WaveGlow 모델입니다 (Tacotron2 모델에서 발생했다)
 category: researchers
 image: nvidia_logo.png
 author: NVIDIA
@@ -21,15 +21,15 @@ demo-model-link: https://huggingface.co/spaces/pytorch/WaveGlow
 ### 모델 설명
 
 
-타코트론2 및 웨이브글로우 모델은 사용자가 추가 운율 정보 없이 원본 텍스트에서 자연스러운 음성을 합성할 수 있는 텍스트 음성 변환 시스템을 형성합니다. 트코트론 2 모델(torch.com를 통해서도 사용 가능)은 인코더-디코더 아키텍쳐를 사용하여 입력 텍스트로부터 멜 스텍트로그램스를 생성합니다. 웨이브글로우는 음성을 생성하기 위해 멜 스펙토그램스를 소비하는 흐름 기반 모델입니다.
+Tacotron2 및 WaveGlow 모델은 사용자가 추가 운율 정보 없이 원본 텍스트에서 자연스러운 음성을 합성할 수 있는 텍스트 음성 변환 시스템을 형성합니다. Tacotron2 모델(torch.com를 통해서도 사용 가능)은 인코더-디코더 아키텍쳐를 사용하여 입력 텍스트로부터 mel spectrograms를 생성합니다. WaveGlow는 음성을 생성하기 위해 mel spectrograms를 소비하는 흐름 기반 모델입니다.
 
 ### 예제
 
 아래의 예시에서 :
-- 사전 학습을 받은 타코트론2 및 웨이브글로우 모델들은 torch.hub에서 로드됩니다.
-- 타코트론2는 입력 텍스트의 텐서 표현("Hello world, I missed you so much")을 주어진 멜 스펙트로그램을 생성합니다.
-- 웨이브글로우는 멜 스펙트로그램이 준 소리를 발생시킵니다
-- 출력된 소리는 'audio.wav' 파일에 저장됩니다
+- 사전 학습을 받은 Tacotron2 및 WaveGlow 모델들은 torch.hub에서 로드됩니다.
+- Tacotron2는 입력 텍스트의 텐서 표현("Hello world, I missed you so much")을 주어진 mel spectrogram을 생성합니다.
+- WaveGlow는 mel spectrogram이 준 소리를 발생시킵니다.
+- 출력된 소리는 'audio.wav' 파일에 저장됩니다.
 
 예제를 실행하려면 추가 파이썬 패키지가 설치되어 있어야 합니다.
 텍스트 및 오디오는 물론 디스플레이 및 입력/출력 전처리에 필요합니다.
@@ -39,20 +39,20 @@ apt-get update
 apt-get install -y libsndfile1
 ```
 
-[LJ Speech datase]에 대해 사전 학습을 받은 웨이브글로우 모델을 로드합니다(https://keithito.com/LJ-Speech-Dataset/)
+[LJ Speech datase](https://keithito.com/LJ-Speech-Dataset/)에 대해 사전 학습을 받은 WaveGlow 모델을 로드합니다.
 ```python
 import torch
 waveglow = torch.hub.load('NVIDIA/DeepLearningExamples:torchhub', 'nvidia_waveglow', model_math='fp32')
 ```
 
-추론을 위해 웨이브글로우 모델을 준비합니다
+추론을 위해 WaveGlow 모델을 준비합니다.
 ```python
 waveglow = waveglow.remove_weightnorm(waveglow)
 waveglow = waveglow.to('cuda')
 waveglow.eval()
 ```
 
-사전 학습을 받은 타코트론2 모델을 로드합니다
+사전 학습을 받은 Tacotron2 모델을 로드합니다.
 ```python
 tacotron2 = torch.hub.load('NVIDIA/DeepLearningExamples:torchhub', 'nvidia_tacotron2', model_math='fp32')
 tacotron2 = tacotron2.to('cuda')
@@ -64,13 +64,13 @@ tacotron2.eval()
 text = "hello world, I missed you so much"
 ```
 
-유용한 체계성을 사용하여 입력 형식을 지정합니다
+유용한 체계성을 사용하여 입력 형식을 지정합니다.
 ```python
 utils = torch.hub.load('NVIDIA/DeepLearningExamples:torchhub', 'nvidia_tts_utils')
 sequences, lengths = utils.prepare_input_sequence([text])
 ```
 
-체인 모델을 실행합니다
+체인 모델을 실행합니다.
 ```python
 with torch.no_grad():
     mel, _, _ = tacotron2.infer(sequences, lengths)
@@ -79,22 +79,22 @@ audio_numpy = audio[0].data.cpu().numpy()
 rate = 22050
 ```
 
-당신은 그것을 파일에 쓰고 들을 수 있습니다
+그것을 파일에 쓰고 들을 수 있습니다.
 ```python
 from scipy.io.wavfile import write
 write("audio.wav", rate, audio_numpy)
 ```
 
-또는 아이파이썬(IPython) 위젯을 사용하여 노트북에서 바로 재생할 수 있습니다
+또는 IPython 위젯을 사용하여 노트북에서 바로 재생할 수 있습니다.
 ```python
 from IPython.display import Audio
 Audio(audio_numpy, rate=rate)
 ```
 
 ### 세부사항
-모델 입력 및 출력, 교육 방안, 추론 및 성과에 대한 자세한 내용은 방문하십시오: [github](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2) 그리고/또는 [NGC](https://ngc.nvidia.com/catalog/resources/nvidia:tacotron_2_and_waveglow_for_pytorch)
+모델 입력 및 출력, 교육 방안, 추론 및 성과에 대한 자세한 내용은 방문하십시오 : [github](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2) 그리고/또는 [NGC](https://ngc.nvidia.com/catalog/resources/nvidia:tacotron_2_and_waveglow_for_pytorch)
 
-### 출처
+### 참고 문헌
 
  - [Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions](https://arxiv.org/abs/1712.05884)
  - [WaveGlow: A Flow-based Generative Network for Speech Synthesis](https://arxiv.org/abs/1811.00002)

From 1a7af1c9c4414a8f66bc8719f4bbb4cd3b742fbb Mon Sep 17 00:00:00 2001
From: poroblem <cheonyujun10@gmail.com>
Date: Tue, 20 Sep 2022 23:12:05 +0900
Subject: [PATCH 3/3] =?UTF-8?q?waveglow=20=EC=88=98=EC=A0=95=20=EC=B5=9C?=
 =?UTF-8?q?=EC=A2=85=EB=B3=B8?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 nvidia_deeplearningexamples_waveglow.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/nvidia_deeplearningexamples_waveglow.md b/nvidia_deeplearningexamples_waveglow.md
index 01a5bf8..e4303ca 100644
--- a/nvidia_deeplearningexamples_waveglow.md
+++ b/nvidia_deeplearningexamples_waveglow.md
@@ -3,7 +3,7 @@ layout: hub_detail
 background-class: hub-background
 body-class: hub
 title: WaveGlow
-summary: mel spectrograms에서 발생시키기 위한 WaveGlow 모델입니다 (Tacotron2 모델에서 발생했다)
+summary: Tacotron2이 만들어낸 mel spectrograms을 소비하여 음성을 생성하기 위한 WaveGlow 모델입니다.
 category: researchers
 image: nvidia_logo.png
 author: NVIDIA
@@ -32,7 +32,7 @@ Tacotron2 및 WaveGlow 모델은 사용자가 추가 운율 정보 없이 원본
 - 출력된 소리는 'audio.wav' 파일에 저장됩니다.
 
 예제를 실행하려면 추가 파이썬 패키지가 설치되어 있어야 합니다.
-텍스트 및 오디오는 물론 디스플레이 및 입력/출력 전처리에 필요합니다.
+텍스트 및 오디오 뿐만 아니라 디스플레이 및 입력/출력 전처리에 필요합니다.
 ```bash
 pip install numpy scipy librosa unidecode inflect librosa
 apt-get update
@@ -59,7 +59,7 @@ tacotron2 = tacotron2.to('cuda')
 tacotron2.eval()
 ```
 
-이제 모델에게 이렇게 말해봅니다:
+이제 모델이 이렇게 말하게 해봅시다 :
 ```python
 text = "hello world, I missed you so much"
 ```