There must be an issue because when I set the parallel processing line, for example, precompute_num_workers=4, a "killed" error appears, and the precomputation does not start.
If I set precompute_num_workers=1, it takes an extremely long time. I'm not sure what the problem is, but I have a dataset of 600 WAV files (each less than 10 seconds long) in LJSpeech format, and I don't think it's normal for it to take this long.
Describe the bug
Hello,
I'm having some issues training FastSpeech2. According to Task Manager, the process is using only RAM and not the GPU. I've read that the precomputation step may run entirely on the CPU and not need the GPU, but that seems odd to me. The GPU is available and set up for CUDA, yet the script doesn't seem to take advantage of it.
I'm using the code in the To Reproduce section.
I'm running it on a machine with a 4050 GPU and 40 GB of RAM. Thank you.
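One way to check whether PyTorch itself can see the GPU, independent of the training script, is a short probe like the one below. This is only a minimal sketch: `describe_device` is a hypothetical helper name and is not part of TTS or the trainer. It relies on `torch.cuda.is_available()` and `torch.cuda.get_device_name()`, and falls back to a message if `torch` is not installed.

```python
import importlib.util


def describe_device() -> str:
    """Return a short description of the device PyTorch would use.

    Hypothetical helper for illustration; not part of the TTS library.
    """
    # Avoid an ImportError when torch is missing from the environment.
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed"

    import torch

    if torch.cuda.is_available():
        # Index 0 is the first CUDA device visible to this process.
        return f"cuda: {torch.cuda.get_device_name(0)}"
    return "cpu (CUDA not available to PyTorch)"


if __name__ == "__main__":
    print(describe_device())
```

If this reports `cpu`, the installed PyTorch build most likely cannot reach CUDA at all, and that would need fixing first. If it reports the GPU, the idle GPU during precomputation may simply mean that step runs on the CPU.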
To Reproduce
import os
import torch
from trainer import Trainer, TrainerArgs
from TTS.config.shared_configs import BaseAudioConfig, BaseDatasetConfig
from TTS.tts.configs.fastspeech2_config import Fastspeech2Config
from TTS.tts.datasets import load_tts_samples
from TTS.tts.models.forward_tts import ForwardTTS
from TTS.tts.utils.text.tokenizer import TTSTokenizer
from TTS.utils.audio import AudioProcessor
from TTS.utils.manage import ModelManager
import subprocess
def main():
    # Directory containing this script.
    output_path = os.path.dirname(os.path.abspath(__file__))


if __name__ == "__main__":
    main()
Expected behavior
No response
Logs
Environment
Additional context
No response