WhisperX Server

WhisperX Server is a powerful backend application that provides advanced audio and video processing capabilities, including transcription, text-to-speech conversion, and voice conversion. It's designed to work in conjunction with the Banana Client project.

Features

Audio transcription using WhisperX
Text-to-speech synthesis with multiple voices and backends
Voice conversion using RVC (Retrieval-based Voice Conversion)
YouTube video downloading and processing
Subtitle generation
Storyboard creation from videos
API endpoints for integration with frontend applications

Installation

Clone the repository:

git clone https://github.com/your-username/whisperx-server.git
cd whisperx-server

Create and activate a virtual environment:

python -m venv my-venv
source my-venv/bin/activate  # On Windows, use: my-venv\Scripts\activate

Install PyTorch following the instructions at https://pytorch.org/
Install the required dependencies:
```
pip install -r requirements.txt
```
Note: Some dependencies may need to be installed manually or may require additional setup.

Set up environment variables: Create a .env file in the project root and add the following:

HF_TOKEN=your_huggingface_token
MODEL_DIRECTORY=path/to/model/directory
OUTPUT_DIRECTORY=path/to/output/directory
VOICES_DIRECTORY=path/to/voices/directory
API_TOKEN=your_api_token

Usage

Start the server:
```
python main.py
```
The server will be available at http://localhost:8127 (or the port specified in your config.ini file).
Use the provided API endpoints to interact with the server. For example:
- Transcribe a YouTube video: POST /api/transcribe/url
- Generate text-to-speech: POST /api/text2speech
- Process audio with voice conversion: POST /api/rvc

Configuration

Network settings, model parameters, and other options can be configured in the config.ini file.
Additional settings are available in the settings.py file.

TODO

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

Contact and Support

I'm still learning and improving my skills with this project. If you have any questions, suggestions, or if you'd like to contribute, please don't hesitate to reach out:

GitHub Issues: For bug reports, feature requests, or general questions, please open an issue on this repository.
Discussions: For broader conversations about the project, use the GitHub Discussions feature in this repository.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgements

WhisperX for improved transcription capabilities
RVC (Retrieval-based Voice Conversion) for voice conversion
YT-DLP for YouTube video downloading
FastAPI for the API framework
PyTorch for deep learning capabilities

Note

This project is designed to work in conjunction with the Banana Client project. Make sure to set up and configure both projects for full functionality.

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
rvc		rvc
.env.example		.env.example
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
audio_processor.py		audio_processor.py
client.py		client.py
config.ini		config.ini
deepspeed-0.13.1+cu121-cp310-cp310-win_amd64.whl		deepspeed-0.13.1+cu121-cp310-cp310-win_amd64.whl
deepspeed-0.14.0+ce78a63-cp311-cp311-win_amd64.whl		deepspeed-0.14.0+ce78a63-cp311-cp311-win_amd64.whl
fairseq-0.12.3-cp311-cp311-win_amd64.whl		fairseq-0.12.3-cp311-cp311-win_amd64.whl
main.py		main.py
requirements.txt		requirements.txt
rvc_processing.py		rvc_processing.py
schema.py		schema.py
settings.py		settings.py
test.py		test.py
tts_functions.py		tts_functions.py
util.py		util.py
video_download.py		video_download.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WhisperX Server

Features

Installation

Usage

Configuration

TODO

Contributing

Contact and Support

License

Acknowledgements

Note

About

Releases

Packages

Contributors 2

Languages

License

lobsterchan27/WhisperX-Server

Folders and files

Latest commit

History

Repository files navigation

WhisperX Server

Features

Installation

Usage

Configuration

TODO

Contributing

Contact and Support

License

Acknowledgements

Note

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages