This repository contains three algorithms that solve the following problems respectively:
- Recognize faces from videos clips.
- Recongizes voices from audios.
- Separate speeches from a videos of three speakers speaking together based on given visual and audio information of the speakers.
This project is implemented in Python with the following packages: Face Recognition, Resemblyzer, Speechbrain.
Clone this project:
git clone https://github.com/Lukeli0425/VASP.git
Install the required packages:
pip install -r requirements.txt