futureagi commited on
Commit
dceebd1
1 Parent(s): 95b0517

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +106 -4
README.md CHANGED
@@ -7,10 +7,112 @@ language:
7
  - tr
8
  ---
9
 
10
- Voicecover AI has introduced an AI-powered video translation tool that features voice cloning and lip sync capabilities. This platform enables creators to translate videos into 30 languages, including Arabic, Mandarin, and Spanish, while matching the speaker's lip movements to the translated audio.
11
 
12
- Building on its previous success with AI animation, Voicecover raised $25 million in Series B funding in 2022, expanding its services for enterprise customers. The AI Video Translate technology is currently free for subscribers, with plans starting at $2 per minute.
13
 
14
- This tool aims to reduce localization costs for businesses in marketing, entertainment, and social media. Voicecover's proprietary model, Localizer 3000, powers this feature, which is accessible through Voicecover Studio and its API.
15
 
16
- A model card repository for the video-to-video translation model and API specifications for web service usage will be available soon. Videos can range from 10 seconds to 55 minutes, with a maximum file size of 2GB, and work best with a single subject facing the camera.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  - tr
8
  ---
9
 
10
+ # Voicecover AI Launches Video Translation Tool
11
 
12
+ Voicecover AI has introduced an AI-powered video translation tool that features voice cloning and lip sync capabilities. This platform enables creators to translate videos into 14 languages, including Arabic, Mandarin, and Spanish, while matching the speaker's lip movements to the translated audio.
13
 
14
+ This tool aims to reduce localization costs for businesses in marketing, entertainment, and social media. Voicecover's proprietary model, Localizer v3, powers this feature, which is accessible through Voicecover Studio and its API.
15
 
16
+ A model card repository for the video-to-video translation model and API specifications for web service usage will be available soon. Videos can range from 10 seconds to 55 minutes, with a maximum file size of 2GB, and work best with a single subject facing the camera.
17
+
18
+ ### API Specification for Video Translation Service
19
+
20
+ #### Overview
21
+ The Voicecover API allows developers to integrate video translation capabilities into their applications, enabling seamless localization and voice cloning.
22
+
23
+ #### Base URL
24
+ ```
25
+ https://api.voicecover.ai/v1
26
+ ```
27
+
28
+ #### Authentication
29
+ - **Method:** Bearer Token
30
+ - **Header:** `Authorization: Bearer <your_access_token>`
31
+
32
+ #### Endpoints
33
+
34
+ 1. **Translate Video**
35
+ - **Endpoint:** `/translate`
36
+ - **Method:** POST
37
+ - **Description:** Submits a video for translation.
38
+ - **Request Body:**
39
+ ```json
40
+ {
41
+ "video_url": "string",
42
+ "target_languages": ["string"],
43
+ "voice_cloning": true,
44
+ "lip_sync": true
45
+ }
46
+ ```
47
+ - **Response:**
48
+ ```json
49
+ {
50
+ "job_id": "string",
51
+ "status": "pending"
52
+ }
53
+ ```
54
+
55
+ 2. **Check Translation Status**
56
+ - **Endpoint:** `/status/{job_id}`
57
+ - **Method:** GET
58
+ - **Description:** Retrieves the status of a translation job.
59
+ - **Response:**
60
+ ```json
61
+ {
62
+ "job_id": "string",
63
+ "status": "completed",
64
+ "download_url": "string"
65
+ }
66
+ ```
67
+
68
+ 3. **Get Supported Languages**
69
+ - **Endpoint:** `/languages`
70
+ - **Method:** GET
71
+ - **Description:** Lists all supported translation languages.
72
+ - **Response:**
73
+ ```json
74
+ {
75
+ "languages": ["string"]
76
+ }
77
+ ```
78
+
79
+ #### Rate Limits
80
+ - **Requests:** 100 requests per minute
81
+ - **Video Length:** Up to 55 minutes
82
+ - **File Size:** Max 2GB
83
+
84
+ ---
85
+
86
+ ### Model Specification for Video Translation
87
+
88
+ #### Model Name
89
+ **Localizer v3**
90
+
91
+ #### Model Type
92
+ - **Architecture:** Deep Learning (Convolutional Neural Networks for video, Recurrent Neural Networks for audio)
93
+
94
+ #### Input Specifications
95
+ - **Video Format:** MP4, AVI, MOV
96
+ - **Audio Format:** AAC, MP3
97
+ - **Resolution:** Up to 1080p
98
+
99
+ #### Output Specifications
100
+ - **Translated Video:** MP4 format with synchronized audio and lip movements
101
+ - **Supported Languages:** 14 languages including Russian, Arabic, Mandarin, Japanese, Hindi, Spanish, and French
102
+
103
+ #### Performance Metrics
104
+ - **Translation Accuracy:** 95% (based on internal benchmarks)
105
+ - **Lip Sync Accuracy:** 90% (measured against human dubbing)
106
+
107
+ #### Use Cases
108
+ - Marketing videos
109
+ - Educational content
110
+ - Entertainment localization
111
+
112
+ #### Limitations
113
+ - Supports only one subject in the frame
114
+ - Best results with face fully visible and facing the camera
115
+
116
+ ---
117
+
118
+ This specification provides a comprehensive overview of the API and model functionalities for integrating and utilizing the Voicecover video translation service.