iVocaloid论坛
标题: 【数据结构】CVS3 与 RVS3数据结构设计 [打印本页]
作者: Zleepwalking 时间: 2013/9/1 11:19 标题: 【数据结构】CVS3 与 RVS3数据结构设计
本帖最后由 Zleepwalking 于 2013/9/1 16:57 编辑
反倒比CVS2结构简单了。
其实东西都挂在GH上了,我只是在这里做个转播,另附:
As I wrote in README.md before, the next version will be totally rewritten again, like the evolution from Rocaloid1 to Rocaloid1.6.
Currently the version of RSC, CVS, and CDT format has already reached 2.x, which means they are in different version with the synthesizer.
Also considering the significant change in synthesis algorithm(TDPSM -> FECSOLA), I've decided to name the next generation as "Rocaloid Engine 3" instead of 2, along with CVE 3.
Here I have to restate the definition and relations of "Rocaloid Engine":
- Rocaloid Engine includes Cybervoice Engine(CVE) and CVS Generator and provides I/O of RSC(Rocaloid SCript) and CVS(CyberVoice Script).
- CVE is the synthesis engine in Rocaloid Project.
- CVS is the file format for storing phonetic information, can be directly synthesized by CVE. CVS contains much more detailed information(e.g. duration of each phoneme) than RSC.
- RSC is the file format for the note editor, can only be synthesized by transforming into CVS, with CVS Generator and CDT.
- CVS Generator is the sub program of RSCCommon(which includes CVS Gen, I/O of RSC and vsqx), which is used to transform RSC into CVS using CDT so that the RSC file can be synthesized.
- CDT is the dictionary used by CVS Generator. Contains phonetic definitions, which are data derived from lots of phonetic experiments.
RSC will not be included in Rocaloid Engine anymore, because RSC is strongly related to the note editor, and dealing with editor settings and musical notations is not the business of Rocaloid Engine.
RSC will be replaced by RVS(Rocaloid Vocal Script), which describes the general (but not in detail) information of notes and lyrics (but not phonemes). CVS Generator will be responsible for transforming RVS into CVS. The transformation from RSC (or .vsqx, .vsq, .ust, .nn, etc.) to RVS should be simple (does not require professional phonetics knowledge).
Altogether, the major components and formats in RE3(Rocaloid Engine 3) will be:
- CVE 3
- CGTOR 3 (Cvs GeneraTOR 3)
- CVS 3
- RVS 3
- CDT 3
Additionally, CVS 3 and RVS 3 will be stored in binary instead of text. This is because formant data will be included in CVS and RVS, which will greatly increase the file size, and slow down the IO performance. (approximately a CVS 3 text file which contains a song will be 10MB)
RVS3:
图片附件: Rocaloid3.png (2013/9/1 11:19, 19.32 KB) / 下载次数 28
http://bbs.ivocaloid.com/forum.php?mod=attachment&aid=NzMzNDB8YWJkNmEzMzF8MTc1MzQ1NDg5M3wwfDA%3D

图片附件: CVS3Structure.png (2013/9/1 11:18, 65.16 KB) / 下载次数 32
http://bbs.ivocaloid.com/forum.php?mod=attachment&aid=NzMzMzl8MjkxOWVjMTN8MTc1MzQ1NDg5M3wwfDA%3D

图片附件: RVS3Structure.png (2013/9/1 16:57, 79.04 KB) / 下载次数 30
http://bbs.ivocaloid.com/forum.php?mod=attachment&aid=NzMzNDh8YTNjZDc0NWZ8MTc1MzQ1NDg5M3wwfDA%3D

作者: Zleepwalking 时间: 2013/9/17 18:37
刚刚经斟酌决定先使用RDL的纯文本格式储存CVS/RVS,等结构稳定了再使用Binary。1. 便于调试 2. Binary扩展性差。
欢迎光临 iVocaloid论坛 (http://bbs.ivocaloid.com/) |
Powered by Discuz! X2 |