Neural speech editing enables seamless partial edits to speech utterances, modifying selected content while leaving the rest of the audio unchanged. This useful technique, however, also creates new deepfake risks. To encourage research on detecting such partially edited deepfake speech, we introduce PartialEdit, a deepfake speech dataset curated using advanced neural editing techniques. We explore both detection and localization tasks on PartialEdit. Our experiments reveal that models trained on the existing PartialSpoof dataset fail to detect partially edited speech generated by neural speech editing models. As almost all recent speech editing models involve neural audio codecs, we also provide insights into the artifacts that the detection model learns when detecting these deepfakes. Further information about the PartialEdit dataset and audio samples can be found on the project page: https://yzyouzhang.com/PartialEdit/index.html.
(Publisher abstract provided.)