
{"id":13916,"date":"2025-09-05T07:01:06","date_gmt":"2025-09-05T07:01:06","guid":{"rendered":"https:\/\/whiteriversmediasolutions.com\/Sony\/laspa-breaking-language-barriers-in-speaker-recognition-with-prefix-tuned-cross-attention-copy\/"},"modified":"2025-09-05T07:11:23","modified_gmt":"2025-09-05T07:11:23","slug":"summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion","status":"publish","type":"post","link":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/","title":{"rendered":"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"13916\" class=\"elementor elementor-13916\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-cd44eb5 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"cd44eb5\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-9f11b70\" data-id=\"9f11b70\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-215a70e elementor-widget elementor-widget-heading\" data-id=\"215a70e\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">BLOGS<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section 
class=\"elementor-section elementor-top-section elementor-element elementor-element-28dc161 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"28dc161\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-63cf269\" data-id=\"63cf269\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6837436 elementor-widget elementor-widget-heading\" data-id=\"6837436\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9bd1630 elementor-widget elementor-widget-text-editor\" data-id=\"9bd1630\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\tIshan D. 
Biyani, Nirmesh Shah, Ashishkumar Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7a034cb elementor-hidden-desktop elementor-hidden-tablet elementor-hidden-mobile elementor-widget elementor-widget-text-editor\" data-id=\"7a034cb\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>30<sup>th<\/sup> September 2024<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-454e546 elementor-widget elementor-widget-text-editor\" data-id=\"454e546\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Nirmesh Shah summarises the paper titled \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2505.20756\">REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion<\/a>\u201d, co-authored by Ishan Biyani, Nirmesh Shah, Ashishkumar Gudmalwar, Pankaj Wasnik and Rajiv Ratn Shah and accepted at <a href=\"https:\/\/www.interspeech2025.org\/home\">the 26<sup>th<\/sup> edition of the INTERSPEECH conference<\/a>.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f0a3e28 elementor-widget elementor-widget-text-editor\" data-id=\"f0a3e28\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h4><strong>Introduction<\/strong><\/h4>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9202657 elementor-widget elementor-widget-text-editor\" data-id=\"9202657\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Speech time reversal refers to the process of reversing 
the entire speech signal in time, causing it to play backward. Such signals are completely unintelligible since the fundamental structures of phonemes and syllables are destroyed. However, they still retain tonal patterns that enable perceptual speaker identification despite losing linguistic content. In this paper, we propose leveraging speaker representations learned from time-reversed speech as an augmentation strategy to enhance speaker representation. Notably, speaker and language disentanglement in voice conversion (VC) is essential to accurately preserve a speaker\u2019s unique vocal traits while minimizing interference from linguistic content.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-682ac84 elementor-widget elementor-widget-text-editor\" data-id=\"682ac84\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><strong>To address the limitations of conventional speaker representations in VC, we introduce the following components:<\/strong><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f6c8504 elementor-widget elementor-widget-text-editor\" data-id=\"f6c8504\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<ul><li><strong>Speech Time Reversal (STR) as Data Augmentation<\/strong>: We propose the use of complete speech time reversal as a novel augmentation technique to enhance speaker representations. Unlike short-time reversal, STR produces unintelligible speech while preserving speaker-specific rhythmic and tonal patterns.<\/li><li><strong>Dual Speaker Embedding Fusion<\/strong>: Speaker embeddings are extracted from both the original and the time-reversed speech signals. 
These complementary representations are fused to capture a more robust and disentangled speaker identity.<\/li><li><strong>Diffusion-Based Voice Conversion with Enhanced Conditioning<\/strong>: The fused speaker embeddings are used to condition a diffusion-based VC model (as shown in Figure 1), enabling high-quality voice conversion with improved speaker similarity and generalization to unseen target voices.<\/li><\/ul>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-a7d1e72 elementor-widget elementor-widget-image\" data-id=\"a7d1e72\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"707\" height=\"323\" src=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh.png\" class=\"attachment-medium_large size-medium_large wp-image-13919\" alt=\"blog-nirmesh\" srcset=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh.png 707w, https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-300x137.png 300w\" sizes=\"(max-width: 707px) 100vw, 707px\" style=\"width:100%;height:45.69%;max-width:707px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c793aaa elementor-widget elementor-widget-text-editor\" data-id=\"c793aaa\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Fig. 
1: Block diagram of the proposed approach for diffusion-based voice conversion.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4364049 elementor-widget elementor-widget-text-editor\" data-id=\"4364049\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><strong>Results: <\/strong><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5f41fc9 elementor-widget elementor-widget-text-editor\" data-id=\"5f41fc9\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Demo samples are available on our <a href=\"https:\/\/research.sri-media-analysis.com\/interspeech25-rewind-vc\/\">Demo Page<\/a>.<\/p><p>We first conducted perceptual studies to determine whether speaker identity is retained in time-reversed speech signals. A total of 25 participants took part. We found that subjects could identify the correct speaker with 80.3% accuracy from the time-reversed speech signals. 
The corresponding confusion matrices for the perceptual study are shown in Figure 2.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7128f76 elementor-widget elementor-widget-image\" data-id=\"7128f76\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"595\" height=\"258\" data-src=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results.png\" class=\"attachment-medium_large size-medium_large wp-image-13920 lazyload\" alt=\"blog-nirmesh-results\" data-srcset=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results.png 595w, https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-300x130.png 300w\" data-sizes=\"(max-width: 595px) 100vw, 595px\" style=\"--smush-placeholder-width: 595px; --smush-placeholder-aspect-ratio: 595\/258;width:100%;height:43.36%;max-width:595px\" src=\"data:image\/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw==\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f457432 elementor-widget elementor-widget-text-editor\" data-id=\"f457432\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Figure 2: Confusion matrices for the perceptual study of speaker identification from time-reversed speech. 
Here, M1, M2, M3 and F1, F2, F3 represent three different male and female speakers, respectively.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0efe63b elementor-widget elementor-widget-text-editor\" data-id=\"0efe63b\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Additionally, to compare the proposed speech time reversal strategy with the short-time speech reversal approach, we analysed the spectrographic outputs from both methods, as shown in Figure 3. Our analysis revealed that, in the case of complete speech reversal, the harmonic structures are prominently visible, which strongly indicates the retention of speaker-specific information. The clear presence of these harmonic patterns suggests that even though the reversed speech is rendered unintelligible, it preserves critical acoustic cues such as timbre and pitch that are unique to the speaker.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-762718d elementor-widget elementor-widget-image\" data-id=\"762718d\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"750\" height=\"394\" data-src=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-1.png\" class=\"attachment-medium_large size-medium_large wp-image-13921 lazyload\" alt=\"blog-nirmesh-results\" data-srcset=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-1.png 768w, https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-1-300x157.png 300w\" data-sizes=\"(max-width: 750px) 100vw, 750px\" style=\"--smush-placeholder-width: 750px; --smush-placeholder-aspect-ratio: 
750\/394;width:100%;height:52.47%;max-width:768px\" src=\"data:image\/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw==\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-36de7e0 elementor-widget elementor-widget-text-editor\" data-id=\"36de7e0\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Fig. 3: Spectrographic visualization of (a) original speech, (b) 20 ms short-time speech reversal, (c) 100 ms short-time speech reversal, and (d) complete speech time reversal.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-79dee33 elementor-widget elementor-widget-text-editor\" data-id=\"79dee33\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Finally, we performed several subjective and objective evaluations to measure the effectiveness of the proposed data augmentation strategy in the context of VC. 
Table 1 provides a summary of the results, encompassing both subjective and objective assessments.\u00a0<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-57bbe2b elementor-widget elementor-widget-image\" data-id=\"57bbe2b\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"550\" height=\"317\" data-src=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-2.png\" class=\"attachment-medium_large size-medium_large wp-image-13922 lazyload\" alt=\"\" data-srcset=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-2.png 550w, https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/blog-nirmesh-results-2-300x173.png 300w\" data-sizes=\"(max-width: 550px) 100vw, 550px\" style=\"--smush-placeholder-width: 550px; --smush-placeholder-aspect-ratio: 550\/317;width:100%;height:57.64%;max-width:550px\" src=\"data:image\/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw==\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-cd9e70e elementor-widget elementor-widget-text-editor\" data-id=\"cd9e70e\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h4><strong>Conclusion<\/strong><\/h4>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-487414c elementor-widget elementor-widget-text-editor\" data-id=\"487414c\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>We leverage full-utterance speech time reversal (STR) as a targeted data-augmentation strategy within a diffusion-based voice-conversion pipeline to 
strengthen speaker embeddings. Whereas earlier efforts have applied reversal only to short segments or portions of the signal, our method inverts the entire utterance. This destroys intelligible linguistic content yet preserves global rhythmic and tonal contours, which, as our perceptual studies show, are alone sufficient to maintain speaker-specific information. This finding supports the fusion of conventional speaker embeddings with those derived from time-reversed speech, providing a robust means to disentangle speaker identity from linguistic content. Our experimental evaluations on the LibriSpeech and VCTK databases, using a diffusion-based voice conversion framework, reveal that the proposed approach significantly improves speaker similarity scores while maintaining high speech quality. These results underscore the potential of STR to overcome data limitations in zero-shot VC scenarios and pave the way for future research to further refine and integrate such unconventional signal transformations into voice conversion systems.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ef1ef1e elementor-widget elementor-widget-text-editor\" data-id=\"ef1ef1e\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h4><strong>Citation<\/strong><\/h4>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-817d816 elementor-widget elementor-widget-text-editor\" data-id=\"817d816\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>@inproceedings{rewind-vc-2025,<\/p><p>\u00a0\u00a0\u00a0 title={REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion},<\/p><p>\u00a0\u00a0 author={Biyani, Ishan and Shah, Nirmesh and Gudmalwar, Ashishkumar and Wasnik, 
Pankaj and Shah, Rajiv Ratn},<\/p><p>\u00a0\u00a0 booktitle={INTERSPEECH},<\/p><p>\u00a0\u00a0 year={2025}<\/p><p>\u00a0 }<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-61b5c1a elementor-widget elementor-widget-text-editor\" data-id=\"61b5c1a\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>To know more about Sony Research India\u2019s Research Publications, visit the \u2018Publications\u2019 section on our \u2018Open Innovation\u2019s page:\u00a0<a href=\"https:\/\/www.sonyresearchindia.com\/open-innovation\/\">Open Innovation with Sony R&amp;D \u2013 Sony Research India<\/a><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Nirmesh Shah summarises paper titled, 
\u201cREWIND&#8230;<\/p>\n","protected":false},"author":1,"featured_media":13926,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"elementor_header_footer","format":"standard","meta":{"footnotes":""},"categories":[22,17],"tags":[],"class_list":["post-13916","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-all-blogs","category-technology","entry"],"yoast_head":"\n<title>Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019 - Sony Research India<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019 - Sony Research India\" \/>\n<meta property=\"og:description\" content=\"Nirmesh Shah summarises paper titled, \u201cREWIND...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\" \/>\n<meta property=\"og:site_name\" content=\"Sony Research India\" \/>\n<meta property=\"article:published_time\" content=\"2025-09-05T07:01:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-05T07:11:23+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png\" \/>\n\t<meta property=\"og:image:width\" content=\"380\" \/>\n\t<meta 
property=\"og:image:height\" content=\"190\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"sri_user@2021\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"sri_user@2021\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\"},\"author\":{\"name\":\"sri_user@2021\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/person\/589cf1e285a7c37cf0cb9feba7ae4338\"},\"headline\":\"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice 
Conversion\u2019\",\"datePublished\":\"2025-09-05T07:01:06+00:00\",\"dateModified\":\"2025-09-05T07:11:23+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\"},\"wordCount\":910,\"publisher\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#organization\"},\"image\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png\",\"articleSection\":[\"All Blogs\",\"Technology\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\",\"url\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\",\"name\":\"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019 - Sony Research 
India\",\"isPartOf\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png\",\"datePublished\":\"2025-09-05T07:01:06+00:00\",\"dateModified\":\"2025-09-05T07:11:23+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage\",\"url\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png\",\"contentUrl\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png\",\"width\":380,\"height\":190,\"caption\":\"Blog 
Thumbnail\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#website\",\"url\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/\",\"name\":\"Sony Research India\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#organization\",\"name\":\"sonyresearchindia\",\"url\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2023\/03\/Sony_Logo.png\",\"contentUrl\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2023\/03\/Sony_Logo.png\",\"width\":168,\"height\":31,\"caption\":\"sonyresearchindia\"},\"image\":{\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/person\/589cf1e285a7c37cf0cb9feba7ae4338\",\"name\
":\"sri_user@2021\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e0c9edcfb42567c720cc449d4b1e0812298e8172a5a7e4296127a0adba7e705b?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e0c9edcfb42567c720cc449d4b1e0812298e8172a5a7e4296127a0adba7e705b?s=96&d=mm&r=g\",\"caption\":\"sri_user@2021\"},\"sameAs\":[\"http:\/\/whiteriversmediasolutions.com\/staging\/SRI\"]}]}<\/script>\n","yoast_head_json":{"title":"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019 - Sony Research India","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/","og_locale":"en_US","og_type":"article","og_title":"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019 - Sony Research India","og_description":"Nirmesh Shah summarises paper titled, \u201cREWIND...","og_url":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/","og_site_name":"Sony Research India","article_published_time":"2025-09-05T07:01:06+00:00","article_modified_time":"2025-09-05T07:11:23+00:00","og_image":[{"width":380,"height":190,"url":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png","type":"image\/png"}],"author":"sri_user@2021","twitter_card":"summary_large_image","twitter_misc":{"Written by":"sri_user@2021","Est. 
reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#article","isPartOf":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/"},"author":{"name":"sri_user@2021","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/person\/589cf1e285a7c37cf0cb9feba7ae4338"},"headline":"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019","datePublished":"2025-09-05T07:01:06+00:00","dateModified":"2025-09-05T07:11:23+00:00","mainEntityOfPage":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/"},"wordCount":910,"publisher":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#organization"},"image":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage"},"thumbnailUrl":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png","articleSection":["All Blogs","Technology"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/","url":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/","name":"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice 
Conversion\u2019 - Sony Research India","isPartOf":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#website"},"primaryImageOfPage":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage"},"image":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage"},"thumbnailUrl":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png","datePublished":"2025-09-05T07:01:06+00:00","dateModified":"2025-09-05T07:11:23+00:00","breadcrumb":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#primaryimage","url":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png","contentUrl":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2025\/09\/Blog-Thumbnail_-Nirmesh-Shah_STR_-INTERSPEECH25.png","width":380,"height":190,"caption":"Blog 
Thumbnail"},{"@type":"BreadcrumbList","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/summarizing-rewind-speech-time-reversal-for-enhancing-speaker-representations-in-diffusion-based-voice-conversion\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/whiteriversmediasolutions.com\/Sony\/"},{"@type":"ListItem","position":2,"name":"Summarizing \u2018REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion\u2019"}]},{"@type":"WebSite","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#website","url":"https:\/\/whiteriversmediasolutions.com\/Sony\/","name":"Sony Research India","description":"","publisher":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/whiteriversmediasolutions.com\/Sony\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#organization","name":"sonyresearchindia","url":"https:\/\/whiteriversmediasolutions.com\/Sony\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/logo\/image\/","url":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2023\/03\/Sony_Logo.png","contentUrl":"https:\/\/whiteriversmediasolutions.com\/Sony\/uvaftoap\/2023\/03\/Sony_Logo.png","width":168,"height":31,"caption":"sonyresearchindia"},"image":{"@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/person\/589cf1e285a7c37cf0cb9feba7ae4338","name":"sri_user@2021","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/whiteriversmediasolutions.com\/Sony\/#\/schema\/person\/image\/","ur
l":"https:\/\/secure.gravatar.com\/avatar\/e0c9edcfb42567c720cc449d4b1e0812298e8172a5a7e4296127a0adba7e705b?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e0c9edcfb42567c720cc449d4b1e0812298e8172a5a7e4296127a0adba7e705b?s=96&d=mm&r=g","caption":"sri_user@2021"},"sameAs":["http:\/\/whiteriversmediasolutions.com\/staging\/SRI"]}]}},"_links":{"self":[{"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/posts\/13916","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/comments?post=13916"}],"version-history":[{"count":5,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/posts\/13916\/revisions"}],"predecessor-version":[{"id":13927,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/posts\/13916\/revisions\/13927"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/media\/13926"}],"wp:attachment":[{"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/media?parent=13916"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/categories?post=13916"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/whiteriversmediasolutions.com\/Sony\/wp-json\/wp\/v2\/tags?post=13916"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}