Pelin Dogan

About

Welcome! I am a software engineer at Google. Previously, I was a postdoctoral researcher at Media Technology Center ETH Zürich, where I was leading the project on the development of the first voice assistant that can speak different Swiss German dialects by creating low-resourced neural machine translation and text-to-speech synthesis models (project video). Before, I was a joint doctoral student of Computer Science at Disney Research Studios and in Computer Graphics Laboratory at ETH Zürich, where I was advised by Markus Gross . I completed my masters studies at the Department of Electrical Engineering at EPFL, and spent time at EMPA, Disney Research Zürich, and Disney Research Pittsburgh as an intern, and University of British Columbia as a visitor.

Research

My general research interests lie in image processing, video processing, natural language processing, speech synthesis, visual-textual data alignment, computer vision. More specifically, my research is mostly about exploring the correspondances between visual, textual and audio elements.

Publications

2023

Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations Nikolaos-Antonios Ypsilantis, Kaifeng Chen, Bingyi Cao, Mário Lipovský, Pelin Dogan-Schönberger, Grzegorz Makosa, Boris Bluntschli, Mojtaba Seyedhosseini, Ondřej Chum, André Araujo International Conference on Computer Vision (ICCV)

PDF / Bibtex

2021

SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German Pelin Dogan-Schönberger, Julian Mäder, Thomas Hofmann arXiv preprint

PDF / Bibtex / Relevant Video / Audio Samples / Data Access

2020

Enriching Video Captions With Contextual Text Philipp Rimle, Pelin Dogan-Schönberger, Markus Gross International Conference on Pattern Recognition 2020 (ICPR)

PDF / Bibtex / Video

2019

Neural Sequential Phrase Grounding (SeqGROUND) Pelin Dogan, Leonid Sigal, Markus Gross Conference on Computer Vision and Pattern Recognition 2019 (CVPR)

PDF / PDF (supp.) / Bibtex
Controlling Motion Blur in Synthetic Long Time Exposures Marcel Lancelle, Pelin Dogan, Markus Gross Eurographics 2019

PDF / PDF (supp.) / Video / Bibtex

2018

A Neural Multi-sequence Alignment TeCHnique (NeuMATCH) Pelin Dogan, Boyang Li, Leonid Sigal, Markus Gross Conference on Computer Vision and Pattern Recognition 2018 (CVPR) (Spotlight)

PDF / PDF (supp.) / Bibtex

2016

Label-Based Automatic Alignment of Video with Narrative Sentences Pelin Dogan, Markus Gross, Jean-Charles Bazin European Conference on Computer Vision 2016, Workshop on Web-scale Vision and Social Media

PDF / Bibtex
A Simple, Fast and Low-cost Method for in Situ Monitoring of Topographical Changes and Wear Rate of a Complex Tribo-system under Mixed Lubrication Bastian Meylan, Pelin Dogan, Daniel Sage, Kilian Wasmer Wear 364 (2016)

PDF / Bibtex

2015

Key-frame Based Spatiotemporal Scribble Propagation Pelin Dogan, Tunc Ozan Aydin, Nikolce Stefanoski, Aljoscha Smolic Proceedings of the Eurographics Workshop on Intelligent Cinematography and Editing

Website / PDF / PDF (supp.) / Video / Bibtex

Patents

Alignment of video and textual sequences for metadata analysis US Patent No: US10558761B2, 2020 Techniques for performing contextual phrase grounding US Patent Application No: US20200272695A1, 2020

Pelin Dogan Schönberger Software EngineerGoogle pelindogan08@gmail.com