Futa (Kai) Waseda

Futa (Kai) Waseda

PhD Student in Information Science and Technology

The University of Tokyo

Biography

Welcome to my homepage! I am Futa Waseda, a PhD student in Information Science and Technology at The University of Tokyo.

My research focuses on the robustness and reliability of deep learning models. I am particularly fascinated by computer vision and vision-language multi-modal learning, aiming to understand and mitigate real-world risks.

Explore my work and join me in the journey to make AI more reliable and create a better future!

Interests

  • Deep Learning
  • Computer Vision
  • Vision-Language Model
  • Robustness, Reliability

Education

  • BEng in Systems Innovation, 2020

    The University of Tokyo

  • MS in Informatics, 2023

    The University of Tokyo

News

  • 2024.05: 1 paper (as 1st author) accepted at MIRU2024 Oral presentation.
  • 2024.03: Reviewed 5 papers for ICML2024.
  • 2024.02 - (current): I started research internship at CyberAgent AI Lab
  • 2024.11: Reviewed 2 papers for ICLR2024.
  • 2023.10: Reviewed 1 paper for IEEE TIFS2024.
  • 2023.09: Selected for research fellowship JST DC2.
  • 2023.08-2023.12: Research internship at NEC Japan.
  • 2023.07: Reviewed 1 paper for NeurIPS2023.
  • 2023.07: Attended ICML2023 at Honolulu (Hawaii, USA) in person.
  • 2023.07 - 2024.03: Received a research grant of 1 million yen from AIP Challenge Program, JST.
  • 2023.04: 1 paper (as 1st author) accepted at ICML2023.
  • 2023.04: Started PhD at The University of Tokyo.
  • 2023.03: Selected for research fellowship JST SPRING GX.
  • 2023.01: Attended WACV2023 at Waikoloa (Hawaii, USA) in person.
  • 2022.10: 1 paper (as 1st author) accepted at WACV2023.

Skills

python

Machine Learning/Deep Learning

Teamwork

Experience

 
 
 
 
 

Research Internship

CyberAgent AI Lab

Feb 2024 – Present Tokyo, Japan.

Research keywords:

  • Deep Learning
  • Vision-Language Models
  • Adversarial Robustness
 
 
 
 
 

Research Internship

NEC Corporation

Aug 2023 – Dec 2023 Tokyo, Japan.

Research keywords:

  • Deep Learning
  • Computer Vision
  • Adversarial Robustness
  • Parameter-Efficient Training
 
 
 
 
 

Exchange Student

Technical University of Munich

Apr 2021 – Mar 2022 Munich, Germany.
Conducted research, supervised by Christian Tomani.
 
 
 
 
 

Machine learning Engineer

Ollo inc.

May 2020 – Present Tokyo, Japan

Responsibilities include:

  • Researcher
  • Data Scientist
  • Software Engineer
 
 
 
 
 

Research Assistant

National Institute of Informatics

May 2020 – Present Tokyo, Japan.

Research keywords:

  • Deep Learning
  • Computer Vision
  • Adversarial Robustness
 
 
 
 
 

Masters Student

The University of Tokyo

Apr 2020 – Mar 2023 Tokyo, Japan
Supervised by Prof. Isao Echizen.

Recent Publications

See all publications.
[arXiv] Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off

[arXiv] Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off

Although adversarial training has been the state-of-the-art approach to defend against adversarial examples (AEs), it suffers from a robustness-accuracy trade-off, where high robustness is achieved at the cost of clean accuracy. In this work, we leverage invariance regularization on latent representations to learn discriminative yet adversarially invariant representations, aiming to mitigate this trade-off. We analyze two key issues in representation learning with invariance regularization: (1) a gradient conflict between invariance loss and classification objectives, leading to suboptimal convergence, and (2) the mixture distribution problem arising from diverged distributions of clean and adversarial inputs. To address these issues, we propose Asymmetrically Representation-regularized Adversarial Training (AR-AT), which incorporates asymmetric invariance loss with stop-gradient operation and a predictor to improve the convergence, and a split-BatchNorm (BN) structure to resolve the mixture distribution problem. Our method significantly improves the robustness-accuracy trade-off by learning adversarially invariant representations without sacrificing discriminative ability. Furthermore, we discuss the relevance of our findings to knowledge-distillation-based defense methods, contributing to a deeper understanding of their relative successes.

Projects

Twitter Image Captioning

Twitter Image Captioning

Made a model which outputs text from a image like human tweets, using Encoder-Decoder Model. Application of image captioning technique.

Oshaberi-Bot(おしゃべりぼっと)

Oshaberi-Bot(おしゃべりぼっと)

My first twitter bot app. He learns japanese from his followers, by fitting retrieved data to Markov model.

trip map

trip map

Demo web application I made in school. In this app, you can clip the place you want to go in the future, find the shortest way to go through the chosen spots. I was responsible for front-end system using html, css, javascript.

Awards

Won the special prize at SAS analytics hackathon 2019.(SAS社のThe Analytics Hackathon 2019にて特別賞)

In the contest, participants were given data and asked to construct machine learning system with high accuracy. (article url: https://enterprisezine.jp/article/detail/12209?p=2)

Won the first prize at MDS data science contest 2018.(MDSデータサイエンスコンテストで優勝)

In the contest, participants were given big data and asked to perform value-generating analysis freely. Our group won the first prize and we were able to submit a paper. See the publication section.

Accomplish­ments

Summer School for Deep Generative Models 2020

Learned deep generative models from basics to state-of-the-art.

Chair for Global Consumer Intelligence (GCI 2018)

Learned how to utilize big data by machine learning technology.

Contact