What's New

We will present the following papers at ICASSP2023 (2023/6)

  • Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi:
    Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
  • Kak Soky, Sheng Li, Chenhui Chu, Tatsuya Kawahara:
    Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-based Speech Recognition of Low-resource Language

Post-doctoral researcher Huang Yin Jou starts working at the Graduate School of Informatics, Kyoto University as a program-specific assistant professor.

We will present the following papers at EACL2023 (2023/5).

  • Zhuoyuan Mao and Tetsuji Nakagawa:
    LEALLA: Learning Lightweight Language-agnostic Sentence Embedding with Knowledge Distillation
  • Qianying Liu, Wenyu Guan, Jianhao Shen, Fei Cheng and Sadao Kurohashi:
    ComSearch:Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision
  • Zhen Wan, Fei Cheng, Qianying Liu, Zhuoyuan Mao, Haiyue Song and Sadao Kurohashi:
    Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision (Findings)

Prof. Sadao Kurohashi will assume the position of Director of the National Institute of Informatics (NII) in April 2023. The cross-appointment procedure between Kyoto University and NII is under way.

We released the Japanese Culinary Interview Dialog Corpus (CIDC).

The following paper has been selected as Outstanding Paper of COLING2022. (10/15)

  • Kazumasa Omura and Sadao Kurohashi:
    Improving Commonsense Contingent Reasoning by Pseudo-data and its Application to the Related Tasks

We will present the following papers at EMNLP2022 (2022/12)

  • Zhen Wan, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi and Jiwei Li:
    Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction
  • Yibin Shen, Qianying Liu, Zhuoyuan Mao, Fei Cheng and Sadao Kurohashi:
    Textual Enhanced Contrastive Learning for Solving Math Word Problems (Findings)
  • Jumon Nozaki and Yugo Murawaki:
    Addressing Segmentation Ambiguity in Neural Linguistic Steganography

Research Overview

Language is the most reliable medium of human intellectual activities. Our objective is to establish the technology and academic discipline for handling and understanding language, in a manner that is as close as possible to that of humans, using computers. These include syntactic language analysis, semantic analysis, context analysis, text comprehension, text generation and dictionary systems to develop various application systems for machine translation and information retrieval.

Search Engine Infrastructure based on Deep Natural Language Processing

TSUBAKI.png

The essential purpose of information retrieval is not to retrieve just a relevant document but to acquire the information or knowledge in the document. We have been developing a next-generation infrastructure of information retrieval on the basis of the following techniques of deep natural language processing: precise processing based not on words but on predicate-argument structures, identifying the variety of linguistic expressions and providing a bird's-eye view of search results via clustering and interaction.

Machine Translation

EBMT.png

To bring automatic translation by computers to the level of human translation, we have been studying next-generation methodology of machine translation on the basis of text understanding and a large collection of translation examples. We have already accomplished practical translation on the domain of travel conversation, and constructed a translation-aid system that can be used by experts of patent translation.

Fundamental Studies on Text Understanding

To make computers understand language, it is essential to give computers world knowledge. This was a very hard problem ten years ago, but it has become possible to acquire knowledge from a massive amount of text in virtue of the drastic progress of computing power and network. We have successfully acquired linguistic patterns of predicate-argument structures from automatic parses of 7 billion Japanese sentences crawled from the Web using grid computing machines. By utilizing such knowledge, we study text understanding, i.e., recognizing the relationships between words and phrases in text.

Policy Regarding Acceptance of Students from Outside

Master course

PhD course

Access


Front page   New List of pages Search Recent changes   Help   RSS of recent changes