Backup of LANGUAGE-MEDIA-PROCESSING-LAB(No. 57) - LANGUAGE MEDIA PROCESSING LAB

What's New †

↑

We will present the following papers at ICASSP2023 (2023/6) †

Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi:
Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Kak Soky, Sheng Li, Chenhui Chu, Tatsuya Kawahara:
Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-based Speech Recognition of Low-resource Language

↑

Post-doctoral researcher Huang Yin Jou starts working at the Graduate School of Informatics, Kyoto University as a program-specific assistant professor. †

↑

We will present the following papers at EACL2023 (2023/5). †

Zhuoyuan Mao and Tetsuji Nakagawa:
LEALLA: Learning Lightweight Language-agnostic Sentence Embedding with Knowledge Distillation

Qianying Liu, Wenyu Guan, Jianhao Shen, Fei Cheng and Sadao Kurohashi:
ComSearch:Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision

Zhen Wan, Fei Cheng, Qianying Liu, Zhuoyuan Mao, Haiyue Song and Sadao Kurohashi:
Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision (Findings)

↑

Prof. Sadao Kurohashi will assume the position of Director of the National Institute of Informatics (NII) in April 2023. The cross-appointment procedure between Kyoto University and NII is under way. †

↑

We released the Japanese Culinary Interview Dialog Corpus (CIDC). †

↑

The following paper has been selected as Outstanding Paper of COLING2022. (10/15) †

Kazumasa Omura and Sadao Kurohashi:
Improving Commonsense Contingent Reasoning by Pseudo-data and its Application to the Related Tasks

↑

We will present the following papers at EMNLP2022 (2022/12) †

Zhen Wan, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi and Jiwei Li:
Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction
Yibin Shen, Qianying Liu, Zhuoyuan Mao, Fei Cheng and Sadao Kurohashi:
Textual Enhanced Contrastive Learning for Solving Math Word Problems (Findings)

Jumon Nozaki and Yugo Murawaki:
Addressing Segmentation Ambiguity in Neural Linguistic Steganography

Back Number

↑

Research Overview †

Language is the most reliable medium of human intellectual activities. Our objective is to establish the technology and academic discipline for handling and understanding language, in a manner that is as close as possible to that of humans, using computers. These include syntactic language analysis, semantic analysis, context analysis, text comprehension, text generation and dictionary systems to develop various application systems for machine translation and information retrieval.

↑

Search Engine Infrastructure based on Deep Natural Language Processing †

The essential purpose of information retrieval is not to retrieve just a relevant document but to acquire the information or knowledge in the document. We have been developing a next-generation infrastructure of information retrieval on the basis of the following techniques of deep natural language processing: precise processing based not on words but on predicate-argument structures, identifying the variety of linguistic expressions and providing a bird's-eye view of search results via clustering and interaction.

↑

Machine Translation †

To bring automatic translation by computers to the level of human translation, we have been studying next-generation methodology of machine translation on the basis of text understanding and a large collection of translation examples. We have already accomplished practical translation on the domain of travel conversation, and constructed a translation-aid system that can be used by experts of patent translation.

↑

Fundamental Studies on Text Understanding †

To make computers understand language, it is essential to give computers world knowledge. This was a very hard problem ten years ago, but it has become possible to acquire knowledge from a massive amount of text in virtue of the drastic progress of computing power and network. We have successfully acquired linguistic patterns of predicate-argument structures from automatic parses of 7 billion Japanese sentences crawled from the Web using grid computing machines. By utilizing such knowledge, we study text understanding, i.e., recognizing the relationships between words and phrases in text.

↑

Policy Regarding Acceptance of Students from Outside †

↑

Master course †

You can take the master course entrance exam of our department, but note that recently it is highly competitive.
Due to the capacity of our lab, we do not accept research students in principle, but you may contact in case that you have applied for a MEXT scholarship.

↑

PhD course †

You may contact in case that you have already published papers in top NLP conferences as the first author.
Being able to communicate in English or Japanese.

↑

Access †

Address
- Yoshida-honmachi, Sakyo-ku, Kyoto, 606-8501, Japan
- S208, Research Bldg. No.9, Yoshida Main Campus, Kyoto University
  - Map of Yoshida Main Campus
  - Location of Yoshida Main Campus
Contact
- Tel/Fax: +81-75-753-5962
- Email: contact at nlp.ist.i.kyoto-u.ac.jp