CKIP Lab
Chinese Knowledge and Information Processing
The CKIP (Chinese Knowledge and Information Processing) group is a research team formed by the Institute of Information Science and the Institute of Linguistics of Academia Sinica in 1986. Its purpose is to establish a fundamental research environment for Chinese natural language processing. The preliminary goal of the project was to construct research infrastructures with reusable resources that could be shared by domestic and international research institutes. The accomplished resources include Chinese electronic dictionaries, Mandarin Chinese corpora, and processing technologies for Chinese texts. With these environments and technologies now well established, we are focusing on knowledge-based information processing. This area of research is motivated by the flood of information on the WWW for which effective and autonomous information processing tools are still lacking. To achieve high-level intelligent information processing, many of the most challenging research problems in the areas of knowledge acquisition, knowledge representation, and knowledge utilization are currently being addressed.
Join Us
News
Jan 2025 Our research paper — “Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification” has been accepted by “ICLR 2025”. |
Jun 2024 Our research paper — “Generating Attractive and Authentic Copywriting from Customer Reviews” has been accepted by “NAACL 2024”. |
Jun 2024 Our research paper — “Plug-in Language Model: Controlling Text Generation with a Simple Regression Model” has been accepted by “NAACL 2024 Findings”. |
Mar 2024 Our research paper — “Automatic Construction of a Chinese Review Dataset for Aspect Sentiment Triplet Extraction via Iterative Weak Supervision” has been accepted by “LREC-Coling 2024”. |
Nov 2022 Our research paper — “HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model” has been accepted by “ROCLING 2022”. |
Research Areas
Deep Learning
Deep learning architectures such as deep neural networks, deep belief networks, recurrent neural networks and convolutional neural networks have been applied to fields including computer vision, speech recognition, natural language processing, audio recogn……
Knowledge Representation
On knowledge representation area, we focus on the basic theory of knowledge ontology structure and the representation models for meticulous semantics. By analysis the nuance of synonyms, we found the representation method for meticulous semantics, and know……
Language Processing
We focus on concept-centric Chinese processing technology. The developed technology uses the statistics, language grammar, and common sense information obtained by automatic extraction as the basic knowledge to analyze the conceptual structure of the file ……