ChinaXiv.org 中国科学院科技论文预发布平台

按提交时间

2022
5

按主题分类

计算机科学的集成理论
5

按作者

按机构

当前资源共 5条

隐藏摘要

点击量

时间

下载量

您选择的条件: Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China

1. ChinaXiv:202211.00390
下载全文

Overview of CCKS 2020 Task 3: Named Entity Recognition and Event Extraction in Chinese Electronic Medical Records

分类：计算机科学 >> 计算机科学的集成理论提交时间： 2022-11-27 合作期刊: 《数据智能（英文）》

Xia, Li Qinghua, Wen Hu, Lin Zengtao, Jiao Jiangtao, Zhang

摘要： The China Conference on Knowledge Graph and Semantic Computing (CCKS) 2020 Evaluation Task 3 presented clinical named entity recognition and event extraction for the Chinese electronic medical records. Two annotated data sets and some other additional resources for these two subtasks were provided for participators. This evaluation competition attracted 354 teams and 46 of them successfully submitted the valid results. The pre-trained language models are widely applied in this evaluation task. Data argumentation and external resources are also helpful.

点击量 543 下载量 137 评论 0
2. ChinaXiv:202211.00394
下载全文

Overview of SMP-CAIL2020-Argmine: The Interactive Argument-Pair Extraction in Judgement Document Challenge

分类：计算机科学 >> 计算机科学的集成理论提交时间： 2022-11-27 合作期刊: 《数据智能（英文）》

Jian, Yuan Zhongyu, Wei Yixu, Gao Wei, Chen Yun, Song Donghua, Zhao Jinglei, Ma Zhen, Hu Shaokun, Zou Donghai, Li Xuanjing, Huang

摘要： In this paper we present the results of the Interactive Argument-Pair Extraction in Judgement Document Challenge held by both the Chinese AI and Law Challenge (CAIL) and the Chinese National Social Media Processing Conference (SMP), and introduce the related data set SMP-CAIL2020-Argmine. The task challenged participants to choose the correct argument among five candidates proposed by the defense to refute or acknowledge the given argument made by the plaintiff, providing the full context recorded in the judgement documents of both parties. We received entries from 63 competing teams, 38 of which scored higher than the provided baseline model (BERT) in the first phase and entered the second phase. The best performing system in the two phases achieved accuracy of 0.856 and 0.905, respectively. In this paper, we will present the results of the competition and a summary of the systems, highlighting commonalities and innovations among participating systems. The SMP-CAIL2020-Argmine data set and baseline models have been already released.

点击量 1531 下载量 172 评论 0
3. ChinaXiv:202211.00395
下载全文

An Evaluation of Chinese Human-Computer Dialogue Technology

分类：计算机科学 >> 计算机科学的集成理论提交时间： 2022-11-27 合作期刊: 《数据智能（英文）》

Zixian, Feng Caihai, Zhu Weinan, Zhang Zhigang, Chen Wanxiang, Che Minlie, Huang Linlin, Li

摘要： There is a growing interest in developing human-computer dialogue systems which is an important branch in the field of artificial intelligence (AI). However, the evaluation of large-scale Chinese human-computer dialogues is still a challenging task. To attract more attention to dialogue evaluation work, we held the fourth Evaluation of Chinese Human-Computer Dialogue Technology (ECDT). It consists of few-shot learning in spoken language understanding (SLU) (Task 1) and knowledge-driven multi-turn dialogue competition (Task 2), the data sets of which are provided by Harbin Institute of Technology and Tsinghua University. In this paper, we will introduce the evaluation tasks and data sets in detail. Meanwhile, we will also analyze the evaluation results and the existing problems in the evaluation.

点击量 1547 下载量 160 评论 0
4. ChinaXiv:202211.00337
下载全文

AMiner: Search and Mining of Academic Social Networks

分类：计算机科学 >> 计算机科学的集成理论提交时间： 2022-11-25 合作期刊: 《数据智能（英文）》

Wan，Huaiyu Zhang，Yutao Zhang，Jing Tang，Jie

摘要： AMiner is a novel online academic search and mining system, and it aims to provide a systematic modeling approach to help researchers and scientists gain a deeper understanding of the large and heterogeneous networks formed by authors, papers, conferences, journals and organizations. The system is subsequently able to extract researchers profiles automatically from the Web and integrates them with published papers by a way of a process that first performs name disambiguation. Then a generative probabilistic model is devised to simultaneously model the different entities while providing a topic-level expertise search. In addition, AMiner offers a set of researcher-centered functions, including social influence analysis, relationship mining, collaboration recommendation, similarity analysis and community evolution. The system has been in operation since 2006 and has been accessed from more than 8 million independent IP addresses residing in more than 200 countries and regions.

点击量 750 下载量 197 评论 0
5. ChinaXiv:202211.00340
下载全文

XLORE2: Large-Scale Cross-Lingual Knowledge Graph Construction and Application

分类：计算机科学 >> 计算机科学的集成理论提交时间： 2022-11-25 合作期刊: 《数据智能（英文）》

Jin，Hailong Li，Chengjiang Zhang，Jing Hou，Lei Li ，Juanzi Zhang，Peng

摘要： Knowledge bases (KBs) are often greatly incomplete, necessitating a demand for KB completion. Although XLORE is an English-Chinese bilingual knowledge graph, there are only 423,974 cross-lingual links between English instances and Chinese instances. We present XLORE2, an extension of the XLORE that is built automatically from Wikipedia, Baidu Baike and Hudong Baike. We add more facts by making cross-lingual knowledge linking, cross-lingual property matching and fine-grained type inference. We also design an entity linking system to demonstrate the effectiveness and broad coverage of XLORE2.

点击量 820 下载量 221 评论 0

Overview of CCKS 2020 Task 3: Named Entity Recognition and Event Extraction in Chinese Electronic Medical Records

Overview of SMP-CAIL2020-Argmine: The Interactive Argument-Pair Extraction in Judgement Document Challenge

An Evaluation of Chinese Human-Computer Dialogue Technology

AMiner: Search and Mining of Academic Social Networks

XLORE2: Large-Scale Cross-Lingual Knowledge Graph Construction and Application