ChinaXiv.org: Chinese Academy of Sciences Scientific Paper Preprint Platform

By submission year:
2022 (4)
2019 (1)

By institution:
Computer School, University of South China, 421001, China (2)
Center of Basic Molecular Science (CBMS), Department of Chemistry, Tsinghua University, Beijing, 100084, China (1)
City University of New York, New York, USA (1)
DataGrand Inc., Shanghai 201203, China (1)
Department of Library, Information and Archives Management, University of Chinese Academy of Science, Beijing 100190, China (1)
Hubei University of Technology, School of Computer Science, Wuhan 430068, China (1)
Hunan provincial base for scientific and technological innovation cooperation, Hunan, China (1)
National Science Library, Chinese Academy of Science, Beijing 100190, China (1)
Technical Training Center of State Grid Hubei Electric Power Co., Ltd., Wuhan 430070, China (1)

5 results in total


1. ChinaXiv:202211.00421

Bi-GRU Relation Extraction Model Based on Keywords Attention

Category: Computer Science >> Integrated Theory of Computer Science | Submitted: 2022-11-28 | Partner journal: Data Intelligence (English)

Yuanyuan Zhang, Yu Chen, Shengkang Yu, Xiaoqin Gu, Mengqiong Song, Yu Peng, Jianxia Chen, Qi Liu

Abstract: Relation extraction, which predicts semantic relationships between entities in a sentence, plays an important role in natural language processing. Most current models use natural language processing tools to capture high-level features and apply an attention mechanism to mitigate the adverse effects of sentence noise on the prediction results. However, in relation classification, these attention mechanisms do not take full advantage of the semantic information carried by keywords that express the relation in a sentence. We therefore propose a novel relation extraction model based on a keywords attention mechanism, named Relation Extraction Based on Keywords Attention (REKA). In particular, the proposed model uses a bi-directional GRU (Bi-GRU) to reduce computation and obtain the sentence representation, and extracts prior knowledge of the entity pair without any NLP tools. In addition to computing the entity-pair similarity, the keywords attention in REKA uses a linear-chain conditional random field (CRF) that combines entity-pair features, similarity features between entity-pair features, and its hidden vectors to obtain attention weights from the marginal distribution of each word. Experiments demonstrate that the proposed approach can exploit keywords carrying relational-expression semantics without the assistance of any high-level features and achieves better performance than traditional methods.

Views: 3021 | Downloads: 421
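The REKA abstract above says attention weights come from the marginal distribution of each word under a linear-chain CRF. The following is a minimal sketch of that idea only, not the authors' implementation. It assumes a binary latent state per word (0 = not a keyword, 1 = keyword), hand-picked unary and transition scores, and the hypothetical helper names `crf_marginals` and `keyword_attention`. In the paper the scores would be derived from entity-pair features and Bi-GRU hidden vectors; here they are plain numbers.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def crf_marginals(unary, trans):
    """Forward-backward for a linear-chain CRF.

    unary[i][s]: score of word i being in state s (assumed given).
    trans[a][b]: score of moving from state a to state b.
    Returns marginals[i][s] = P(z_i = s).
    """
    n, k = len(unary), len(trans)
    # Forward pass: alpha[i][b] = log-sum of path scores ending in state b at i.
    alpha = [list(unary[0])]
    for i in range(1, n):
        alpha.append([
            unary[i][b]
            + logsumexp([alpha[i - 1][a] + trans[a][b] for a in range(k)])
            for b in range(k)
        ])
    # Backward pass: beta[i][a] = log-sum of scores of everything after i.
    beta = [[0.0] * k for _ in range(n)]
    for i in range(n - 2, -1, -1):
        beta[i] = [
            logsumexp([trans[a][b] + unary[i + 1][b] + beta[i + 1][b]
                       for b in range(k)])
            for a in range(k)
        ]
    log_z = logsumexp(alpha[-1])  # log partition function
    return [[math.exp(alpha[i][s] + beta[i][s] - log_z) for s in range(k)]
            for i in range(n)]


def keyword_attention(unary, trans):
    """Attention weight of each word = marginal probability of the keyword state."""
    return [m[1] for m in crf_marginals(unary, trans)]


# Toy usage with made-up scores for a four-word sentence.
words = ["the", "company", "acquired", "startup"]
unary_scores = [[1.0, -1.0], [0.0, 0.5], [-0.5, 2.0], [0.0, 0.5]]
transition = [[0.2, 0.0], [0.0, 0.3]]  # mildly favours staying in the same state
for word, weight in zip(words, keyword_attention(unary_scores, transition)):
    print(f"{word}: {weight:.3f}")
```

Because each weight is a marginal, neighbouring words influence one another through the transition scores, which is what distinguishes this from an independent per-word softmax or sigmoid.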
2. ChinaXiv:202211.00387

Data Set and Evaluation of Automated Construction of Financial Knowledge Graph

Category: Computer Science >> Integrated Theory of Computer Science | Submitted: 2022-11-27 | Partner journal: Data Intelligence (English)

Wenguang Wang, Yonglin Xu, Chunhui Du, Yunwen Chen, Yijie Wang, Hui Wen

Abstract: With the technological development of entity extraction, relationship extraction, knowledge reasoning, and entity linking, research on knowledge graphs has been in full swing in recent years. To better promote the development of knowledge graphs, especially in the Chinese language and in the financial industry, we built a high-quality data set, named financial research report knowledge graph (FR2KG), and organized the evaluation of automated construction of financial knowledge graphs at the 2020 China Knowledge Graph and Semantic Computing Conference (CCKS2020). FR2KG consists of 17,799 entities, 26,798 relationship triples, and 1,328 attribute triples covering 10 entity types, 19 relationship types, and 6 attributes. Participants are required to develop a constructor that automatically constructs a financial knowledge graph based on FR2KG. In addition, we summarize the technologies for automatically constructing knowledge graphs and introduce the methods used by the winners and the results of this evaluation.

Views: 1254 | Downloads: 146
3. ChinaXiv:202211.00425

Ensemble Making Few-Shot Learning Stronger

Category: Computer Science >> Integrated Theory of Computer Science | Submitted: 2022-11-28 | Partner journal: Data Intelligence (English)

Qiang Lin, Yongbin Liu, Wen Wen, Zhihua Tao, Chunping Ouyang, Yaping Wan

Abstract: Few-shot learning has been proposed and is rapidly emerging as a viable means of completing various tasks. Many few-shot models have been widely used for relation learning tasks. However, each of these models falls short in capturing some aspect of semantic features, for example, CNNs on long-range dependencies and Transformers on local features. It is difficult for a single model to adapt to various relation learning tasks, which results in a high-variance problem. An ensemble strategy can be competitive in improving the accuracy of few-shot relation extraction and mitigating high-variance risks. This paper explores an ensemble approach to reduce the variance and introduces fine-tuning and feature attention strategies to calibrate relation-level features. Results on several few-shot relation learning tasks show that our model significantly outperforms the previous state-of-the-art models.

Views: 2601 | Downloads: 347
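The abstract above does not say how the ensemble members are combined. As a rough illustration of why an ensemble can reduce variance, the sketch below uses plain soft voting (averaging per-class probabilities), which is a swapped-in, generic technique and not necessarily the paper's method. The function name `ensemble_predict` and the example probabilities are hypothetical.

```python
def ensemble_predict(prob_lists):
    """Soft voting: average per-class probabilities across models.

    prob_lists: one probability list per model, all over the same relation classes.
    Returns (predicted_class_index, averaged_probabilities).
    """
    n_models = len(prob_lists)
    n_classes = len(prob_lists[0])
    avg = [sum(p[c] for p in prob_lists) / n_models for c in range(n_classes)]
    label = max(range(n_classes), key=lambda c: avg[c])
    return label, avg


# Toy usage: three hypothetical few-shot models (e.g. CNN-, Transformer-, and
# GRU-based encoders) scoring one query over three candidate relations.
model_outputs = [
    [0.6, 0.3, 0.1],  # model A prefers relation 0
    [0.2, 0.5, 0.3],  # model B prefers relation 1
    [0.1, 0.7, 0.2],  # model C prefers relation 1
]
label, averaged = ensemble_predict(model_outputs)
print(label, averaged)
```

Averaging dampens the effect of any single model's idiosyncratic error, so the combined prediction fluctuates less across tasks than the individual members do.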
4. ChinaXiv:202211.00151

Ensemble Making Few-Shot Learning Stronger

Category: Computer Science >> Other Disciplines of Computer Science and Technology | Submitted: 2022-11-15

Yongbin Liu (刘永彬)

Abstract: Few-shot learning has been proposed and is rapidly emerging as a viable means of completing various tasks. Many few-shot models have been widely used for relation learning tasks. However, each of these models falls short in capturing some aspect of semantic features, for example, CNNs on long-range dependencies and Transformers on local features. It is difficult for a single model to adapt to various relation learning tasks, which results in a high-variance problem. An ensemble strategy can be competitive in improving the accuracy of few-shot relation extraction and mitigating high-variance risks. This paper explores an ensemble approach to reduce the variance and introduces fine-tuning and feature attention strategies to calibrate relation-level features. Results on several few-shot relation learning tasks show that our model significantly outperforms the previous state-of-the-art models.

Status: Approved

Views: 2211 | Downloads: 388
5. ChinaXiv:201905.00012

Transfer Learning for Scientific Data Chain Extraction in Small Chemical Corpus with BERT-CRF Model

Category: Computer Science >> Natural Language Understanding and Machine Translation | Submitted: 2019-05-12

Na Pang, Li Qian, Weimin Lyu, Jin-Dong Yang

Abstract: Computational chemistry has developed rapidly in recent years owing to the rapid growth of and breakthroughs in AI. Thanks to progress in natural language processing, researchers can extract more fine-grained knowledge from publications to stimulate the development of computational chemistry. Because existing work and corpora for chemical entity extraction have been restricted to the biomedical or life science fields rather than chemistry, we build a new corpus in the chemical bond field annotated for 7 types of entities: compound, solvent, method, bond, reaction, pKa, and pKa value. This paper presents a novel BERT-CRF model to build scientific chemical data chains by extracting 7 chemical entities and relations from publications. We also propose a joint model to extract the entities and relations simultaneously. Experimental results on our Chemical Special Corpus demonstrate that we achieve state-of-the-art and competitive NER performance.

Peer review status: Pending review

Views: 24851 | Downloads: 2028
