ChinaXiv.org 中国科学院科技论文预发布平台

按提交时间

按主题分类

按作者

按机构

当前资源共 55条

隐藏摘要

点击量

时间

下载量

您选择的条件: 自然语言理解与机器翻译

1. ChinaXiv:202408.00082
下载全文

一种改进Google人工智能AlphaStar的动态算法

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-08-10

郭峰

摘要： [目的] 分析AlphaStar在星际争霸2中败给人类选手的原因，并提出一种更好的策略算法。[方法] 提出了一种动态策略算法，根据敌方策略改变自身策略；设计了一种默认策略方法，随机选取默认策略以避免被对手预判；实现了jiahuibot代码以验证策略的可行性和效果，并对多人对战方法论提供了理论指导。[结果] 动态策略算法和默认策略算法有助于提高人工智能在即时战略游戏中的能力，jiahuibot的实验验证了策略的可行性，为AlphaStar及其他星际争霸AI研究员提供了实施指导。[局限]研究局限于星际争霸2游戏内的策略算法，未涉及更广泛的现实领域应用。[结论] 研究有助于提升AI在即时战略游戏中的效果，并为后续研究提供了完善方向。

同行评议状态:待评议

点击量 59 下载量 12 评论 0
2. ChinaXiv:202406.00020
下载全文

面向低资源语言机器翻译的平行语料句对齐评分

分类：语言学及应用语言学 >> 语言学及应用语言学分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-06-05

李林霞陈波周毛克赵小兵

摘要：目的量化低资源语言平行语料的句对齐评分，获取高质量平行语料，提升机器翻译的性能。方法提出基于神经网络的无监督句嵌入双语平行语料句对齐评分方法 NeuroAlign：将平行句对嵌入至同一向量空间，计算平行语料中给定候选句对的对齐评分，然后根据评分排序过滤分值较低的平行句对，获得高质量的低资源语言双语平行语料。结果 BUCC2018 平行文本挖掘任务中 F1 值可提升 0.5-0.8；CCMT2021 低资源语言神经机器翻译中 BLEU 值可提升 0.1-10.9；句对齐评分可接近人工评分。局限限于低资源双语平行语料的资源匮乏，未在藏汉、维汉、蒙汉以外的语言对上进行探索研究。结论可以有效应用至低资源语言平行语料的句对齐评分，从数据源端提升语料质量，进而改进机器翻译的效果。

通过

点击量 794 下载量 216 评论 0
3. ChinaXiv:202402.00204
下载全文

Does GPT-4 Play Dice?

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-02-20

Qiang Liu

摘要： OpenAI's Generative Pre-trained Transformer 4 (GPT-4) is a powerful large language model with a certain degree of intelligence in understanding and generating coherent text. We are exploring whether GPT-4 is capable of acting as a die, i.e. generating random numbers. We show that GPT-4 does not appear to generate independent and identically distributed random numbers. Examples imply that GPT-4 tries to compensate for the uniformity of random numbers by sacrificing independence when acting as a die.

同行评议状态:待评议

点击量 1356 下载量 319 评论 0
4. ChinaXiv:202401.00173
下载全文

大语言模型时代的语言学研究新机遇-以歧义分析为例

分类：语言学及应用语言学 >> 语言学及应用语言学分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-11

邵研

摘要：以GPT系列为代表的大规模预训练语言模型的快速发展，深刻改变了自然语言处理领域的科研与工程范式，对医疗、教育、司法、金融等相关领域产生了深远影响。同时，这也为语言本身的研究带来了一些新的可能性。本文从歧义分析出发，简要评估GPT4、百川2、ChatGLM3等模型对以歧义为代表的复杂语言现象的理解和分析能力。实验结果表明，GPT4可以融合歧义消解和句法分析等方法，有效感知和理解复杂的语言现象。对于百川2，我们可以通过提示词工程引导其对语言现象进行深入思考，在不进行参数优化时，提升其分析能力。此外，通过监测大模型在处理不同语言现象时的内部特征与神经元活动，可以直观展现语言现象与大模型之间的关系。实验结果表明，大语言模型可以辅助人类更好地理解语言的本质，揭示语言现象深层次规律，从而为语言学研究提供新的思路。

通过

点击量 2223 下载量 518 评论 0
5. ChinaXiv:202401.00080
下载全文

基于多层感知机的图像分类

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-07

尹怀志

摘要：多层感知机（MLP）是一种前馈神经网络，通过在网络中加入一个或者更多个隐藏层，克服了线性模型的限制，打开了深度学习的大门。本文利用了多层感知机完成图像分类，在Fashion MNIST数据集上进行了探索，并尝试迁移到MNIST数据集中。在Fashion MNIST上我们进行特征预处理后，选择了不同的优化方法并进行比较，此外分别通过增加丢弃法和权重衰减法等正则化方法，实现了对多层感知机的优化、改进。通过实验表明，适当的特征处理能够提高模型的数值稳定性。动量法显著提高了模型效果，同时权重衰减等方法对提高模型的泛化效果起到了帮助。

同行评议状态:待评议

点击量 1079 下载量 267 评论 0
6. ChinaXiv:202401.00082
下载全文

基于深度学习的中文命名实体识别

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-07

李昕蓉

摘要：该中文命名实体识别项目的目标主要包括以下两个方面。首先是实现高精度的中文命名实体识别，通过对中文文本进行深度学习，提高中文实体识别的准确率，减少误识别和漏识别的现象。其次是实现标准化流程建立，形成一套标准化的中文命名实体识别流程，包括数据预处理、模型训练、实体识别等，为后续研究提供基础。代码提交在了GitHub，网址为https://github.com/Blue88888/DL_CNER。

同行评议状态:待评议

点击量 816 下载量 178 评论 0
7. ChinaXiv:202401.00086
下载全文

基于长短期记忆网络的中文命名实体识别

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-07

管浩宇

摘要：在文献调研过程中，我们关注到有关LSTM模型的命名实体识别相关工作，该种模型在命名实体识别领域具有十分广泛的应用，英文NER领域发展较快，因项目目标定为拟实现中文命名实体识别工作，所以在关注中文的NER发展情况过程中，发现了LSTM在中文NER中的逐步应用，LSTM是RNN的一种变体，其核心概念在于细胞状态以及门结构。

同行评议状态:待评议

点击量 654 下载量 212 评论 0
8. ChinaXiv:202401.00105
下载全文

中文命名实体识别

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-07

罗飞彬

摘要：针对目前中文命名实体识别研究中存在的语义特征提取不充分、不全面等问题，Transformers(BERT)在各种相关 NLP 任务中显示出惊人的改进，并且已经提出了连续的变体来进一步提高预训练语言模型的性能。在本文中，我们的目标是重新审视中文预训练语言模型，以检验它们在非英语语言中的有效性。本文基于 RoBERT 模型进行微调，实验结果表明，在许多 NLP 任务上表现良好。

同行评议状态:待评议

点击量 677 下载量 211 评论 0
9. ChinaXiv:202401.00052
下载全文

小样本条件下基于条件变分自编码器的数据生成方法研究

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-05

叶立锴

摘要：本文以小样本问题为基础，分别建立一维振动信号和二维振动信号的条件变分自编码器网络模型，将整理好的数据集代入模型中，利用生成模型良好的生成数据能力，生成新的不同类型的故障分量信号，然后将生成的分量信号进行重构，并通过原始信号与重构信号的对比图来初步观察数据的生成效果，然后通过降维可视化、余弦相似度、最大均值差异来验证生成重构信号与原始信号之间的相似性，用信号的最大值、最小值等参数来验证多样性，综合评估生成数据的质量。最后，建立故障信息分类模型，设置三类故障原始样本数量为10、30、50，获得故障信息分类的准确率分别为43.3%、64.4%、84.7%，样本扩充后，故障信息分类的准确率为98.3%，充分验证了本方法生成故障数据的准确性。

同行评议状态:待评议

点击量 757 下载量 221 评论 0
10. ChinaXiv:202401.00045
下载全文

大模型与易学研究浅议

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-04

杨浩邓泽琨

摘要：本文从易学研究的特点、大语言模型应用的主要场景出发，探讨了大语言模型在易学研究方面的可能方向，并以大语言模型在易占中的具体应用作为实例进行说明，旨在探索大语言模型在易学研究中的潜在作用。易学研究以其博大的文化内涵、深奥的理论体系和实用的应用价值而备受关注，具有专业性强，入门门槛高等特点。大语言模型在自然语言处理、知识图谱构建等领域取得显著成果，主要应用场景包括机器翻译、文本摘要、智能问答、语义理解等。然而，在易学研究中，大语言模型的应用远未充分发挥，为此，本文提出了大语言模型在易学研究中可能的方向，包括易学文献分析、易占结果解读等。最后，通过详细阐述大语言模型在易学研究中的应用实例易占，揭示了大语言模型在易学领域的潜力。通过对易学研究特点的深入理解和对大语言模型应用场景的充分挖掘，将有助于推动大语言模型在易学研究中的更广泛应用，提升研究效率，促进文化传承。

同行评议状态:待评议

点击量 686 下载量 155 评论 0
11. ChinaXiv:202401.00038
下载全文

Predicting League of Legends Match Results Based on Machine

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-03

Wang Donghua

摘要： League of Legends (LoL) is a highly popular multiplayer online competitive game, featuring intricate game mechanics and team cooperation, making the prediction of match outcomes a challenging task. This study utilizes a dataset from Kaggle, comprising 9,879 ranked matches ranging from Diamond I to Master tier, to build a machine learning model predicting the ultimate winner, either the blue or red team, based on the features of the first 10 minutes of gameplay. Through steps such as data loading, preprocessing, and feature engineering, we provided effective inputs for the model. For model selection, we opted for the Logistic Regression algorithm, achieving a model accuracy of 0.7277 through data splitting and training. This accuracy robustly supports predictions of the winning side, whether blue or red. However, to further enhance model performance, we recommend exploring additional feature en#2;gineering methods, investigating alternative machine learning algorithms, and fine-tuning hyperpa#2;rameters. The introduction of deep learning models is also a promising avenue to better capture the complex relationships within the game. Through these improvements, we anticipate increasing the models predictive accuracy for future matches, offering valuable insights for game development and enhancement.

同行评议状态:待评议

点击量 1245 下载量 186 评论 0
12. ChinaXiv:202401.00039
下载全文

基于Bi-LSTM和CRF的中文命名实体识别方法实现

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2024-01-03

焦新宇

摘要：中文命名实体识别（NER）在自然语言处理领域扮演着关键角色，对于提高信息提取和文本理解的效果至关重要。本文实现了一种基于双向长短时记忆网络（Bi-LSTM）和条件随机场（CRF）的中文命名实体识别方法。首先，我们通过Bi-LSTM网络对输入的中文文本进行特征抽取，利用双向序列学习的优势捕捉上下文信息，从而更好地理解语境。该网络能够有效处理长距离依赖关系，提高了对文本中复杂结构和嵌套实体的识别能力。其次，引入CRF作为序列标注任务的解码层，对Bi-LSTM的输出进行全局优化，以考虑实体标签之间的关系。CRF模型通过捕捉标签序列的全局依赖性，有助于纠正局部错误，提高了NER系统的整体性能。实验结果表明，Bi-LSTM-CRF模型在中文NER任务中表现出色，在各项指标中取得了较好的性能。

同行评议状态:待评议

点击量 740 下载量 171 评论 0
13. ChinaXiv:202401.00033
下载全文

不会一直下降的大模型交叉熵

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2023-12-17

何沧平

摘要：训练大语言模型时，损失函数值会一直下降，难于确定最佳停止时机。本文设计了一个定长交叉熵，使得模型损失不会一直下降，在模型充分训练以后就保持不变，便于选择训练停止时间，节省训练成本。

同行评议状态:待评议

点击量 729 下载量 139 评论 0
14. ChinaXiv:202311.00040
下载全文

A Novel Framework for Future Natural Language Processing From a Database Perspective

分类：计算机科学 >> 自然语言理解与机器翻译提交时间： 2023-11-01

Limin Zhang

摘要： Most research and applications on natural language still concentrate on its superficial features and structures. However, natural language is essentially a way of encoding information and knowledge. Thus, the focus should be on what is encoded and how it is encoded. In line with this, we suggest a database-based approach for natural language processing that emulates the encoding of information and knowledge to build models. Based on these models, 1) generating sentences becomes akin to reading data from the models (or databases) and encoding it following some rules; 2) understanding sentences involves decoding rules and a series of boolean operations on the databases; 3) learning can be accomplished by writing on the databases. Our method closely mirrors how the human brain processes information, offering excellent interpretability and expandability.

同行评议状态:待评议

点击量 927 下载量 228 评论 0
15. ChinaXiv:202311.00027
下载全文

A Conversation with ChatGPT: Think Tank Theory and Practice in the Age of AI

分类：管理学 >> 公共管理分类：计算机科学 >> 自然语言理解与机器翻译分类：图书馆学、情报学 >> 情报学分类：其他 >> 综合提交时间： 2023-10-31

Chen Yu

摘要： Purpose/significance ChatGPT is a chatbot program developed by OpenAI in the United States. A dialog with ChatGPT can provide insights into the theory and practice of think tanks. Method/process Currently, GPT-3.5 offers users a free query quota of 30 queries per day. Chen Yu has engaged in a dialog with ChatGPT on a number of issues related to the theory and practice of think tanks by creating an outline for the dialog. Result/conclusion AI technology, represented by ChatGPT, offers many opportunities for the think tank industry, including enhanced research capabilities, data-driven decision-making, and improved public engagement. However, it also poses challenges related to ethics, expertise, transparency, and workforce adaptability that think tanks need to seriously address. In the age of AI, Chinese think tanks and experts need to keep up with the trend and proactively adopt the AI technology represented by ChatGPT.

同行评议状态:待评议

点击量 1584 下载量 323 评论 0
16. ChinaXiv:202310.03455
下载全文

A Conversation with ChatGPT: Dialogue of Civilizations in the Age of AI

分类：其他 >> 综合分类：计算机科学 >> 自然语言理解与机器翻译分类：图书馆学、情报学 >> 情报学分类：管理学 >> 公共管理提交时间： 2023-10-30

Chen Yu

摘要： Purpose/significance ChatGPT is a chatbot program developed by OpenAI in the United States. Conversations with ChatGPT can shed light on Dialogue of Civilizations in the age of AI. Method/process Currently, GPT-3.5 offers users 30 free query credits per day. By creating an outline for the conversation, Chen Yu engaged in a dialog with ChatGPT on various issues of Dialogue of Civilizations. Result/conclusion Today, the Standard of Civilization has long been abandoned, and the Clash of Civilizations has been widely criticized. In the era of AI, the AI technology represented by ChatGPT can help promote the Dialogue of Civilizations, help realize real-time communication between people of different cultural backgrounds, enhance the understanding and appreciation of different civilizations, and identify and alleviate prejudices in the dialogue of civilizations. At the same time, the AI technology represented by ChatGPT can also help promote Dialogue within Civilizations and play a positive role in resolving civil conflicts, promoting the integration of immigrants, protecting the voices of vulnerable groups, giving full play to the unique value of women, and building an age-friendly society. However, AI technologies must be developed and used with caution and with due regard to ethical considerations, in particular to prevent AI algorithms from perpetuating prejudices and reinforcing existing inequalities.

同行评议状态:待评议

点击量 1414 下载量 347 评论 0
17. ChinaXiv:202310.03454
下载全文

对话ChatGPT：AI时代的“文明的对话”

分类：其他 >> 综合分类：计算机科学 >> 自然语言理解与机器翻译分类：图书馆学、情报学 >> 情报学分类：管理学 >> 公共管理提交时间： 2023-10-30

陈瑜

摘要：目的/意义 ChatGPT是美国OpenAI公司研发的一种聊天机器人程序。与ChatGPT进行对话，能够为AI时代的文明的对话提供启示。方法/过程目前，GPT-3.5每日向用户免费提供30次的查询额度。陈瑜通过精心设计对话提纲，与ChatGPT就文明的对话的若干问题展开了对话。结果/结论今天，文明的标准早已被摒弃，文明的冲突也受到了广泛的批评。在AI时代，以ChatGPT为代表的AI技术有助于促进文明的对话，帮助实现不同文化背景的人们之间的实时交流，增进对不同文明的理解和欣赏，识别和缓解文明对话中的偏见。同时，以ChatGPT为代表的AI技术也有助于促进文明内的对话，在解决国内冲突、促进移民融合、保护弱势群体的话语权、发挥妇女的独特价值、建设老年友好型社会等方面发挥积极作用。但是，在开发和运用AI技术时，必须谨慎从事，充分考虑伦理因素，特别是要防止AI算法延续偏见并强化现有的不平等。

同行评议状态:待评议

点击量 1731 下载量 396 评论 0
18. ChinaXiv:202310.03412
下载全文

A Conversation with ChatGPT: The Media and Communications Industry in the Age of AI

分类：数字出版 >> 数字新闻分类：计算机科学 >> 自然语言理解与机器翻译分类：图书馆学、情报学 >> 情报学分类：法学 >> 法律人工智能分类：其他 >> 综合提交时间： 2023-10-25

Chen Yu

摘要： Purpose/significance ChatGPT is a chatbot program developed by OpenAI in the United States. Conversations with ChatGPT can shed light on the media and communication industry in the age of AI. Method/process Currently, GPT-3.5 offers users 30 free query credits per day. By creating an outline for the conversation, Chen Yu engaged in a dialog with ChatGPT on various issues of the media and communication industry. Result/conclusion AI technology, represented by ChatGPT, has a huge impact on the media and communication industry. In the AI era, the media and communication industry should enthusiastically embrace AI technology and use it responsibly to provide a better experience for audiences. At the same time, the government, technology companies, civil society organizations, individuals, etc. should work together with the media and communication industry to solve the problems of fake news, cyber harassment, and information cocoon that may be brought about by AI technology.

同行评议状态:待评议

点击量 1419 下载量 291 评论 0
19. ChinaXiv:202310.03411
下载全文

对话ChatGPT：AI时代的新闻传播

分类：数字出版 >> 数字新闻分类：计算机科学 >> 自然语言理解与机器翻译分类：图书馆学、情报学 >> 情报学分类：法学 >> 法律人工智能分类：其他 >> 综合提交时间： 2023-10-25

陈瑜

摘要：目的/意义 ChatGPT是美国OpenAI公司研发的一种聊天机器人程序。与ChatGPT进行对话，能够为AI时代的新闻传播提供启示。方法/过程目前，GPT-3.5每日向用户免费提供30次的查询额度。陈瑜通过精心设计对话提纲，与ChatGPT就新闻传播的若干问题展开了对话。结果/结论以ChatGPT为代表的AI技术正在给新闻传播带来巨大的冲击。在AI时代，新闻传播行业要热情拥抱AI技术，负责任地运用AI技术，为受众提供更好的体验。同时，政府、技术公司、民间社会组织、个人等要和新闻传播行业一道，共同解决好AI技术可能带来的假新闻、网络骚扰、信息茧房等问题。

同行评议状态:待评议

点击量 1672 下载量 404 评论 0
20. ChinaXiv:202310.03395
下载全文

A Conversation with ChatGPT: Digital Government Transformation in the Age of AI

分类：计算机科学 >> 自然语言理解与机器翻译分类：管理学 >> 公共管理分类：法学 >> 法律人工智能分类：图书馆学、情报学 >> 情报学分类：其他 >> 综合提交时间： 2023-10-23

Chen Yu

摘要： Purpose/significance Today, countries around the world are accelerating their transformation to digital government. Conversations with ChatGPT can shed light on the digital government transformation in the age of AI. Method/process Currently, GPT-3.5 offers users 30 free query credits per day. By creating an outline for the conversation, Chen Yu engaged in a dialog with ChatGPT on various issues of the digital government transformation. Result/conclusion In the age of AI, AI technologies, such as ChatGPT, have the potential to revolutionize digital government transformation by increasing efficiency, improving service delivery, and enabling data-driven decision making. While the benefits are immense, governments must also address issues of ethics, bias, and workforce adaptation to ensure responsible and inclusive AI deployments that deliver better services and work outcomes for their citizens.

同行评议状态:待评议

点击量 1337 下载量 304 评论 0

1 2 3 后页尾页