
Guiding Large Language Models to Generate Computer-Parsable Content

Abstract: We propose a method for guiding Large Language Models (LLMs) to generate structured content that adheres to specific conventions, without fine-tuning. By applying coroutine-based generation constraints defined through a pre-agreed context-free grammar (CFG), LLMs are steered during decoding to produce outputs that conform to a formal language. This improves the stability and consistency with which target data structures, types, or instructions are generated, reducing application development complexity. Experimentally, the error rates of GPT-2 and Gemma exceed 95% for DSLs longer than 36 and 282 tokens, respectively. We introduce YieldLang, a coroutine-based framework for DSL generation, and evaluate it with LLMs on tasks including JSON and Mermaid flowchart generation. Compared to the benchmarks, our approach improves accuracy by a factor of 1.09 to 11.6, and LLMs need only about 16.5% of the samples to generate JSON effectively. This makes LLM-generated content more usable by computer programs.
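
To make the core idea concrete, below is a minimal, hypothetical sketch of coroutine-based generation constraints; it is not the paper's YieldLang API, and the names (`json_bool_pair`, `run`, the `Chooser` callback) are illustrative only. A generator yields the set of continuations the grammar allows at each step, and a driver feeds back the continuation the (constrained) decoder picked, so the final string conforms to the formal language by construction.

```python
# Hypothetical sketch of coroutine-based constrained generation (not YieldLang).
# The coroutine yields the continuations allowed by the grammar at each step;
# the driver sends back whichever one the decoder picked.
from typing import Callable, Generator, Sequence

Constraint = Sequence[str]               # continuations allowed at this step
Chooser = Callable[[Constraint], str]    # stands in for constrained LLM sampling

def json_bool_pair() -> Generator[Constraint, str, str]:
    """Build a tiny JSON object such as {"ok": true} under CFG-like constraints."""
    parts = []
    parts.append((yield ['{"ok": ']))        # fixed prefix: only one option
    parts.append((yield ['true', 'false']))  # the decoder chooses one literal
    parts.append((yield ['}']))              # fixed suffix closes the object
    return ''.join(parts)

def run(gen: Generator[Constraint, str, str], choose: Chooser) -> str:
    """Drive the coroutine, feeding each chosen continuation back until it returns."""
    try:
        allowed = next(gen)
        while True:
            picked = choose(allowed)
            assert picked in allowed, "decoder must respect the constraint"
            allowed = gen.send(picked)
    except StopIteration as stop:
        return stop.value

if __name__ == "__main__":
    # Stand-in chooser: take the first allowed option. A real system would
    # instead mask the LLM's logits so only `allowed` continuations survive.
    print(run(json_bool_pair(), lambda allowed: allowed[0]))  # -> {"ok": true}
```

In an actual constrained-decoding setup, the chooser would restrict the model's next-token distribution to prefixes of the allowed continuations, which is how the method keeps outputs computer-parsable without fine-tuning.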

Version History

[V2] 2024-04-23 14:22:16 ChinaXiv:202404.00272V2
[V1] 2024-04-21 22:45:22 ChinaXiv:202404.00272v1
Peer review status: pending review