Category: Library Science, Information Science >> Collection and Custody of Information Materials Submitted: 2016-02-22
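Abstract: [Purpose/Significance] Drawing on the content of data provenance and the characteristics of long-term preservation, this paper comprehensively studies and analyses the application of data provenance in long-term preservation, providing a reference for organising and managing provenance in long-term preservation systems. [Method/Process] It analyses how relevant standards in the long-term preservation field, such as OAIS, PREMIS and TRAC, interpret provenance and what they require of it, and compares how provenance is applied in existing long-term preservation systems. [Results/Conclusions] It proposes an event-centred provenance management framework for long-term preservation and summarises the detailed content of provenance, its capture methods, organisation schemes, storage and packaging strategies, and technical solutions.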
Category: Library Science, Information Science >> Library Science Submitted: 2016-02-02
Abstract: If they find words closely related to their target task in titles or abstracts, which are defined as intelligent sensitive words in this paper, they will decide to read these resources in depth to find more useful information. These sensitive words may refer to specific persons, organisations, programmes, terms and so on. All of these sensitive words and their featured information are included in our object. Based on the above processes, we have determined that some features could be used for automatic computation. (Excerpt from "Profiling science and innovation policy by object-based computing", p. 587.) Authority of source: if a piece of news is released on an official website, then this resource is more reliable than other resources from no
Category: Library Science, Information Science >> Library Science Submitted: 2016-02-02
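Abstract: [Objective] To build a Web archiving system for important international research institutions. [Methods] The crawling and archiving framework is extended on the basis of IIPC open-source software: a three-layer extension strategy is adopted on the crawling side, management functions such as automatic upload and reporting are added to the crawl client, a WARC file content parsing module is developed, and Solr is used for indexing. [Results] The three-layer extension is implemented on the crawling side; the added crawl client functions raise the degree of automation of the archiving workflow; the added WARC file content parsing extracts more information and extends the indexing and retrieval services. [Limitations] The system has not been validated with large-scale crawling and archiving. [Conclusions] The extended crawling and archiving framework is, in preliminary form, distributed, scalable and fully automated.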
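As an illustration of the WARC content parsing step mentioned in the last abstract, the following is a minimal sketch and not the authors' module. It assumes uncompressed WARC 1.0 input, uses only the Python standard library, and stops short of sending anything to Solr. The helper names (`iter_warc_records`, `to_index_doc`, `make_record`) and the flat document field names are hypothetical.

```python
import io


def iter_warc_records(stream):
    """Yield (headers, payload) for each record in an uncompressed WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if line.strip() == b"":
            continue  # skip the blank lines that separate records
        version = line.strip().decode("utf-8")
        if not version.startswith("WARC/"):
            raise ValueError(f"unexpected record start: {version!r}")
        headers = {"version": version}
        # Read named header fields until the blank line that ends the header.
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break
            name, _, value = line.decode("utf-8").partition(":")
            headers[name.strip()] = value.strip()
        # Content-Length gives the exact payload size in bytes.
        payload = stream.read(int(headers["Content-Length"]))
        yield headers, payload


def to_index_doc(headers, payload):
    """Map one record to a flat dict; the field names are hypothetical."""
    return {
        "id": headers.get("WARC-Record-ID"),
        "url": headers.get("WARC-Target-URI"),
        "date": headers.get("WARC-Date"),
        "type": headers.get("WARC-Type"),
        "size": len(payload),
    }


def make_record(uri, body):
    """Build a small synthetic record for demonstration (hypothetical helper)."""
    head = (
        "WARC/1.0\r\n"
        "WARC-Type: resource\r\n"
        f"WARC-Target-URI: {uri}\r\n"
        "WARC-Date: 2016-02-02T00:00:00Z\r\n"
        f"Content-Length: {len(body)}\r\n"
        "\r\n"
    ).encode("utf-8")
    return head + body + b"\r\n\r\n"


if __name__ == "__main__":
    data = make_record("http://example.org/a", b"hello") + make_record(
        "http://example.org/b", b"world!"
    )
    for headers, payload in iter_warc_records(io.BytesIO(data)):
        print(to_index_doc(headers, payload))
```

Reading the payload by `Content-Length` rather than scanning for a delimiter matters because archived content can itself contain blank lines. A production pipeline would also need to handle gzip-compressed records and then post the resulting documents to the search index, neither of which is shown here.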