师资队伍

全职教师
访问教授

Full-time teachers

全职教师
  • 姓 名:汤步洲
  • 职 称:副教授,硕导
  • 职 务:
  • 邮 件:tangbuzhou@gmail.com
  • 研究方向:自然语言处理、医学信息处理、机器学习、数据挖掘
  • 电 话:
  • 个人主页:http://www.hitsz.edu.cn/teacher/view/id-1166.html
  • 通讯地址:
教育经历

2005.7    毕业于吉林大学获计算机科学与技术专业理学学士学位

2005.9    哈尔滨工业大学攻读博士学位

2011.3    毕业于哈尔滨工业大学获计算机应用专业工学博士学位

工作经历

2011.7    进入哈尔滨工业大学博士后流动站从事博士后研究工作

2011.12-2013.7     访问美国范德堡大学(综合排名全美第17)医学信息系(专业排名全美前五),得州大学休斯敦健康科学中心生物医学信息学院(全美唯一的一个生物医学信息学院)做博士后研究员

科研项目

主持项目

自然科学基金青年:面向临床医疗文本的实体时序化问题研究,2015-2017

博士后科学基金:在线半监督结构化预测技术研究,2011-2013

符号计算与知识工程教育部重点实验室开放基金:面向全科诊疗的中文知识图谱自动构建方法研究,2016-2017

深圳市科技创新基础研究项目:中文浅层知识图谱自动构建方法研究,2016-2018

哈尔滨工业大学科研创新基金临床医学文本中实体时间标引,2016-2017

公司合作项目:智能预诊,2015-2016

腾讯犀牛鸟创新基金:中文浅层知识图谱自动构建方法研究,2016-2017

 

参与项目

国家自然科学基金重点项目:问答式信息检索理论与方法研究,2005-2008

国家863项目(目标导向类):基于NLP的智能搜索引擎,2006-2008

国家863项目:基于手势的拟人化人机交互系统,2007-2009

国家自然科学基金:面向网络知识服务的中文动态语义分析关键技术研究2010-2012

国家自然科学基金:面向真实环境的异构信息交互式问答理论与方法研究,2013-2016

国家自然科学基金:文本情绪计算框架、模型和方法研究,2014-2017

广东省重大科技计划:基于心电信息大数据处理的心脏病猝死的早期预警及其设备研发,2016-2018

发表论文

书籍章节

1. Mei Liu, Yong Hu, Buzhou Tang. Role of Text Mining in Early Identification of Potential Drug Safety Issues (Chapter 13). Biomedical Literature Mining, Methods in Molecular Biology, 1159:227-251. Humana Press. 2014.

期刊论文

1. Buzhou Tang, Yonghui Wu, Min Jiang, Yukun Chen, Joshua C Denny, Hua Xu. A hybrid system for temporal information extraction from clinical text. Journal of the American Medical Informatics Association. 20(5):828–835, 2013. (JCR一区, 中科院一区, CCF B类期刊, IF: 3.932) (领域顶级期刊).

2.  Buzhou Tang, Yudong Feng, Xiaolong Wang, Yonghui Wu, Yaoyun Zhang, Min Jiang, Jingqi Wang, Hua Xu. A Comparision of Conditional Random Fields and Structured Support Vector Machines for Chemical Entity Recognition in Biomedical Literature. Journal of Cheminformatics, 7(Suppl 1):S8. 2015. (JCR一区, 中科院一区IF: 3.949).

3. Jianbo Lei, Buzhou Tangʂ, Xueqin Lu, Kaihua Gao, Min Jiang, Hua Xu. A Comprehensive Study of Named Entity Recognition in Chinese Clinical Text. Journal of the American Medical Informatics Association. 21(5):808-814, 2014. (JCR一区, 中科院一区, CCF B类期刊, IF: 3.504) (领域顶级期刊).

4.  Yaoyun Zhang, Buzhou Tangʂ, Min Jiang, Jingqi Wang, Yonghui Wu, Hua Xu. Domain Adaptation for Semantic Role Labeling of Clinical Text. Journal of the American Medical Informatics Association. 22(5):967-979. 2015. (JCR一区, 中科院一区, CCF B类期刊, IF: 3.428) (领域顶级期刊).

5.  Haodi Li, Buzhou Tang*, Qingcai Chen, Kai Chen, Xiaolong Wang, Zhe Wang, Baohua Wang. HITSZ_CDR:An End-to-end Chemical and Disease Relation Extraction System for BioCreative V. DATABASE. 2016:baw077doi:10.1093/database/baw07. 2016. (JCR一区, 中科院二区, 5-year IF: 3.983)

6.  Yi Chen, Xiaolong Wang, Xin Xiang, Buzhou Tang*, Junzhao Bu. Network Structure Exploration via Bayesian Nonparametric Models. Journal of Statistical Mechanics: Theory and Experiment. doi:dx.doi.org/10.1088/1742-5468/2015/10/P10004. 2015. (JCR一区, 中科院二区, IF: 2.091)

7.  Yi Chen, Xiaolong Wang, Bo Yuan, Buzhou Tang*. Overlapping community detection in networks with positive and negative links. Journal of Statistical Mechanics: Theory and Experiment. doi:dx.doi.org/10.1088/1742-5468/2014/03/P03021. 2014. (JCR一区, 中科院二区, IF: 2.404).

8.  Zengjian Liu, Yangxin Chen, Buzhou Tang*, Xiaolong Wang, Qingcai Chen, Haodi Li, Jingfeng Wang, Qiwen Deng, Suisong Zhu. Automatic De-identification of Electronic Medical Records using Token-level and Character-level Conditional Random Fields. Journal of Biomedical Informatics. dio:10.1016/j.jbi.2015.06.009. 2015. (JCR, 中科院二区, IF: 2.447).

9.  Qingcai Chen, Haodi Li, Buzhou Tang*, Xiaolong Wang, Xin Liu, Zengjian Liu, Shu Liu, Weida Wang, Qiwen Deng, Suisong Zhu, Yangxin Chen, Jingfeng Wang. An automatic risk factor identification system for heart disease in clinical texts over time. Journal of Biomedical Informatics. doi:10.1016/j.jbi.2015.09.002. 2015. (JCR, 中科院二区, IF: 2.447).

10.  Yi Chen, Xiaolong Wang, Junzhao Bu, Buzhou Tang*, Xin Xiang. Network structure exploration in networks with node attributes. Physica A: Statistical Mechanics and its Applications. 499: 240–253. 2016. (JCR, 中科院二区, 5-year IF: 1.738)

11.  Buzhou Tang, Hongxin Cao, Xiaolong Wang, Qingcai Chen, Hua Xu. Evaluating Word Representation Features in Biomedical Named Entity Recognition Tasks. Biomed Research International. vol. 2014, Article ID 240403, 6 pages, 2014. doi:10.1155/2014/240403. (JCR二区, IF: 3.169).

12.  Shengyu Liu, Buzhou Tangʂ, Qingcai Chen, Xialong Wang, Xiaoming Fan. Feature Engineering for Drug Name Recognition in Biomedical Texts: Feature Conjunction and Feature Selection. Computational and Mathematical Methods in Medicine. Volume 2015, Article ID 913489, 9 pages, dio:10.1155/2015/913489. 2015. (JCR, IF: 0.887).

13.  Shengyu Liu, Buzhou Tangʂ, Qingcai Chen, Xiaolong Wang. Drug-drug Interaction Extraction via Convolutional Neural Networks. Computational and Mathematical Methods in Medicine. Volume 2016, Article ID 6918381, 8 pages, 10.1155/2016/6918381. 2016 (JCR, 5-year IF: 0.846)

14.  Martin Krallinger, Obdulia Rabal, Florian Leitner, Miguel Vazquez, David Salgado, Zhiyong Lu, Robert Leaman, Yanan Lu, Donghong Ji, Daniel M Lowe, Roger A Sayle, Riza Theresa Batista-Navarro, Rafal Rak, Torsten Huber, Tim Rocktäschel, Sérgio Matos, David Campos, Buzhou Tang, Hua Xu, Tsendsuren Munkhdalai, Keun Ho Ryu, SV Ramanan, Senthil Nathan, Slavko Žitnik, Marko Bajec, Lutz Weber, Matthias Irmer, Saber A Akhondi, Jan A Kors, Shuo Xu, Xin An, Utpal Kumar Sikdar, Asif Ekbal, Masaharu Yoshioka, Thaer M Dieb, Miji Choi, Karin Verspoor, Madian Khabsa, C Lee Giles, Hongfang Liu, Komandur Elayavilli Ravikumar, Andre Lamurias, Francisco M Couto, Hong-Jie Dai, Richard Tzong-Han Tsai, Caglar Ata, Tolga Can, Anabel Usié, Rui Alves, Isabel Segura-Bedmar, Paloma Martínez, Julen Oyarzabal and Alfonso Valencia1. The CHEMDNER corpus of chemicals and drugs and its annotation principles. Journal of Cheminformatics, 7(Suppl 1):S2. 2015. (JCR一区, 中科院一区, IF: 3.949).

15. Baotian Hu, Buzhou Tang, Qingcai Chen, Longbiao Kang. A Novel Word Embedding Learning Model Using the Dissociation between Nouns and Verbs. Neurocomputing. 2015. doi:10.1016/j.neucom.2015.07.046 (JCR. IF: 2.392)

16.  JiaYuan Chen, Buzhou Tang, YongQing Lin, Ying Ru, MaoXiong Wu, Xiaolong Wang, Qingcai Chen, YangXin Chen, JingFeng Wang. Validation of the Ability of SYNTAX and Clinical SYNTAX Scores to Predict Adverse Cardiovascular Events After Stent Implantation: A Systematic Review and Meta-Analysis. Angiology. 2015. 0003319715618803. (JCR二区,IF: 2.931)

17.  Min Jiang, Yang Huang, Jung-wei Fan, Buzhou Tang, Josh Denny, Hua Xu. Parsing clinical text: how good are the state-of-the-art parsers?. BMC Medical Informatics and Decision Making, 15(Suppl 1):S2. 2015. (JCR, IF: 2.042).

18. Jingting Mai, Fei Wang, Qiong Qiu, Buzhou Tang, YongQing Lin, NianSang Luo, WoLiang Yuan, XiaoLong Wang, Qingcai Chen, JingFeng Wang, YangXin Chen. Tachycardia pacing induces myocardial neovascularization and mobilizes circulating endothelial progenitor cells partly via SDF-1 pathway in canines. Heart and vessels. dio:10.1007/s00380-014-0613-5. 2014. (JCR三区, IF: 2.065).

19.  Shengyu Liu, Buzhou Tangʂ, Qingcai Chen, Xiaolong Wang. Effects of Semantic Features on Machine Learning-Based Drug Name Recognition Systems: Word Embeddings vs. Manually Constructed Dictionaries. Information. 6(4): 848-865. 2015.

20. Shengyu Liu, Buzhou Tangʂ, Qingcai Chen, Xiaolong Wang. Drug Name Recognition: Approaches and Resources. Information. 6(4): 790-810. 2015.

21.  Bin Liu, Xiaolong Wang, Ruifeng Xu, Buzhou Tang. Protein Remote Homology Detection by Combining Profile-based Protein Representation with Local Alignment Kernel. Journal of Medical and Bioengineering. Vol 3(1):17-22, 2014.

会议论文

1. Buzhou Tang, Qingcai Chen, Xiaolong Wang, Yonghui Wu, Yaoyun Zhang, Min Jiang, Jingqi Wang, Hua Xu. Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods. Proceedings of American Medical Informatics Association Annual Symposium, pages 1184-1193 . 2015. (CCF C) (领域顶级会议)

2. Yi Chen, Xiaolong Wang, Buzhou Tang*, Junzhao Bu, Qingcai Chen, Xin Xiang. Structural regularity exploration in multidimensional networks. Proceedings of the International Conference on Neural Information Processing. pages 532-540. 2015. (CCF C)

3. Yi Chen, Xiaolong Wang, Buzhou Tang*, Junzhao Bu, Xin Xiang. User recommendation based on network structure in social networks. Proceedings of the International Conference on Neural Information Processing. pages 488-496. 2015. (CCF C)

4. Junzhao Bu, Buzhou Tang*, Xiaolong Wang, Qingcai Chen, Zengjian Liu, Haodi Li. HTSZ_CEM System for Chemical Entity Mention Recognition in Patents. Proceedings of the Fifth BioCreative Challenge Evaluation Workshop. pages 116-118. 2015.

5. Haodi Li, Qingcai Chen, Kai Chen, Buzhou Tang*. HITSZ_CDR System for Disease and Chemical Named Entity Recognition and Relation Extraction. Proceedings of the Fifth BioCreative Challenge Evaluation Workshop. pages 196-201. 2015.

6. Yi Chen, Xiaolong Wang, Buzhou Tang*, Ruifeng Xu, Bo Yuan, Xin Xiang, Junzhao Bu. Identifying Opinion Leaders from Online Comments. Social Media Processing Communications in Computer and Information Science. 489:231-239. 2014. (20150600492132)

7. Xiaoqiang Zhou, Baotian Hu, Qingcai Chen, Buzhou Tang, Xiaolong Wang. Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering. Proceedings of Association for Computitional Linguistics. pages 713-718. 2015. (CCF A). (领域顶级会议)

8. Shixi Fan, Lidan Chen, Xuan Wang, Buzhou Tang. Shallow Parsing with Hidden Markov Support Vector Machines. Proceedings of the International Conference on Machine Learning and Cybernetics (ICMLC). v2, 827-830. 2015. (20150400454112)

9. Min Jiang, Yang Huang, Jung-wei Fan, Buzhou Tang, Josh Denny, Hua Xu. Parsing clinical text: how good are the state-of-the-art parsers?. Proceedings of the ACM Seventh International Workshop on Data and Text Mining in Biomedical Informatics. pages 1-5. 2014.

10.Yaoyun Zhang, Jingqi Wang, Buzhou Tang, Yonghui Wu, Min Jiang, Yukun Chen, Hua Xu. UTH_CCB: A Report for SemEval 2014–Task 7 Analysis of Clinical Text. Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014). pages 802-806. 2014.

获奖

1. 王晓龙, 王轩, 刘秉权, 陈清财, 林磊, 刘远超, 单丽莉, 孙承杰, 汤步洲, 王平, 刘铭. 网络环境拼音语句输入技术. 黑龙江省科学技术发明一等奖. 2011.

2. 王轩, 王晓龙, 黄荷姣, 陈清才, 丁宇新, 姚霖, 汤步洲, 张耀允, 许欣欣, 吴堃, 赫兰光. 中国手势语言表达系统. 深圳市科技创新奖. 2008.

3. 组织参加相关领域国际公开评测成绩:

(1) 2010年参加CoNLL关于自然语言中不确定性分析的国际评测,在不确定词抽取任务上获得第一名。

(2) 2012年参加i2b2关于临床医学文本中时间信息抽取的国际评测,在时间关系抽取任务上获得两个第一名,在医学事件抽取任务上获得第二名,在时间表达式抽取任务上获得第四名。

(3) 2013年参加ShARe/CLEF eHealth关于临床医学文本信息抽取、语义编码以及信息检索国际评测,在信息抽取任务上获得第一名,在语义编码任务上获得第三名,在概念缩写语义编码任务上获得第一名。

(4) 2013年参加BioCreative IV Track 2关于化学复合物及药物实体识别国际评测,在实体识别任务上获得无化学背景组第一名。

(5) 2014年参加SemEval-2014 Task 7关于临床医疗文本分析的国际评测,在信息抽取和语义编码任务上均获得第一名。

(6) 2014年参加i2b2关于“去隐私化”和“随时间变迁的心脏病风险因子识别”的国际评测,在两个任务上均获得了第二名(国内第一)。

(7) 2015年参加BioCreative V Track 3关于“药物和疾病关系抽取”的国际评测,在药物实体识别任务上获得第一名。

著作、软件著作权与专利

申请/获得专利

1. 王晓龙, 刘秉权, 汤步洲, 林磊, 刘远超, 王轩, 陈清财. 语句级中英文混合输入方法. 专利号(201010566505.6), 授权日期:20120627.

2. 王晓龙, 刘秉权, 汤步洲, 单丽莉, 孙承杰, 刘铭, 陈清财, 王轩. 词汇自适应中文输入方法. 专利号(201010551084.X), 授权日期:20120704.


Return Top
© 2014 哈尔滨工业大学深圳研究生院·智能计算研究中心 All rights reserved.