CMedTEX is a rule-based temporal expression (TE) extraction and normalization system for Chinese electronic medical records. It identifies four types of temporal expressions (Date, Time, Duration and Frequency) and normalizes their values to a standard format (TIMEX3). Evaluated on an independent, manually annotated Chinese clinical set of 1,778 records, the system achieves a precision of 90.83%, a recall of 96.11% and an F-score of 93.40% on TE extraction, and an accuracy of 92.58% on TE normalization under the exact-match criterion.
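To make the extract-then-normalize idea concrete, here is a minimal sketch of how one rule might turn a full Chinese date into a TIMEX3-style value. It is an illustration only and not CMedTEX's actual rule set: the pattern DATE_RE and the function extract_dates are hypothetical names, and only the Date type is covered.

```python
import re

# Illustrative pattern only: full dates such as 2015年3月12日.
# This is NOT taken from CMedTEX's rule resources.
DATE_RE = re.compile(r"(\d{4})年(\d{1,2})月(\d{1,2})日")


def extract_dates(text):
    """Return (surface string, TIMEX3-style type, normalized value) triples."""
    results = []
    for m in DATE_RE.finditer(text):
        year, month, day = (int(g) for g in m.groups())
        # Normalize to the ISO-8601 calendar form used for TIMEX3 DATE values.
        value = f"{year:04d}-{month:02d}-{day:02d}"
        results.append((m.group(0), "DATE", value))
    return results


print(extract_dates("患者于2015年3月12日入院。"))
# [('2015年3月12日', 'DATE', '2015-03-12')]
```

A real system would need many more rules (relative dates, durations, frequencies, partially specified dates), which is what the rule resources shipped with CMedTEX presumably provide.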
Zengjian Liu, Buzhou Tang, Xiaolong Wang, Qingcai Chen. CMedTEX: A Rule-based Temporal Expression Extraction and Normalization System for Chinese Clinical Notes[C]. AMIA 2016 Annual Symposium, 2016.
Please fill in the [Application Form] and send it to Zengjian Liu (email@example.com) or Buzhou Tang (firstname.lastname@example.org) to obtain the source code of CMedTEX.
1) Environment requirements:
Operating system: Linux or Windows
2) Show the help information:
#python runCMedTEX.py -h
3) Ensure that the "resources" and "src" directories exist.
4) Run CMedTEX with an input directory (required):
#python runCMedTEX.py -i [inputdir]
-- The default output directory is "CMedTEX/output/";
-- use the '-o' parameter to specify a different output directory.
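The steps above can be sketched as a small pre-flight helper. This is a hedged illustration, not part of CMedTEX: missing_dirs and build_command are hypothetical names, and the only details taken from this README are the "resources" and "src" directory names, the script name runCMedTEX.py, and the -i and -o options.

```python
import os

# Directory names taken from step 3 of this README.
REQUIRED_DIRS = ("resources", "src")


def missing_dirs(root):
    """Return the required directories that are absent under root."""
    return [d for d in REQUIRED_DIRS if not os.path.isdir(os.path.join(root, d))]


def build_command(input_dir, output_dir=None):
    """Assemble the command line from step 4; -o is added only when given."""
    cmd = ["python", "runCMedTEX.py", "-i", input_dir]
    if output_dir is not None:
        cmd += ["-o", output_dir]
    return cmd
```

For example, build_command("notes", "results") yields the argument list for "python runCMedTEX.py -i notes -o results", which could then be passed to subprocess.run from the CMedTEX directory once missing_dirs(".") returns an empty list.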
Authors: Zengjian Liu, Buzhou Tang
Address: Key Laboratory of Network Oriented Intelligent Computation, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China, 518055.