A data quality framework for process mining of electronic health record data

dc.contributor.author: Fox, Frank
dc.contributor.author: Aggarwal, Vishal R.
dc.contributor.author: Whelton, Helen
dc.contributor.author: Johnson, Owen
dc.date.accessioned: 2018-09-03T11:47:08Z
dc.date.available: 2018-09-03T11:47:08Z
dc.date.issued: 2018-07
dc.date.updated: 2018-08-16T09:01:26Z
dc.description.abstract: Reliable research demands data of known quality. This can be very challenging for electronic health record (EHR) based research where data quality issues can be complex and often unknown. Emerging technologies such as process mining can reveal insights into how to improve care pathways but only if technological advances are matched by strategies and methods to improve data quality. The aim of this work was to develop a care pathway data quality framework (CP-DQF) to identify, manage and mitigate EHR data quality issues in the context of process mining, using dental EHRs as an example. Objectives: To: 1) Design a framework implementable within our e-health record research environments; 2) Scale it to further dimensions and sources; 3) Run code to mark the data; 4) Mitigate issues and provide an audit trail. Methods: We reviewed the existing literature covering data quality frameworks for process mining and for data mining of EHRs and constructed a unified data quality framework that met the requirements of both. We applied the framework to a practical case study mining primary care dental pathways from an EHR covering 41 dental clinics and 231,760 patients in the Republic of Ireland. Results: Applying the framework helped identify many potential data quality issues and mark up every data point affected. This enabled systematic assessment of the data quality issues relevant to mining care pathways. Conclusion: The complexity of data quality in an EHR-data research environment was addressed through a re-usable and comprehensible framework that met the needs of our case study. This structured approach saved time and brought rigor to the management and mitigation of data quality issues. The resulting metadata is being used within cohort selection, experiment and process mining software so that our research with this data is based on data of known quality. Our framework is a useful starting point for process mining researchers to address EHR data quality concerns. [en]
dc.description.status: Not peer reviewed [en]
dc.description.uri: http://hpr.weill.cornell.edu/divisions/health_informatics/ieee_ichi.html [en]
dc.description.version: Accepted Version [en]
dc.format.mimetype: application/pdf [en]
dc.identifier.citation: Fox, F., Aggarwal, V. R., Whelton, H. and Johnson, O. (2018) 'A data quality framework for process mining of electronic health record data', 2018 IEEE International Conference on Healthcare Informatics (ICHI), New York, USA, 4-7 June. doi:10.1109/ICHI.2018.00009 [en]
dc.identifier.doi: 10.1109/ICHI.2018.00009
dc.identifier.endpage: 21 [en]
dc.identifier.issn: 2575-2634
dc.identifier.startpage: 12 [en]
dc.identifier.uri: https://hdl.handle.net/10468/6700
dc.language.iso: en [en]
dc.publisher: Institute of Electrical and Electronics Engineers (IEEE) [en]
dc.relation.ispartof: 2018 IEEE International Conference on Healthcare Informatics (ICHI)
dc.rights: © 2018, IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. [en]
dc.subject: Data visualization [en]
dc.subject: EHR [en]
dc.subject: Research data [en]
dc.subject: Process mining [en]
dc.subject: Data quality [en]
dc.subject: Data mining [en]
dc.subject: Data integrity [en]
dc.subject: Dentistry [en]
dc.subject: Registers [en]
dc.subject: Systematics [en]
dc.title: A data quality framework for process mining of electronic health record data [en]
dc.type: Conference item [en]
Files
Original bundle
Name: A_data_quality_framework_for_process_mining.pdf
Size: 321.4 KB
Format: Adobe Portable Document Format
Description: Accepted Version
License bundle
Name: license.txt
Size: 2.71 KB
Format: Item-specific license agreed upon to submission
Description: