Electronic laboratory-based reporting, developed by the University of Pittsburgh Medical Center (UPMC) Health System, was evaluated to determine if it could be integrated into the conventional paper-based reporting system. We reviewed reports of 10 infectious diseases from 8 UPMC hospitals that reported to the Allegheny County Health Department in southwestern Pennsylvania during January 1–November 26, 2000. Electronic reports were received a median of 4 days earlier than conventional reports. The completeness of reporting was 74% (95% confidence interval [CI] 66% to 81%) for the electronic laboratory-based reporting and 65% (95% CI 57% to 73%) for the conventional paper-based reporting system (p>0.05). Most reports (88%) missed by electronic laboratory-based reporting were caused by using free text. Automatic reporting was more rapid and as complete as conventional reporting. Using standardized coding and minimizing free text usage will increase the completeness of electronic laboratory-based reporting.
Public health surveillance of infectious diseases is crucial for detecting and responding to illnesses that may represent potential outbreaks or bioterrorism events (
Allegheny County (population 1,348,000) is located in southwestern Pennsylvania and includes the city of Pittsburgh. Incidences of notifiable diseases in the county are required by law to be reported directly to the Allegheny County Health Department (ACHD). Each notifiable event is recorded on a case report form that is mailed or faxed to the ACHD by laboratory personnel, physicians, nurses, or infection-control staff; this procedure constitutes the conventional paper-based reporting system (
Schematic of information flow for the electronic reporting system of the University of Pittsburgh Medical Center Health System and for the paper-based reporting system to Allegheny County Health Department. NETSS, National Electronic Telecommunications System of Surveillance. ELR is electronic laboratory-based reporting, and CRS is conventional paper-based reporting system.
The UPMC Health System is a large university-based health-care network consisting of approximately 20 hospitals and hospital affiliates (
Eight UPMC hospital microbiology laboratories in Allegheny County are capable of electronic laboratory-based reporting by using Health Level 7 (HL7), an electronic messaging standard for data exchange and communication between health-care information systems (
The processing occurred in real time, i.e., messages were checked as they were received. Instead of a batch mode in which data are extracted from sets of reports at predetermined times, extraction of information occurred whenever data were received by the electronic system. The electronic system extracted the specific laboratory specimen, procedure, and result from the HL7 records. The data were then interpreted; a laboratory result was positive or negative based on the code in the result section, which was compared with a data dictionary developed by the UPMC Health System. Some duplicate records were recognized by the electronic system. The extracted three- or four-letter coded organism name, defined by the UPMC laboratory system data dictionary, was then converted to its full name through a translation table maintained in the Oracle (Oracle Corporation, Redwood Shores, CA) data storage; computer personnel could use the database to add or remove an organism they wanted to be monitored. To obtain complete patient demographic information, if not provided in the laboratory HL7 message, the electronic laboratory-based reporting system queried the Medical Archival Retrieval System at UPMC based on the patient’s medical record number in the message
We conducted a comparison evaluation of the UPMC electronic laboratory-based reporting and the ACHD conventional paper-based reporting systems. From the eight UPMC hospital or affiliated microbiology laboratories with HL7 links in Allegheny County, we compared all disease reports in the UPMC electronic and the ACHD paper-based systems (derived from the National Electronic Telecommunications System of Surveillance) databases with dates of positive culture from January 1 to November 26, 2000, for 10 infectious organisms:
Completeness of reporting was defined as the total number of unique, notifiable events identified independently through each surveillance system (UPMC electronic laboratory-based and ACHD conventional paper-based systems) divided by the estimated total number of reports available for reporting at the laboratory level (N) (
Capture-recapture methodology (11). C=number of reports received through both electronic laboratory-based reporting and conventional paper-based reporting. n1=number of reports received through conventional paper-based reporting system only. n2=number of reports received through electronic laboratory-based reporting only. X= estimated number of reports missed by both electronic laboratory-based reporting and conventional paper-based reporting system. R=number of reports received through conventional paper-based reporting system. S=number of reports received through electronic laboratory-based reporting. N=estimated total number of reports available by the Chandra Sekar-Deming capture-recapture calculation.
An electronic false-positive result was defined as a case that was incorrectly detected by the electronic system; a missed report (or false negative) was defined as a notifiable case that was not detected through the electronic system. The completeness of reporting of both systems was estimated after excluding false positives and duplicate reports.
The chronologic sequence of events for the reporting of an infectious disease or condition consists of exposure to an infectious agent, followed by symptom onset after an incubation period, and then the seeking of medical attention (
Timeline for reporting notifiable infectious diseases by the University of Pittsburgh Medical Center Health System, Allegheny County, Pennsylvania.
To determine the timeliness of the two surveillance systems, three time points were defined. T1 was the date/time when the laboratory result was obtained and entered into the UPMC laboratory computer. T2 was the date/time when the laboratory result was reported to ACHD by the conventional paper-based system. T3 was the date/time the automatic electronic laboratory-based system notification was generated at UPMC. The timeliness of electronic and paper-based systems was defined as t3 – t1 and t2 – t1, respectively. The difference between t3 and t2 represented how much sooner or later the electronic system identified notifiable diseases than the paper-based system. Median differences were expressed with an interquartile range. The timeliness calculations were performed with SAS version 8.1.
Before matching individual records in both systems and removing duplicate records, we calculated the completion rates for the data fields common to both the UPMC electronic and the ACHD paper-based databases by using Epi-Info 2000 (Centers for Disease Control and Prevention, Atlanta, GA).
We identified the specific reasons for the electronic system’s false positives and missed reports by using a traceback error analysis. Reports that were found in the UPMC electronic database but not in the ACHD paper-based database were identified, and case-patient information was reviewed from the laboratory computer reports and their HL7 messages (electronic false positives). Reports that were found in the ACHD paper-based database and not in the UPMC electronic database were identified after reviewing case-patient information, laboratory case reports, and archived computer files (electronic laboratory-based reporting missed reports). To further assess database accuracy, we also reviewed the paper reports and logs at the Allegheny County Health Department and compared these with data in the ACHD conventional paper-based reporting system database.
A total of 141 unique reports were identified; 116 (82%) were reported through the UPMC electronic laboratory-based system, and 94 (67%) were reported by ACHD conventional paper-based reporting system. Forty-seven (33%) of the notifications were received through the UPMC electronic system only, 25 (18%) through the ACHD paper-based system only, and 69 (49%) through both (
Venn Diagram depicting the number of notifiable disease reports received independently by the electronic laboratory-based reporting of University of Pittsburgh Medical Center electronic system, Allegheny County Health Department paper-based reporting, or both. The estimated true total number of reports available, calculated by the Chandra Sekar-Deming capture-recapture method, is shown in the large, encompassing circle. ELR is electronic laboratory-based reporting, and CRS is conventional paper-based reporting system.
After excluding electronic laboratory-based reporting false positives, the overall completeness of reporting was 74% (95% CI 66% to 81%) for the UPMC electronic system and 65% (95% CI 57 to 73%) for the ACHD paper-based system (p>0.05), showing no significant difference in completeness of reporting between the electronic and paper-based systems (
| Conventional reporting (ACHD) | Electronic reporting (UPMC) | ||||
|---|---|---|---|---|---|
| Total no. of available reportsb | No. of reports received | Completeness of coverage (95% CI) | No. of reports received | Completeness of coverage (95% CI) | |
| Notifiable infectious disease | |||||
| 37 | 25 | 0.68 (0.49 to 0.85) | 18 | 0.49 (0.32 to 0.65) | |
| 35 | 32 | 0.91 (0.83 to 0.97) | 34 | 0.95 (0.91 to 0.97) | |
| 17 | 10 | 0.59 (0.33 to 0.86) | 7 | 0.41 (0.19 to 0.67) | |
| 22 | 13 | 0.59 (0.39 to 0.77) | 17 | 0.77 (0.58 to 0.90) | |
| 9 | 5 | 0.58 (0.30 to 0.88) | 7 | 0.72 (0.46 to 0.88) | |
| UPMC Hospital laboratory | |||||
| A | 26 | 16 | 0.62 (0.46 to 0.80) | 24 | 0.92 (0.81 to 0.96) |
| B | 52 | 29 | 0.55 (0.42 to 0.65) | 47 | 0.91 (0.79 to 0.96) |
| C | 35 | 24 | 0.69 (0.43 to 0.90) | 9 | 0.26 (0.12 to 0.40) |
| D | 13 | 11 | 0.85 (0.64 to 0.92) | 12 | 0.87 (0.71 to 0.92) |
| E | 10 | 9 | 0.90 (0.70 to 0.90) | 9 | 0.90 (0.70 to 0.90) |
aUPMC, University of Pittsburgh Medical Center Health System; ACHD, Allegheny County Health Department; CI, confidence interval.
bEstimated total number of reports available by using capture-recapture (N in
Timeliness was calculated by using the 69 records common to both databases. The timeliness of paper-based reporting was a median of 5 days (interquartile range 4 days). The timeliness of the electronic reporting was a median of 1 day (interquartile range 0 days). Electronic alerts were reported a median of 4 days (interquartile range 4 days) sooner than through paper-based reporting. We discovered a trend in the UPMC electronic reporting: the time difference between the date/time the laboratory result was obtained and entered, and the date/time the HL7 message was sent was almost exactly 24 hours to the second. After extensive discussions with the UPMC laboratory administration and informatics personnel, the reasons for this finding are unknown.
Eleven data fields were common to both the UPMC electronic and the ACHD paper-based databases (
| Data field | No. (%) of conventional reported cases with field completed (n=534) | No. (%) of electronic reported cases with field completed (n=582) |
|---|---|---|
| Patient information | ||
| Patient ID | 534 (100) | 582 (100) |
| Name | 534 (100) | 582 (100) |
| Sex | 534 (100) | 582 (100) |
| Date of birth | 462 (86.5) | 582 (100) |
| Age | 518 (97.0) | 582 (100) |
| Address | 533 (99.8) | 306 (52.6) |
| Zip code | 533 (99.8) | 306 (52.6) |
| Specimen information | ||
| Organism name | 534 (100) | 582 (100) |
| Time result obtained | 534 (100) | 582 (100) |
| Time result reported | 534 (100) | 582 (100) |
| Other information | ||
| Status of report | 534 (100) | 220 (37.8) |
aAll rates before matching and duplicate record removal; UPMC, University of Pittsburgh Medical Center Health System.
Electronic laboratory-based reporting generated 10 reports that were found to be false upon investigation (
| Errors | No. (%) of electronic or paper-based only reports | Nature of problem |
|---|---|---|
| Electronic false positives | ||
| Incorrect use of free text with organism codes | 6 (60) | Culture report reads “No [free text]” followed by organism ID code |
| Inability to retrieve sent false reports | 3 (30) | Unable to retrieve preliminary reports |
| Failure of logic detection | 1 (10) | Data extracted from wrong portion of result field by logic detection |
| Total | 10 | |
| Electronic false negatives (missed reports) | ||
| Incorrect use of free text | 22 (88) | Organism name typed out as free text in result field |
| Unknown (failure of transmission?) | 3 (12) | Found to be in UPMC hospital computer terminal system by using organism ID code properly but not found in UPMC electronic database |
| Total | 25 |
aUPMC, University of Pittsburgh Medical Center Health System.
Data-entry errors, such as the incorrect use of free text, led to missed reports in the electronic system. Typically, these errors occurred when laboratory technicians entered the name of the organism as free text rather than with the preprogrammed UPMC disease codes. These errors accounted for 22 (88%) of 25 electronic missed reports, whereas the remaining three missed reports were found in the hospital computer systems but were not detected by the UPMC electronic system for reasons that remained unclear after investigation. Of the 47 cases in the UPMC electronic system not reported by the paper-based system to ACHD, 37 should have been reported to ACHD (“ACHD false negative”).
This evaluation is the first of a large health-care system by using automatic and electronic notifiable disease reporting. The electronic laboratory-based reporting was as complete as conventional paper-based reporting. The estimated completeness (74%) is similar to the recent report of 80% completeness of the electronic laboratory-based reporting from commercial clinical laboratories to the Hawaii Department of Health
The magnitude of the difference in completeness between electronic laboratory-based reporting and conventional paper-based reporting may have been greater if it had been possible to review reports coming exclusively from laboratories to the ACHD; paper-based systems receive reports from sources other than laboratories. Data specifying if a case record originated from a laboratory or health-care provider were not available in the paper-based database. Hence, a bias favoring completeness of reporting by the paper-based system existed in our analysis. However, most reports received by health departments originate from clinical laboratories
Maximizing electronic laboratory-based reporting sensitivity is important for detecting diseases, while maximizing specificity enhances the likelihood that cases are reported correctly. Theoretically, electronic reporting has the potential to be both sensitive and specific, with few false negatives and false positives. The specificity of electronic reporting could be particularly high for diseases diagnosed by laboratory tests with a low rate of false positives (e.g., culture for enteric organisms); the diseases caused by the organisms used in this study met this qualification. Notifiable diseases based on other types of tests (e.g., serology for syphilis) would require clinical criteria to enhance specificity (information not available by electronic reporting). In this evaluation, we found that the inability to retract preliminary positive laboratory reports that were subsequently confirmed to be negative reduced the specificity of electronic reporting. However, in some instances, the benefit of early detection might supersede an occasional false-positive report. For example, early detection is paramount for some organisms, such as
The UPMC electronic reporting has the potential to serve as a prototype for use nationally because it uses hospital-based laboratory information systems already in place to capture cases of disease that may be representative of the population at large. However, several findings from our analysis have implications for large health systems attempting to establish electronic laboratory-based reporting. The use of standardized disease codes should be encouraged because it maximizes both the sensitivity and specificity of electronic laboratory-based reporting. At UPMC, the incorrect use of free text at a single hospital substantially reduced the overall completeness of electronic laboratory-based reporting reporting. However, eliminating the use of free text may not be desirable from a laboratory personnel standpoint. As such, training laboratory personnel in the correct use of free text is important
A mechanism for retracting of preliminary reports not subsequently confirmed is essential to reduce false-positive reports. Retraction is the ability to both remove an incorrect or preliminary report from the database as well as to notify the recipients of the information of the change. If the sending system does not explicitly label the message as a correction or a retraction, then the electronic laboratory-based reporting system must have logic to detect it. The detection logic simply compares the previously reported preliminary reports in a cached table with a new one. If the logic finds a match but the new report does not have any notifiable organisms, the logic will send a retraction alert to officials at the local health department or hospital laboratory administrators and remove the false-positive report from the cached table. At the time of this evaluation, such retraction capability did not exist at UPMC because the UPMC laboratory sending system did not explicitly label the message as a correction or a retraction. In the future, one option may be to label preliminary reports as “preliminary” or “suspect.” Currently, two authors have been working on the retraction capability and expect to have such functionality available soon. However, the risk versus benefit of reporting preliminary laboratory results should be weighed in making the decision to retract such reports. The best approach might be to report preliminary results for diseases that require immediate notification, while reporting confirmed results for others.
Decisions to remove certain duplicate records that were not detected by electronic laboratory-based reporting should be made before integration of automatic reporting to ACHD. Caution should be used in the removal of some type of duplicates, as this decision may need to be disease specific. For example, repeated positive sputum cultures for tuberculosis from a patient who has received the recommended course of therapy may represent persistent, active infection and drug resistance, both of which are substantial public health concerns.
UPMC electronic laboratory-based reporting had lower completion rates of data fields with important contact information (specifically, address and zip code fields) compared with ACHD conventional paper-based reporting system, which may hamper efforts by public health personnel to contact patients quickly. The implementation of automatic demographic data extraction by electronic laboratory-based reporting from other resources such as epidemiologic or administrative databases (e.g., billing records), could substantially improve the data field completion rate for electronic laboratory-based reporting.
UPMC should continue to refine its electronic laboratory-based reporting before implementing direct automatic reporting to ACHD. Electronic laboratory-based reporting should not replace conventional reporting as observations made by astute clinicians are valuable in the timely reporting of certain notifiable syndromic illnesses (
The findings from this evaluation indicate that direct automatic reporting from a health system is feasible and as complete but more rapid than conventional reporting. An error analysis showed many correctable problems; better control of the use of free text and an ability to retract preliminary reports were key areas for improvement. Standard coding schemes should be used. Health departments need to evaluate electronic surveillance systems before integrating the data into existing reporting systems. CDC and state health departments should collaborate to develop a consensus on the goals for an electronic laboratory-based reporting system intended for public health laboratory-based disease reporting. Once these goals have been determined, guidelines may be created that would assess if the system achieves these desired goals. This evaluation may be used by health departments in evaluating other electronic surveillance systems.
Suggested citation: Panackal AA, M’ikanatha NM, Tsui F-C, McMahon J, Wagner MM, Dixon BW,et al. Automatic Electronic Laboratory-Based Reporting of Notifiable Infectious Diseases. Emerg Infect Dis. [serial on the Internet]. 2002 Jul [date cited]. Available from
We thank the staff of the Allegheny County Health Department, including Nancy Felton and Mary Jane Walicki, for their interview time and data collation efforts.
This evaluation was partially funded by a Centers for Disease Control and Prevention Cooperative Agreement with the BioMedical Security Institute, a collaboration of the University of Pittsburgh and Carnegie Mellon University for research on bioterrorism issues.
Dr. Panackal is an officer with the Epidemic Intelligence Service at the National Center for Infectious Diseases, Centers for Disease Control and Prevention. His recent field epidemiology research experiences have included outbreak investigations of Rift Valley fever in Saudi Arabia and Yemen and invasive aspergillosis in renal transplant recipients in California.