当前位置: 首页 > 期刊 > 《英国医生杂志》 > 2004年第25期 > 正文
编号:11356612
Cross sectional survey of multicentre clinical databases in the United Kingdom
http://www.100md.com 《英国医生杂志》
     1 Health Services Research Unit, Department of Public Health and Policy, London School of Hygiene and Tropical Medicine, London WC1E 7HT

    Correspondence to: N Black nick.black@lshtm.ac.uk

    Abstract

    Clinical areas covered by existing databases

    The clinical subject most commonly covered by the 105 databases was cancer (see box). While the 11 general databases, mostly regional and national cancer registries, were well known, there were 14 less well known specialised ones. Like many databases in other clinical areas, these were mostly developed by enthusiastic individuals rather than by national organisations. The next most common subjects covered were surgery (15 databases), congenital anomalies (14), and trauma and intensive care (9). Among the 23 databases for medical conditions, diabetes and cardiovascular conditions predominated. In contrast, only three databases related to maternity care, and only four related to mental health.

    Organisation and management of databases

    Most of the databases (66%) covered one or more UK nations (England, Scotland, Wales, Northern Ireland) (table 1). The rest were restricted to a region of one of these countries. Most (81%) had collected data continuously since being established, while most of the rest had been set up for a one off period.

    Table 1 Organisation and management of 105 clinical databases with full entry in DoCDat, August 2003

    Although half the databases used nationally approved codes to identify patients (that is, the NHS number), less than a third used approved institutional codes, and only 16% used clinician codes. A third had obtained explicit approval from the relevant clinical or professional body, such as a royal college.

    The group or team managing a database varied in size and in composition. While almost all included doctors (95%), only 42% included nurses, and 26% included allied health professionals (such as physiotherapists). Most recognised the need for technical and methodological input from epidemiologists (69%), statisticians (79%), and information technology specialists (63%). In contrast, only a minority saw a need for representation from managers (32%) and laypeople (24%).

    Funding came from a variety of sources. Most received some funds from the public sector (mainly the Department of Health or the NHS), though sometimes this was only pump priming to get the database established. Three other sources—private sector, subscriptions from participating healthcare providers, and charities—each provided finance for 10-20%. A few databases (5%) reported no funding. The distribution of funding partly reflects the absence from DoCDat of databases owned by private companies, despite our attempts to include them.

    Number and examples of clinical databases by clinical area

    General (9 databases)

    For example:

    Hospital Episode Statistics

    General Practice Research Database

    MRC National Survey of Health and Development

    Cancer (general) (11 databases)

    For example:

    National Registry of Childhood Tumours

    Northern Ireland Cancer Registry

    North West Cancer Registry

    Cancer (specialised) (14 databases)

    For example:

    Scotland and Newcastle Lymphoma Group

    Assessment of Stomach and Oesophageal Cancer

    British Association of Surgical Oncology—Breast Unit Database

    Surgical procedures (15 databases)

    For example:

    UK Hydrocephalus Shunt Registry

    National Adult Cardiac Surgical Database

    North West Arthroplasty Register

    Infectious diseases (3 databases)

    National Prospective Monitoring Scheme on HIV

    Nosocomial Infection National Surveillance Scheme

    UK Register of HIV Seroconverters

    Congenital anomalies (14 databases)

    For example:

    Oxford Register of Early Childhood Impairments

    Trent Congenital Anomalies Register

    Glasgow Register of Congenital Anomalies

    Maternity (3 databases)

    For example:

    St Mary's Maternity Information System

    Confidential Enquiry into Maternal and Child Health

    Trauma and intensive care (9 databases)

    For example:

    Intensive Care National Audit and Research Centre—Case Mix

    Programme Database

    Trauma Audit and Research Network

    All Wales Injury Surveillance System

    Diabetes (6 databases)

    For example:

    UK Diabetes Information and Benchmarking System

    Quality Indicators in Diabetes Service

    National Paediatric Diabetes Audit

    Cardiovascular disease (7 databases)

    For example:

    National Sentinel Audit of Stroke

    Myocardial Infarction National Audit Programme

    National Pacemaker Database

    Respiratory disease (3 databases)

    UK Cystic Fibrosis Database

    Chronic Obstructive Pulmonary Disease 2001 Audit

    Scottish Asthma Management Initiative

    Other medical conditions (7 databases)

    For example:

    Northern Region Haematology Register

    Scottish Motor Neurone Disease Register

    UK National Renal Registry

    Mental health (4 databases)

    For example:

    Functional Analysis of Care Environments

    Carers and Users Expectations of Mental Health Services

    National Drug Treatment Monitoring System

    Five of the databases were established over 50 years ago (table 2), four being cancer registries and one a longitudinal birth cohort. However, most databases were much more recent, with over half starting since 1990. While there is evidence that the establishment of new databases has accelerated in recent years, our figures refer only to those still functioning in 2003. It is likely that some databases established in earlier decades have since stopped, giving an underestimate of the incidence of new ones in that period. Information for the most recent years is also likely to be an underestimate because of delays in identifying new databases.

    Table 2 Age, size, and rate of growth of 105 clinical databases with full entry in DoCDat, August 2003

    The databases varied considerably in size, from data on a few hundred patients or episodes of care to over 130 million (table 2). Not surprisingly, smaller databases tended to have been established more recently: the five smallest had all started in the previous five years. The three largest databases recorded hospital admissions for an entire country (Hospital Episode Statistics, Scottish Morbidity Record, and Patient Episode Database for Wales). The growth rate of databases with continuous recruitment also varied considerably. Five databases acquired fewer than 100 patients or episodes of care a year. These tended to cover rare conditions (such as acromegaly or motor neurone disease). In contrast, the national hospital inpatient databases accumulated more than half a million new episodes a year.

    Security and confidentiality of data

    Most databases (71%) stored their data on a computer connected to an external network, albeit with robust firewalls to prevent intruders (table 3). Back up versions of the database were usually on disks or CD Roms (65%), although 30% of databases backed up to another computer with external connections. Almost 60% also retained data on paper forms.

    Table 3 Security and confidentiality of 105 clinical databases with full entry in DoCDat, August 2003

    Ideally, data should be reversibly anonymised (by using key codes), so as to minimise the risk of disclosing individual identities12 but to maximise possible use of the database.13 This was true for only 33%. Of the remainder, 12% were irreversibly anonymised and 55% contained patient identifiers. For most databases (72%), the patients had not been informed that personal data were being collected, and for 88%, signed consent was not obtained.

    Uses of databases

    Irrespective of the purposes for which the databases were established, most could be used for all four of the principal applications cited in the introduction. Most (75%) allowed ad hoc analyses to be conducted centrally, where the data were aggregated and stored, but only 58% allowed such analyses to be conducted locally by the providers of the data (table 4). Of the 95 databases that identified the healthcare providers, 40% produced audit reports on individual providers, but only 29% provided multicentre comparative reports.

    Table 4 Uses of 105 clinical databases with full entry in DoCDat, August 2003

    The use of databases for research was also patchy. About a third were unable to provide a bibliography of peer reviewed journal articles based on use of their database. Our search of PubMed revealed that about a quarter of the databases had not been used for any articles and a further quarter had been used in fewer than five articles each. In contrast, eight databases had each been the basis of over 100 articles. These data probably underestimate the research output as PubMed does not facilitate searches by database name and some authors may fail to mention the database they used. In addition, some articles may have appeared in journals not indexed by PubMed, and some recently established databases would not be expected to have generated any research output yet. However, we may also have overestimated the output as some articles, while authored by database custodians and associates, may not have made use of the database in question.

    Quality of the data

    We measured the quality of the data in the databases against five criteria (table 5). Over half (57%) of the databases recruited at least 90% of eligible people or episodes. This underestimates the true prevalence of good databases, because up to a third of database custodians did not know their recruitment proportion. Similarly, over 40% did not know the level of completeness of their data. Of those that did, about 70% reported high levels of completeness.

    Table 5 Quality of data in 105 clinical databases with full entry in DoCDat, August 2003

    About half the databases did not use explicit definitions for most of the variables collected. Of the 94 databases that included an outcome variable, 64% either used an independent, blinded observer (that is, one who was unaware of the intervention undergone) or this was unnecessary as the outcome was objective (usually survival). The data in almost all databases (92%) were subjected to range and consistency checks. In 27% some form of external validation (such as comparison with medical records) was also conducted.

    Associations between characteristics of databases and data quality

    Only one of the eight organisational characteristics we examined was significantly associated with database quality (table 6): databases that informed patients they were being included were more likely to have complete data. Given that 40 associations were tested, one statistically significant one (at P = 0.016) may have occurred by chance. Seven other associations were significant at the 10% level (P = 0.1): national databases were associated with better validation but poorer assessment of outcome than regional databases; approval from a professional body was associated with better validation; routine linkage was associated with better recruitment and validation; and having an epidemiologist or statistician in the management team was associated with better recruitment and validation.

    Table 6 Association between eight organisational characteristics of 105 clinical databases with full entry in DoCDat, August 2003, and five quality criteria. (Values are relative risk of characteristic being associated with good quality*)

    Discussion

    We found multicentre clinical databases in all areas of UK health care, though their distribution was uneven—cancer and surgery were better covered than mental health and obstetrics. They varied greatly in age, size, growth rates, and the geographical areas they covered. Their scope (and thus their potential uses) and the quality of the data collected also varied. The latter was not associated with any organisational characteristics. Despite impressive achievements, many faced considerable financial uncertainty.

    Considerable scope exists for improvements: greater use of nationally approved codes; more support from relevant professional organisations; greater involvement by nurses, allied health professionals, managers, and laypeople in database management teams; and more attention to data security and ensuring patient confidentiality (something that most database custodians are currently addressing in meeting the requirements of the Patient Information Advisory Group14). With some notable exceptions, the audit and research potential of most of the databases was not being realised.

    Study limitations

    Although this review provides critical information on UK clinical databases for the first time, it is limited in three ways. Firstly, we may have missed some key databases. In particular, those included were restricted to the public sector, despite our attempts to gain access to information about databases held by pharmaceutical companies. Secondly, our information was largely self reported by database custodians. Despite careful interrogation, some accounts may be unjustifiably favourable. Thirdly, the lack of statistically significant associations between organisational characteristics and data quality (with one exception) may reflect the small sample size.

    What is already known on this topic

    High quality clinical databases can support evaluative research, audit, clinical management, and planning of services

    There is a widespread lack of awareness of what databases exist in the United Kingdom, scepticism about their quality, and uncertainty as to the extent to which they are used

    What this study adds

    UK clinical databases exist in all areas of health care, varying greatly in age, size, geographical area covered, scope, and quality

    The audit and research potential of many databases is not being realised

    Considerable scope exists for improvements, which could be facilitated by a dedicated national support unit

    Implications of results

    Considerable effort and resources are expended by clinicians, methodologists, computing staff, and others to create and maintain clinical databases in Britain. Many database custodians work in isolation with little contact or support from others engaged in similar activities. The need for improvements in database organisation, data quality, and data use is widely recognised among these highly committed individuals and groups. The inception, in March 2003, of a forum for database custodians to exchange experiences and help one another has both highlighted these needs and started to meet them. However, without resources to promote and support the development of databases, such measures can have only limited impact.

    Just as clinicians are not expected to be able to organise and carry out randomised trials without the support of clinical trials units, so we should not expect high quality databases to be developed without some dedicated specialised support. This could be met by the establishment of at least one clinical database support unit. This could provide advice and assistance in organisation and management, information technology, epidemiology, and statistics. Without such an initiative, the variable picture of databases reported here is likely to persist, and their potential will not be realised.

    We thank all the database custodians for contributing to DoCDat, and Faith Hummerstone and Anton Buxton for data collection and entry.

    Contributors: NB conceived and designed the study, wrote the paper, and acts as guarantor. MB and MP carried out the analyses and commented on the paper.

    Funding: DoCDat was funded by the National Centre for Health Outcomes Development.

    Competing interests: The authors created and maintain DoCDat.

    Ethical approval: Not needed.

    References

    Pryor DB, Califf RM, Harrell FE, Hlatky MA, Lee KL, Mark DB, et al. Clinical databases. Accomplishments and unrealized potential. Med Care 1985;23: 623-47.

    McGlynn EA, Damberg CL, Kerr EA, Brook RH. Health information systems: design issues and analytic applications. Santa Monica: RAND Health, 1998.

    Black NA. High-quality clinical databases: breaking down barriers. Lancet 1999;353: 1205-6.

    Smith R. Is the NHS getting better or worse? We need better data to answer the question. BMJ 2003;327: 1239-41.

    Williams JG, Cheung WY, Cohen DR, Hutchings HA, Longo MF, Russell IT. Can randomised trials rely on existing electronic data? A feasibility study to explore the value of routine data in health technology assessment. Health Technol Assess 2003;7: iii, v-x, 1-117.

    Scottish Audit of Surgical Mortality. www.show.scot.nhs/sasm/(accessed 16 Oct 2003).

    Department of Health. Comprehensive critical care: a review of adult critical care services. London: DoH, 2000.

    Lundin J, Lundin M, Isola J, Joensuu H. A web-based system for individualised survival estimation in breast cancer. BMJ 2003;326: 29.

    Newton J, Garner S. Disease registers in England. Oxford: Institute of Health Sciences, University of Oxford, 2002.

    Black N, Payne M. Improving the use of clinical databases. BMJ 2002;324: 1194.

    Black N, Payne M. Directory of clinical databases: improving and promoting their use. Qual Saf Health Care 2003;12: 348-52.

    Lowrance WW. Learning from experience: privacy and the secondary use of data in health research. London: Nuffield Trust, 2002.

    Black NA. Secondary use of personal data for health and health services research: why identifiable data are essential. J Health Serv Res Policy 2003;8(suppl 1): 36-40.

    Higgins J. The Patient Information Advisory Group and the use of patient-identifiable data. J Health Serv Res Policy 2003;8(suppl 1): 8-11.(Nick Black, professor of )