Download - FUNCTIONAL OUTCOMES IN THE AGED WITH HIP FRACTURE: A … · iii Acknowledgements I would like to acknowledge and thank my mentor, Dr. Emil Schemitsch (Supervisor), and my Program

FUNCTIONAL OUTCOMES IN THE AGED WITH HIP

FRACTURE: A SYSTEMATIC REVIEW OF RANDOMIZED

CLINICAL TRIALS

By

Amy Milena Hoang-Kim, BscH, MA

A thesis submitted in conformity with the requirements for the degree of Masters of Science

Department of the Institute of Medical Sciences

University of Toronto

©Copyright by Amy Hoang-Kim, 2009

ii

Functional Outcomes in the Aged with Hip Fracture: A systematic review of randomized clinical trials

Master’s of Science, 2009

Amy Hoang-Kim Institute of Medical Sciences

University of Toronto

ABSTRACT

Hip fracture trials have used a wide range of patient-reported outcomes (PRO) suggesting a lack of

consensus among clinicians on what are considered the most relevant functional outcomes. We conducted

a systematic review to identify the outcomes used in hip fracture randomized controlled trials (RCTs). We

hypothesized that there had been an increase in numbers of PROs over time and the health status measure,

SF-36, would be used the most. A database search and screening yielded 86 original trials. The mean

Detsky score (and standard error) for quality was: 75.8% ± 1.76%. There was a trend in the increase of

functional outcome; however, the SF-36 was used only in (10 out of 86) 11.6% trials. Both the ADL-Katz

Index and HHS have lower respondent burden than the SF36 which may contribute to their frequent use in

hip RCTs. There is a lack of applicable measures suitable for patients with dementia.

iii

Acknowledgements

I would like to acknowledge and thank my mentor, Dr. Emil Schemitsch (Supervisor), and my Program

Advisory Committee members, Dr. Dorcas Beaton, Dr. Mohit Bhandari and Dr. Abhaya Kulkarni (and in

particular Dr. Beaton who has motivated me to write down my future goals for the next ten years). I am

deeply grateful for the academic enrichment they were able to provide, as well as their guidance and

unwavering support throughout this project.

I am thankful to have received funding from the Orthopaedic Division-Wellesley central site at St.

Michael’s Hospital in the form of the David Hastings Award for the Ontario Scholarship of Science and

Technology.

I would also like to thank my friends and colleagues who have helped me shape this project: David Santone

who screened the articles and extracted data, Professor Antonio Moroni from the Rizzoli Orthopaedic

Institute, Bologna, Italy and as well as my colleagues at the Mobility Clinical Program (Kelly, Taucha and

Katherine) at St. Michael’s Hospital, Toronto for their words of encouragement.

I would like to dedicate the thesis to my ‘parents’ Hai and Marisa and especially Hai who leads me through

the right and wrong turns in my life; because he is always by my side. I am extremely grateful to Andrea

for his patience, love and unwavering support over the years. It is because of them that I have had the

strength to see the project through to the end.

iv

TABLE OF CONTENTS

Abstract .................................................................................................................................................... ii Acknowledgements................................................................................................................................... iii Table of Contents...................................................................................................................................... iv List of Tables............................................................................................................................................ vi List of Figures........................................................................................................................................... vii List of Appendices.................................................................................................................................... viii List of Abbreviations ................................................................................................................................ ix CHAPTER 1: INTRODUCTION ............................................................................................................. 1 1.1 Objectives............................................................................................................................................. 4 1.2 Implications.......................................................................................................................................... 4 1.3 Overview of Thesis............................................................................................................................... 5 1.4 Keypoints from Chapter 1 ..................................................................................................................... 6 CHAPTER 2: BACKGROUND ............................................................................................................... 7 2.1 Introduction .......................................................................................................................................... 7 2.2 Defining physical function .................................................................................................................... 7 2.3 Conceptual frameworks ........................................................................................................................ 8 2.3.1 The Nagi scheme of disablement......................................................................................................... 9 2.3.2 The Verbrugge and Jette Model.......................................................................................................... 11 2.3.3 The Wilson and Cleary Framework ....................................................................................................13 2.3.4 The International Classification of Functioning, Disability and Health (ICF)...................................... 14 2.4 Discussion ............................................................................................................................................ 17 2.5 Summary .............................................................................................................................................19 2.6 Keypoints from Chapter Two ................................................................................................................ 20 CHAPTER 3: METHODS ........................................................................................................................ 21 3.1 Introduction .......................................................................................................................................... 21 3.1.1 Definition of the research question..................................................................................................... 22 3.1.2 Identification of the search strategy.................................................................................................... 22 3.2 Data extraction......................................................................................................................................25 3.3 Detsky Quality assessment scale ........................................................................................................... 25 3.4 Statistical analysis.................................................................................................................................26 3.5 Results.................................................................................................................................................. 26 3.5.1 Study Characteristics......................................................................................................................... 28 3.5.2 Quality assessment............................................................................................................................. 28 3.5.3 Outcome instruments.......................................................................................................................... 30 3.5.4 High quality RCTs vs. Low quality RCTs............................................................................................ 32 3.6 Discussion ............................................................................................................................................ 39 3.6.1 Randomization................................................................................................................................... 39 3.6.2 Sample size calculation...................................................................................................................... 40 3.6.3Identification of outcome ....................................................................................................................40 3.6.4Follow-up........................................................................................................................................... 40 3.6.5 Statistical methods............................................................................................................................. 41 3.7 Summary..............................................................................................................................................41 3.8 Keypoints from Chapter Three .............................................................................................................. 43 CHAPTER 4: RESULTS.......................................................................................................................... 44 4.1 Introduction .......................................................................................................................................... 44 4.2 Methods................................................................................................................................................ 44 4.3 Results.................................................................................................................................................. 47 4.3.1 Measures of Activity and Participation............................................................................................... 47 Physical Performance Test......................................................................................................................... 47 Timed-Up-and-Go-Test............................................................................................................................... 49

v

6 Minute Walk Test..................................................................................................................................... 50 4.3.2 Activities of Daily Living.................................................................................................................... 51 Index of Katz.............................................................................................................................................. 51 Barthel Index.............................................................................................................................................. 51 OARS Fillenbaum ADL............................................................................................................................... 53 Klein Bell ADL........................................................................................................................................... 54 Disability Rating Index............................................................................................................................... 55 Lawton and Brody IADL............................................................................................................................. 56 Functional status Index...............................................................................................................................56 4.3.3 Composite scores............................................................................................................................... 57 Harris Hip score......................................................................................................................................... 58 Stinchfield hip assessment........................................................................................................................... 59 Charnley Hip classification......................................................................................................................... 59 Matta functional score................................................................................................................................ 59 Hip Rating Questionnaire........................................................................................................................... 59 Salvati and Wilson score............................................................................................................................. 59 Oxford Hip Score........................................................................................................................................ 59 4.3.4 Generic Measures of Health Status..................................................................................................... 59 The Short-Form 36..................................................................................................................................... 60 Functional Status Questionnaire................................................................................................................. 61 Swed-Qual.................................................................................................................................................. 62 Nottingham Health Profile.......................................................................................................................... 63 London Handicap Scale.............................................................................................................................. 64 4.3.5 Measures of Utility............................................................................................................................. 66 EuroQol-5D............................................................................................................................................... 66 4.4 Discussion ............................................................................................................................................ 69 4.5 Summary..............................................................................................................................................72 4.6 Keypoints in Chapter Four .................................................................................................................... 73 CHAPTER 5: SUBGROUP ANALYSES................................................................................................. 74 5.1 Introduction .......................................................................................................................................... 74 5.2 Methods................................................................................................................................................ 75 5.3 Results.................................................................................................................................................. 76 5.3.1 Trials including & excluding patients with dementia........................................................................... 76 5.3.2 Diagnosis and screening dementia..................................................................................................... 77 5.3.3 Diagnosing emotional status.............................................................................................................. 77 5.3.4 Outcome measures............................................................................................................................. 77 5.4 Discussion ............................................................................................................................................ 78 5.5 Ethical considerations ........................................................................................................................... 79 5.6 Summary .............................................................................................................................................80 5.7 Keypoints from Chapter Five ................................................................................................................ 87 CHAPTER 6: CONCLUSIONS ............................................................................................................... 88 6.1 Implications of findings ........................................................................................................................ 88 6.2 Limitations ........................................................................................................................................... 88 6.3 Future directions................................................................................................................................... 89 6.4 Summary..............................................................................................................................................90 REFERENCES......................................................................................................................................... 91 APPENDICES .......................................................................................................................................... 105

vi

LIST OF TABLES

Table 2.1 The domains of Activities and Participation component of the ICF.............................................. 15 Table 3.1 The Levels of Evidence .............................................................................................................. 23 Table 3.2 Characteristics of the 86 included studies listed in alphabetical order........................................... 29

Table 3.3 Data extraction of outcome measurements................................................................................... 35 Table 4.1 Composite scores........................................................................................................................ 59 Table 4.2 Trials with generic health status or utility measures ..................................................................... 65 Table 4.3 Instrument characteristics measuring health-status or utility......................................................... 68

Table 5.1 The spectrum of cognitive impairment ........................................................................................ 75

Table 5.2 List of trials with outcomes stratified by patient subtype.............................................................. 82 Table 5.3 Group A trials including patients with severe co-morbidities ....................................................... 84

vii

LIST OF FIGURES Figure 2.1 Verbrugge and Jette’s model of the Disablement Process ........................................................... 12

Figure 2.2 The ICF .................................................................................................................................... 17

Figure 3.1 Search strategy.......................................................................................................................... 27

Figure 3.2 Comparison of Category 1 versus Category 2 trials .................................................................... 31

Figure 3.3 Frequency use of instruments .................................................................................................... 32

Figure 3.4 The number of RCTs that measure each ICF concept ................................................................. 33 Figure 3.5 The prevalence of investigator-developed instruments................................................................ 34 Figure 4.1 Composite scores contain different ICF concepts ....................................................................... 58 Figure 4.2 The applicability of instrument types ......................................................................................... 67 Figure 5.1 Diagnostic tools used to assess co-morbidity in high quality trials .............................................. 81 Figure 5.2 Diagnostic tools used to assess cognitive impairment in high quality trials ................................. 81

viii

LIST OF APPENDICES Appendix 1 Search strategy........................................................................................................................ 106

Appendix 2 21-point Detsky scoring system............................................................................................... 109

Appendix 3 Quality assessment scores ....................................................................................................... 110

Appendix 4 Included trials ......................................................................................................................... 112

Appendix 5 Inclusion and exclusion criteria ............................................................................................... 123

ix

LIST OF ABBREVIATIONS

AAS Ambulation activity score

ADL Activities of Daily Living

ADL – KATZ Katz index of Activities of Daily Living

ASA American Anesthesiologist Society

BADL Barthel Index of Activities of Daily Living

BDR Blessed Dementia Rating

BMD Bone Mineral Density

CAM Confusion Assessment Method

CBI Chinese Barthel Index

COVS Swedish version of clinical outcome variables

DRI Disability Rating Index

DSM-III-R Diagnostic and statistic manual of mental disorders

EBM Evidence-based medicine

EQ-5D EUROQOL 5D

FAC Functional Ambulation Classification

FIM Functional Independence Measure

FPM Functional Performance Measure

FRT Functional Reach Test

FSI Functional Status Index

FSQ Functional Status Questionnaire

HHS Harris Hip Score

HRQ Hip Rating Questionnaire

HRQL Health-Related Quality of Life

IADL Instrumental Activities of Daily Living

ICF International Classification of Functioning, Disability and Health

ICIDH International Classification of Impairments, Disability and Handicaps

LEP Leg extensor power rig

MBI Modified Barthel Index

MCS Mental Component Score

x

LIST OF ABBREVIATIONS

MFA Musculoskeletal Functional Assessment

MMSE Mini Mental State Examination

MTS Mental Test Score

MWT Minute walk test

NHP Nottingham Health Profile

OARS Older American’s Resources and Services

OHS Oxford hip score

PCS Physical Component Score

POMA Performance Oriented Assessment of Balance

PPME Physical Performance and Mobility Examination

PPT Physical Performance Test

PRO Patient-reported Outcomes

RCT Randomized Controlled Trial

RM Repeated Measure

ROM Range of Motion

RSA Radiostereometric Analysis

SF-36 Short Form 36

SIP Sickness Impact Profile

SPMSQ Short portable mental status questionnaire

SWED-QUAL Swedish Health-Related Quality of Life Survey

TAD Tip Apex Distance

TUG Timed up and Go Test

VAS Visual Analogue Scale

WHO World Health Organization

WHO-QOL BREF World Health Organization Quality of Life

WOMAC Western Ontario and McMaster Universities Index

1

CHAPTER 1

INTRODUCTION

In Canada, the percentage of the population over 65 years of age is projected to increase to 15% in 2011

and to 25% by the year 2036.1,2 The incidence of hip fractures also increases substantially with age, rising

from 22.5 and 23.9 per 100,000 population at age 50 to 630.2 and 1289.3 per 100,000 population by age

80, for men and women respectively.3-6

The elderly are at a major risk for a first hip fracture. The normal decrease in bone mineral density may

account for 60% to 85% of the variation in fracture risk.7 In a recent analysis addressing fracture rates in

the province of Ontario, it has been reported that since 1997 the hip fracture rate has begun to decrease and

the population projections suggested that this decrease will continue to a rate of 33.1 per 10,000 in 2005.

The suggestion was in fact confirmed in an updated projection with actual data for hip fractures in 2003-

2005.8,9 In Ontario, Canada there is universal health care; and as such, BMD testing increased ten-fold and

prescriptions for bone-sparing medications also increased from 12, 298 in 1996 to 225, 580 in 2003 which

can account for the falling hip fracture rates.8 Other contributing factors for first-time hip fractures include

cognitive impairment as well as impaired mobility and poor visual depth perception.7,10 In contrast,

dizziness and poor or fair self-perceived health care characteristics are predictive of a second hip fracture.10

In another study, the investigators found that the rate of Parkinson’s disease, dementia and audiovisual

impairment was statistically higher before the second operation.11 The one-year risk of a subsequent

fracture varies from approximately 2% to 10%.12-14 The lifetime risk of a second hip fracture has been

estimated at 20% but may be as high as 55%.15

The consequences of these fractures are considerable both for patients and healthcare services. Only half of

these patients regain full autonomy.16 Healthcare resources include initial hospitalization, re-

hospitalization, rehabilitation, chronic care, home care, long-term care (LTC) and informal care. The total

annual economic implications of hip fractures in Canada are estimated to be $650 million currently and are

expected to rise to $2.4 billion by 2041.16 A longitudinal cohort study analyzing levels of health service

2

use, institutional care and their associated costs, found that one year costs were significantly different for

patients who returned to the community ($21385) versus those who were transferred to ($44156) or

readmitted to LTC facilities ($33729).16 In the absence of an appropriate comparison group, all costs during

the period following hospital discharge are likely to be attributed to the hip fracture, while in reality, the

presence of co-morbidities in the aged have a considerable impact.17 A study by Bernardini, investigated

co-morbidity and adverse clinical events in patients over the age of 65 admitted to a geriatric rehabilitation

unit after proximal hip fracture.18 These patients had complex, multiple, interacting pathologies, with 78%

having significant co-morbidity. Hip fractures are the second leading cause of hospitalization for the aged

and the added co-morbidity in these patients lengthen hospital stays.19-20

Traditionally, clinical success in the treatment of hip fracture has been measured in terms of mortality,

physiologic changes or improvement in physical function.21-23 In a recent ten-year review of the

orthopaedic literature, there was an overall 10% increase in the use of patient-reported outcomes (PRO) in

observational and randomized controlled trials.24 The use of PROs could minimize the bias associated with

collecting information about the functional result following surgery, since it has been known that patients

are not inclined to report actual functional information to the surgeon who they respect because he

performed the intervention.25 The limitations of a patient-reported outcome is that it provides subjective

information but gathering objective data is equally important. Functional outcomes can provide the best

possible basis for quantifying the expected outcomes, and these must be considered in the context of the

patient’s values and preferences about the expected outcomes of a decision.26, 27 Jette defines recovery from

functional disability as encompassing psychological and social recovery as well as physical recovery.28 The

four target domains that contribute to functional outcomes can be viewed as physical, mental, emotional

and social in nature.28 In treating the aged with hip fractures, there is a need to facilitate clinical decisions

where surgeons must weigh, either explicitly or implicitly, the expected benefits of a particular intervention

whether surgical, medical or rehabilitative against the potential harm and cost.27, 29 However, without a

common metric to measure these targets we would be unable to compare results across trials and guide

clinical decision-making.

3

According to McDowell and Newell, the measurement of a concept like health or functional recovery relies

on collating information from various indicators representing components of the overall concept.30 In

addition, a dimension (or domain) refers to the area of experience or behaviour one is trying to measure.31

From a theoretical standpoint, “a person does not have a disability, a person experiences it”.32 The high

degree of variability is attributed to the perception that biology provides the foundation for impairment

which subsequently causes the disability.33-36 The choice of an appropriate disability conceptual framework

to classify different domains and instruments will be later discussed in Chapter Two.

The detailed methodology will be reported in Chapter Three. A systematic review of randomized trials if

conducted with rigour in identifying, critically appraising and synthesizing data of the best available

evidence can facilitate clinical decision-making.37 The evidence retrieved may potentially contradict

clinical expertise, physiologic rationale and medical training.

The design and conduct of good comparative studies in this context relies upon the choice of valid

instruments that are reliable, and responsive.38 Should the instrument assessing functional outcomes prove

to have good psychometric properties, the value of the published literature would be enhanced.25 However,

pragmatic qualities such as the applicability of such instruments in hip fracture trials should also be

considered in addition to the psychometric properties. In Chapter Four, we will compare the logistical

choice of functional outcome instruments. In particular, whether the instrument was a burden to administer,

require additional training, and have an adequate score distribution as well as format compatibility.

Furthermore as different and severe co-morbidities exist in older patient populations, clinical results may

not be representative of all patient types presenting with hip fractures. If patient-reported outcomes are

used only for healthy and lucid patients, PROs will be ineffective in the 31 to 45% of patients in Geriatric

Orthopedic Rehabilitation Units who are reported to have cognitive impairment.39 Patients that fall into this

category are often challenging to deal with because of a lack of compliance or mortality rates that lead to a

loss in follow-up. Randomized controlled trials should diagnose co-morbidities and screen patients for

either inclusion or exclusion according to the research question and monitor higher degrees of disability via

4

functional outcome assessment. In a literature review, Wells et al concluded with evidence that: 1) frail

older rehabilitation candidates with mild to moderate dementia should not be excluded from rehabilitation

and 2) frail older patients should be screened for cognitive impairment.20 This patient subtype will be

further explored in hip fracture trials in Chapter Five.

1.1 Objectives

The purpose of our study was to conduct a systematic review of the instruments used to assess functional

outcomes in randomized controlled trials in the aged with hip fractures. We believe that the patient’s

perspective on their functional ability to perform daily activities following hip fracture treatment is a

critical factor and indicator of treatment effectiveness. We hypothesized that over time there had been an

increase in instruments assessing functional outcome. We also wanted to see if the increase over time

would correlate with a common metric for measuring functional outcome in patients with hip fracture.

1.2 Implications

Functional outcome or the ability to carry out an individual’s daily activity represents a way to quantify the

burden of having a hip fracture. Functional outcome is a challenge that can neither be determined directly

nor described by a single variable but it is important to identify a common metric to assess the

postoperative management of hip fracture patients in order to validate research findings. Impaired physical

function has been linked to many indicators of increased health services utilization and has become

fundamental for researchers, clinicians and funding agencies

We can gain greater understanding of the patient’s perspective by using the appropriate instruments that

measure all aspects of functional recovery. Health in the aged is often compromised by various co-

morbidities of differing levels of severity. These areas need to be further investigated with the aim of

finding a common metric to assess specific populations implicating a change in the future conduct of

Evidence Based Medicine. My thesis will clarify whether patients with dementia were included in hip

fracture randomized trials. The implications of obtaining results which are not representative of all patients

are major, and future research needs to investigate the potential use of proxy responders.

5

1.3 Overview of the Thesis

The overall aim of this thesis is to focus on finding a common metric to assess functional outcome in those

with hip fractures over the age of 65 years.

This thesis is presented in a series of 6 chapters. Chapter 2 defines the concepts related to outcome using

the International Classification of Functioning, Disability and Health (ICF) to classify the domains and

instruments found in the randomized controlled trials. Chapter 3 describes the methodology used to define

and identify articles to be used for a systematic review of outcomes and to describe those articles. Chapter

4 describes the taxonomy relevant to the applicability of instruments and describes those instruments used.

The objective of Chapter 5 is to identify a common metric in those trials, which have included patients with

severe cognitive impairment. Chapter 6 will describe the implications of the systematic review and

limitations of the study. This final chapter also addresses the possibilities for future investigation.

6

Keypoints from Chapter 1 • This is a systematic review aimed to review the

outcomes assessed in RCTs evaluating the aged following a hip fracture

• Our research questions: 1) Is there an increase in the use of functional

outcome assessment in studies of hip fracture management and rehabilitation from 1990 to 2007?

2) What outcome measures are being fielded in RCTs of hip fracture? Are there differences between high and low quality trials?

3) How practical are these to use in the aged with hip fracture? In those with dementia? What are the benefits of investigator-developed vs. non-investigator developed instruments and performance-based vs. impairment-based composite scores vs. multi-attribute scoring systems?

• Implications of this research: • Care will be based on EB Guideline care maps • Outcomes need to be broad enough to describe

the full impact • The exclusion of those with dementia can create

7

CHAPTER 2

BACKGROUND

2.1 Introduction

Variability in functional outcome reporting in hip fracture management could create challenges in the

comparison of results across trials. In order to better define functional outcome, we believed that the ICF

framework would be appropriate to understand function which encompasses 4 principle domains: physical,

mental, social and vocational. In this chapter, four frameworks for disability are reviewed and compared.

The International Classification of functioning, Disability and Health (ICF) conceptual framework is an

accepted language and framework, to describe and classify concepts and domains as well as regions of

overlap and their relationship to one another.40, 41 We used this model to discriminate trials without

functional outcomes (Category 1) from those trials containing functional outcomes (Category 2). For

instance, those trials reporting on radiographic analysis and screw osteointegration would not contain any

functional outcomes. These measure the concept of impairment under the category of Body Function and

Structure within the ICF model.

2.2 Defining physical function

There is a lack of consistent language and uniform definitions when defining physical function. Words such

as physical function, functional status, physical ability, and disability have all been used interchangeably.21,

42-44 Undeniably, consistent terminology is fundamental because the scope and content of measures used to

quantify physical function often originate or are derived from the terms used to define physical function.

For example, the definition of disability is operationalized when researchers use Activities of Daily Living

(ADL) 45 and Instrumental Activities of Daily Living (IADLs).46 The scale of IADL extends the ADLs to

cover activities that require higher levels of cognition and/or finer motor skills. For example, IADL can rate

individuals on their ability to carry out eight activities, the ability to use the telephone, shopping, food

preparation, housekeeping, laundry, ability to travel (by car or public transit), responsibility for taking own

medication, and ability to handle finances. Scores range from 0 (complete independence) to 8 (complete

dependence).46 These terms have led to a wide variety of measures with different content. These scales

8

have helped in better defining different phases within the disablement process for the elderly in terms of

which level of activity involving the higher levels of cognition have not been recovered following a hip

fracture treatment. In understanding those patient subtypes, those not capable of performing IADLs and

capable of performing ADLs could be a better measure of outcome based on the patient’s functional

recovery.

Physical function has been evaluated using varied modalities including self-report, proxy report (from

family or medical professionals), and observed physical activities (i.e. measured distance walks or timed

sit-to-stand). This ultimately makes comparison across studies difficult resulting in potential confusion

regarding what is actually being measured.47

Furthermore, there is often a difference between what a person can do, or is capable of doing and what they

actually do and this difference may be affected by how questions are worded or how much encouragement

is given to a patient during a test.48, 49 Finally, within each of these modalities, there are a number of issues

such as framing of questionnaires or whether questionnaires are self- or investigator-administered.

2.3 Conceptual frameworks

In the past, much of the medical literature was conceptually based upon a clinical paradigm which focused

on etiology, pathology, and clinical outcome.35 In turn, disability in the biomedical model was assumed to

be a problem related to the individual that medical management would solve. The biomedical model

sharply contrasts with the biopsychosocial paradigm or the “quality of life” model. The social paradigm

views overall functioning and well-being as a measure of complex behaviours and feelings. For instance,

Nagi defines disability as a social construction and therefore additional factors such as the interaction of a

person with his or her environment are considered.50

Over the years, many attempts have been made to fuse the two schools of thought. For example, the Wilson

and Cleary model adopts a single taxonomy of patient outcome according to underlying health concepts.35

The ICF offers an alternative perspective adopting a dynamic interaction that is based on a biopsychosocial

9

conception of disability. The selection of the ICF model for the classification of functional outcomes was

chosen because it could place outcome in the context of variables important to physical function.

A conceptual framework or scheme goes beyond just representing a scientific model that defines major

concepts. It is also used to generate hypotheses about the interrelationships of different variables included

in the scheme. Not in the least, these models are guidance for subsequent measurement. The problem is the

lack of a universally accepted framework; however, the main purpose of the ICF to provide a common

language to describe disability concepts has made the framework widely popular.51 However, what remains

fundamental in scientifically testing and in determining the importance of clinical variables is conceptual

clarity, 52 these models will now be examined in greater detail.

2.3.1 The Nagi Scheme of Disablement

In the early 1960’s Saad Nagi described a framework for disablement distinguishing between four

concepts: active pathology, impairment, functional limitations and disability (Figure 1). Nagi’s basic

framework was subsequently adopted by the Institute of Medicine (IOM) and is now often referred to as the

IOM scheme and has been adapted by the National Centre for Medical and Rehabilitation Research.53

Nagi first began by attempting to define the relationships between active pathology, sickness and illness,

impairments, functional limitations, and disability. Nagi differentiated between active pathology (a term he

used to describe the behaviour resulting from an acute condition of usually short duration and ‘Illness’) or

‘sickness’ (a term resulting from chronic conditions). 54 According to Nagi, ‘Disability’ can be present in

the absence of disease. Disability can result from genetic factors or injury that give rise to permanent

impairment, but are not considered sickness or illness. 54

Active Pathology may result from an infection, trauma, or metabolic or degenerative disease and “involves

an interruption or interference with normal processes, and the simultaneous efforts of the organism to a

10

regain a normal state”.54 The efforts to regain a normal state can be impeded by a compromised immune

system (for example), or aided by medical or surgical intervention.54

Impairment indicates an anatomical, physiological, mental or emotional, loss or abnormality. The

distinction between an active pathology and impairment is that pathology is considered impairment, while

not all impairment results from pathology (as in the case of congenital abnormalities). Impairments affect

function at the cellular level and vary in their degree of impact. Manifestation of impairment is influenced

by the course of the underlying pathology, prognosis, progression or recovery, and point of onset in the

lifestyle of the individual. A hip fracture patient following an intervention could exhibit confusion and

depression which implies either a mental impairment combined with an emotional one.

Functional Limitations are “limits in performance at the level of the whole organism or person”. 54 The

difference between impairment and functional limitation is the level at which the limitations are

manifested. Functional limitations result from the cumulative effect of one or more impairments to the

point where the function of the individual is affected. For example, the inability to walk up stairs may be

the result of nerve or muscle cells damage, or may be the result of heart or lung impairment. Functional

limitations are noticeable manifestations of one or more underlying impairments.54

Disability is “a limitation in performance of socially defined roles and tasks within a socio-cultural and

physical environment”. 54 Not all impairments or functional limitations result in a disability and a disability

may be restricted to specific areas of social functioning and not affect others. A patient with hip fracture

may be unable to integrate back into the community by not being able to organize an event as efficiently as

before. Nagi identifies several factors that influence the severity of a disability such as the reaction and

expectations of the individual and those significant to them, and characteristic of the environment and the

social and cultural barrier within it. 54

11

2.3.2 The Disablement Process – Verbrugge and Jette

Lois Verbrugge and Alan Jette present a socio-medical model of disability referred to as the Disablement

Process.42

The Disablement Process model is designed to be applicable to chronic, acute, and acquired illnesses across

the lifespan. Verbrugge and Jette have defined “Disablement” as to “the impact that chronic and acute

conditions have on both body systems and on people’s ability to function in the expected and desired way

of their society”.42 The Disablement Process combines the underlying framework of the Nagi scheme and is

designed for ease of use in medical research.50, 55 One of the most obvious differences between the

Verbrugge and Jette model and those preceding it is the inclusion of risk factors and extra and intra-

individual factors that mediate and influence the progression of the disability through the main pathway.42

Specifically, in the Disablement model, Pathology is defined as “biochemical and physiological

abnormalities that are detected and medically labeled as disease, injury, or congenital/developmental

conditions”.42 Pathologies can be acute such as short-term disease or injury (defined as less than 3 months

duration), or chronic, such as injuries with long consequences or degenerative diseases. Pathology is

usually identified as medical diagnoses, and examples include osteoporosis, cancer, cataracts, and cerebral

palsy. Pathologies can also be congenital or biochemical and may be detected through signs, symptoms, or

specialized tests.

12

Figure 1 Verbrugge and Jette’s model of the Disablement Process

*Figure adapted from with permission from Verbrugge and Jette (1994)

Impairments are defined as “dysfunctions and structural abnormalities that are significant enough to have

consequences for physical, mental or social functioning”.42 Impairments can be immediate or delayed and

can occur in the same location as the underlying pathology or may manifest in secondary sites. Here, an

impairment can be considered a loss of balance as measured by the Berg balance test or pain measured by a

visual analogue scale.

Functional Limitations are defined as restrictions in fundamental physical and mental activities.42

Fundamental physical activities include overall mobility, discrete motions and strengths, trouble seeing,

hearing, and communicating. Fundamental mental activities include central cognitive and emotional

processes. Difficulties in these activities can manifest themselves diversely, as in the ability to climb stairs,

read large print, produce intelligible speech, be oriented in time and place, lift objects or remain alert.

Pathology Impairments Functional Limitations Disability

Extra-Individual Factors Medical care and rehabilitation Medications and other therapeutic regimens External supports Built, physical, and social environments

Intra-individual factors Lifestyle & behaviour changes Psychosocial attributes & coping Activity accommodations

Risk Factors

13

Functional limitations can be moderated by external factors Verbrugge termed “extra-individual and intra-

individual”.42 These factors serve to lessen the impact of the functional limitation on the individual and

reduce the extent of the observable disability. Extra-individual factors include medical care and

rehabilitation, medications and other therapeutic regimens, external supports, and modifications to

structural, physical and social environments. A hip fracture patient with the aid of a walking cane can walk

to the grocery store. Intra-individual factors are changes or modifications the individual is capable of

making and include lifestyle and behavior changes, psychological attributes and coping mechanisms, and

activity accommodations. For instance, one may take be able to complete an activity by increasing the

length of time.

2.3.3 The Wilson and Cleary Framework

In the Wilson and Cleary framework, the assessment of biological and physiological factors focuses on

“the function of cells, organs and organ systems”.35 Examples include diagnoses such as pulmonary

tuberculosis, inflammatory bowel disease, or laboratory values such as serum hemoglobin or creatinine;

measures of physiological function such as pulmonary function tests; and physical examination findings

such as systolic ejection murmur, pulmonary wheezes, or splenomegaly.

A Symptom is the patient’s perception of “an abnormal physical, emotional, or cognitive state”. 35

Psychophysical symptoms have been referred to as symptoms “primarily associated with mental health” or

symptoms not clearly physical or psychological in origin.

Functional status assesses the ability of the individual to perform particular defined tasks. Four domains of

functioning that are commonly measured are physical function, social function, role function, and

psychological function. Here, measures of symptoms that are specific to the hip will prove to be excellent

predictors of function.

General health perceptions are strong predictors of mortality and also of the use of general medical and

mental health services. Overall quality of life is the assessment of the respondents’ subjective well-being

14

with general measures of how happy and/or satisfied they are with their life as a whole. Patient preferences

or values are evident at several levels in this model and in particular with general health perceptions and

overall quality of life. Certain symptoms are more burdensome than others, and those individuals will

prefer not to have those symptoms even if they may have to tolerate others.

Utility assessment is a methodology that elicits patients’ values for particular states of health, and utilities

are often used as an overall measure of health-related quality of life (HRQL). Whereas most HRQL

measures assess particular health states, utility measures assess the value that individuals assign to different

states. One of the conceptual problems with using utility assessment to measure HRQL is the way utilities

are measured. That is, they involve other factors beyond just health status, including the value patients

place on life, their risk aversion, or their attitudes toward certain types of medical intervention.

2.3.4 The International Classification of Functioning, Disability and Health (ICF)

The WHO began work on The International Classification of Functioning, Disability and Health 40 (ICF),

an update of the ICIDH55, and formally known as the International Classification of Impairments, Disability

and Handicaps—version 2 (ICIDH-2). The ICF was ratified in 2001 and is a classification of disablements

that group consequences associated with diseases, disorders or injuries, and puts greater emphasis on the

role played by social and environmental factors. Functioning and disability are described in the ICF in

terms of the dynamic interaction between health condition, personal factors, and the environment (Figure

2).42 The ICF is not only a classification system for the impact of disease it is also a theoretical framework

for a relationship between variables. The ICF places the emphasis on function rather than condition or

disease. The ICF provides a description of situations with regard to human functioning and its restrictions.

The information is organized in two parts: Part 1 deals with Functioning and Disability; while, Part 2

covers Contextual Factors.40 Each part has two components:

The Body component comprises two classifications, one for functions of body systems, and one for body

structures. Activities may be limited in nature, duration and quality.42 Activity limitations are referred to as

disabilities and are scaled by difficulties and whether assistance is needed in order to carry out the activity.

15

The ICF has been identified by the American National Committee on Vital and Health Statistics (NCVHS)

as the only viable code set for reporting functional status.56 The ICF has limitations and is not in common

use because of unclear definitions of concepts.42 In fact, recently there are suggestions for new components

within the Activity and Participation ICF Classification according to Badley’s new interpretation of the

ICF.57 According to Badley, Activity and Participation is divided into three components: Acts, Tasks and

Societal Involvement.57 Acts are the general things that a person can do independent of context or purpose.

Tasks are the purposeful things people do in daily life in a specific context. 57 Societal involvements is the

individual in the context of society and acknowledged areas of human endeavor. 57 The social role is what

defines societal involvement and not the acts and tasks that may or may not be involved. 57

The Activities and Participation component of the ICF covers aspects of functioning from both an

individual and a societal perspective. There are 617 activity items that fall under the nine domains shown in

Table 1.

Table 1 The domains for the Activities and Participation component of the ICF

Learning and applying knowledge

General tasks and demands

Communication

Mobility

Self-care

Domestic Life

Interpersonal interactions and Relationships

Major life areas

Community, social and civic life

16

In the ICF scheme, Participation is involvement in a life situation. Activity Limitations are difficulties an

individual may have in executing activities.40 Participation Restrictions are problems an individual may

experience in involvement in life situations.40

The components of contextual factors of the ICF have been subdivided into Environmental Factors, which

has an impact on all components of functioning and disability and are organized in sequence from the

individual’s most immediate environment to the general environment, and Personal Factors, also a

component of contextual factors but are not classified in the ICF because of the large social and cultural

variance associated with them.40

The components of Functioning and Disability in the ICF can be used to indicate problems (e.g.

impairment, activity limitation or participation restriction which describe an overall state of disability) or

they do not indicate a problematic aspect of health and health-related states summarized under the umbrella

term functioning. Body functions and structures can be interpreted by means of changes in physiological

systems or in anatomical structures.40

For the Activities and Participation component, two constructs are available: capacity and performance. A

person’s functioning and disability is conceived as a dynamic interaction between Impairment (Body

Function and Structure), Activity Limitation, Participation Restriction and Contextual Factors. As

indicated above, Contextual Factors include both personal and environmental factors. ICF includes a

comprehensive list of environmental factors as an essential component of the classification. Environmental

Factors interact with all the components of functioning and disability. The basic construct of the

Environmental Factors component is the facilitating or hindering impact of features of the physical, social

and attitudinal world.40

17

Figure 2 The ICF *Figure adapted with permission from (WHO 2001)40

2.4 Discussion

All of the frameworks described above are similar with respect to their gross categorization of disability

progression from the cellular level to social functioning. All models describe some form of disruption in

the usual function of the human body at the cellular level, be it through injury, genetic or congenital

abnormality, or acute or chronic disease. The disease or pathology then disrupts normal function of an

organ or system to the point where it becomes noticed by the individual and may be diagnosed and some

treatment (where possible) may be instigated. Subsequently there is an interference with performing

activities considered normal for a particular age-gender population, and some modification of the physical

environment is often required to ameliorate the issues presented.

In the Verbrugge and Jette scheme, disability refers to an individual’s capacity to perform daily activities in

life. 42 This ultimately takes into consideration the involvement of the individual based on what social

activities or household chores they are typically involved in based on their age and sex group due to a

health or physical problem.42 In comparison to the Nagi schemes and more in keeping with the ICF,

Verbrugge and Jette place a greater emphasis on the inter-relatedness of the different aspects of their model

Health Condition (Disorder or Disease)

Body Function And Structures

Activity Limitation

Participation Restriction

Environmental Factors Personal Factors

18

and the inter-connectedness with the personal and environmental factors. However, they have been

criticized for lacking enough distinction between functional limitations and disability.48 For instance, if an

individual is asked about climbing stairs, this could be considered a functional limitation rather than being

classified as disability which places an emphasis on the daily social activity of using stairs to get to the

subway platform for instance. The latter example could lead to some confusion.

Similarly, distinguishing between the ICF’s categories of Activity Limitation and Participation Restriction

can be challenging. In a recent analysis, Badley states that these distinctions are not equally divided and in

reality are more staggered because the definitions are not exactly the same.57 The validity of Badley’s

interpretation still is questionable because it is recent and has not been validated or accepted as a universal

modification of the ICF. However in the future, the ICF classifications may need to be adjusted to reflect

the multi-item scales and summary scale scores that may not clearly fit these sub-domains. For example,

some instruments claim to measure “health-related quality of life” or “quality of life”. An examination of

the domains and items included may demonstrate that the domains are precisely those that focus

specifically on disability or other aspects of physical or cognitive functioning. It is therefore necessary to

go to the domain level and review the items that are actually included before making a judgment about

what an instrument measures. Furthermore, certain domains may be too specific in structure or functions

and may require specialized information that individuals are unable to self-report in a valid way. It may

therefore be that self-reported information on acts, tasks and societal involvement is a better operational

way of measuring the underlying health concept even if these are proxy measures of the health state. For

the purposes of this thesis, we will blend the two categories of Activity Limitation and Participation

Restriction into one category Activity and Participation. Ultimately, functional recovery will be assessed by

instruments classified by their domains under the umbrella term Activity and Participation which will

represent an individual’s expectation in being able to perform activities in both their daily and social life

with the burden of a hip fracture.

19

2.5 Summary

We propose for the current study the use of the ICF Model, a universal language to discern concepts from

the domains in the instruments found in the systematic review. Furthermore, it is critical to use a conceptual

framework that contains definitions of functional outcome, which could better clarify areas of overlap and

neglect in RCTs enrolling aged patients with hip fractures. According to the literature, measurement has

four major purposes: classification, discrimination between groups, prediction, and evaluating change.58

However, the definitions of the conceptual frameworks are still lacking in clarity and the problem is that

the instruments developed prior to the conceptual frameworks are trying to be fit into new concepts. The

direct application of a framework like ICF is problematic because of misfit between its concepts and the

current instrument targets and content. The future could lead to a rethinking of the framework or the choice

of instruments to fit the unique domains within the framework.

20

Keypoints from Chapter Two

• Functional outcome is the assessment of an individual’s capacity to carry out daily activities

• Several conceptual frameworks used to define disablement are reviewed. 1) The Wilson and Cleary is a biomedical model, both 2) Nagi and 3) Verbrugge are a socio-medical model, and 4) the ICF is a biopsychosocial model

• We used the International Classification of functioning, Disability and Health (ICF) model because it is an accepted universal language

• Activities and Participation domains are: learning and applying knowledge, general tasks and demands, communication, mobility, self-care, domestic life, interpersonal interactions and Relationships, major life areas, community, social and civic life

• In the future, the ICF may need to adjust to reflect the multi-item scales and summary scale scores that may not clearly fit these sub-domains

21

CHAPTER 3 METHODS

3.1 Introduction Systematic reviews are concise summaries of the best available evidence that address sharply defined

clinical questions.37 Clinical decision-making is then facilitated by empirical evidence which may well

contradict clinical expertise, physiologic rationale and medical training. EBM allows a quantitative and

“blind” approach to evaluating treatment outcomes. Because there is an overwhelming amount of published

literature, it is important to be able to extract relevant data in a systematic way and to eliminate “noise” or

data from lower quality studies. Indeed, the hierarchy of evidence levels was adopted by the Journal of

Bone and Joint Surgery as a way to classify the published articles. In Table 1 the classification of Levels of

Evidence for Primary Research Question for therapeutic trials investigating the results of treatment are

included (this excludes diagnostic studies, prognostic studies, and economic analyses).59 According to this

hierarchy for therapeutic trials, Level 1 randomized trials contribute the highest scientific value. On the

opposite pole, expert opinion whether from the surgeon’s perspective or from the scientist or researcher is

considered to contribute the least value and classified as Level 5.

A randomized controlled trial (RCT) is defined by the random allocation of patients to intervention group

and followed over time for an outcome of interest. An RCT, when conducted properly, can avoid selection

or confounding biases and provide the most objective basis for quantifying treatment effectiveness.

Randomization minimizes the difference between two comparative groups so that the effects of treatment

are not confounded by differences. Usually these would be patient characteristics such as age, co-morbidity

and severity of illness that are relatively evenly distributed within the two groups and thus do not affect

results. This reduces the confounding bias. In prospective studies (Level II), data can be collected

specifically for the purposes of the study which adds to the accuracy of the data whereas retrospective

cohorts may present insufficient quality data to address the specific research question. In a case-control

study (Level III) the exposure is measured, reconstructed or recollected after the development of the

disease.

22

3.1.1 Defining the research question

For a systematic review, one formulates a focused clinical question as opposed to a question broad in scope

typical of a narrative review. For the purposes of this study, we conducted a systematic review of

randomized clinical therapeutic trials (Level 1) in the aged with hip fractures in order to identify the

functional outcome instruments utilized in these studies. Functional outcomes are used to quantify a

person’s capacity to perform certain daily activities; therefore, we wanted to identify those instruments

used to assess functional outcomes following surgical treatment of a hip fracture. This would allow us then

to determine the EBM guidelines for the aged with hip fracture by determining a common metric across the

trials.

3.1.2 Identifying the search strategy

A systematic review comprise a comprehensive and explicit search strategy, whereas a narrative review

does not usually specify an explicit search strategy which could potentially bias the results. In our study, a

comprehensive search of the literature from 1950 to 2007 (week 38) was conducted independently by a

librarian at St. Michael’s Hospital Medical Library (Toronto, Canada) using multiple electronic databases

such as PUBMED, OVID Medline, EMBASE, Cochrane, Ovid Healthstar, and CINAHL [Appendix 1].

The search strategy used included subject headings such as: ‘hip fracture’, and the narrower terms and

keywords such as ‘trochanteric, subtrochanteric, intertrochanteric, pertrochanteric, intracapsular,

extracapsular fracture’ with age “age 65 and over”. Studies in the English language were included if

patients had had surgical treatment of a hip fracture, were randomized, and a mean age of at least 65 years

old.

Studies were excluded based on the following criteria: patients had fractures other than hip, if the minimum

age at patient enrolment was less than 50 years old and if only pre-operative or peri-operative measures

were investigated. Only one reason was sufficient for exclusion. The combined searches from all the

electronic sources were placed in a Reference Manager database (Reference Manager Professional Edition

23

Table 1 Levels of Evidence for the Primary Research Question for therapeutic studies adapted from the

Journal of Bone and Joint Surgery59

Level 1 • High quality randomized controlled trial with statistically significant

difference or no statistically significant difference but narrow confidence intervals

• Systematic review of Level I randomized controlled trials (and study results were homogeneous)

Level II • Lesser-quality randomized controlled trial (e.g. <80% follow-up, no blinding

or improper randomization) • Prospective comparative study • Systematic review of Level II studies or Level-I studies with inconsistent

results

Level III

• Case-control study • Retrospective comparative study • Systematic review of Level-III studies

Level IV • Case series

Level V • Expert opinion

System, Version 11). Abstracts and titles were hand searched independently by two reviewers (AHK,

DS). Potentially eligible articles were retrieved and references were screened for any other relevant studies.

We used the ICF model, an accepted universal language and framework, to describe and classify concepts

and domains as well as regions of overlap.40, 41 Outcomes are classified into two main categories: 1) Body

Function and Structure and 2) Activity and Participation. Functional recovery following hip fracture can be

determined through improvements that are physical, mental, social and vocational in nature.60 For the

purpose of evaluating functional outcomes we further excluded those trials with only radiographic

parameters (Category 1) such as x-ray analysis to observe unions or non-unions.

Trials, which contained outcomes related to Activity and Participation (Category 2), were further separated

into three intervention groups: surgical, medical or rehabilitation. Surgical trials included intracapsular

fractures treated with conservative versus surgery, surgical techniques, internal fixation device A versus B,

24

replacement versus fixation, arthroplasty. Surgical trials included extracapsular fractures treated with

conservative versus surgery, surgical techniques including condylocephalic nails (Ender nails etc),

cephalocondylic nails (Gamma nail etc) and arthroplasty. Specific medical therapies included

supplementation and the use of drug therapy. Medical therapy trials also included secondary prevention

including the use of antibiotics, wound drapes, surgical drains, and thromboembolic prophylaxis.

Rehabilitation trials included those evaluating interdisciplinary programmes with early supported

discharge, gait retraining, mobility exercise, nutritional supplementation, patient education and early

mobilization, prevention including fall prevention and education on how to fall, use of protective devices,

modification of the environment and prophylactic infection of osteoconductive materials. Discordance for

inclusion of the trials in the systematic review was resolved by discussion, if not resolved then a third party

would decide.

25

3.2 Data Extraction

All relevant data from each of the RCT studies were abstracted by two investigators and re-checked for

accuracy. The data included (1) first author, (2) year of publication, (3) total sample size, (4) intervention

type, (5), maximum follow-up, (6) intention-to-treat (7) domains measured, (7b) instruments used, (8)

references of functional outcome instruments, (9) mode of administration, and (10) description of outcome

instruments. An investigator-developed instrument was defined either as a development of a scale, without

prior validation, or modifications to an existing instrument for the specific randomized trial evaluating the

aged with hip fractures. An investigator-developed instrument also includes item omissions as well as

changes to wording and scoring systems.

3.3 Detsky Quality assessment scale

Author, institution, journal, country, industry and other sources of funding were blinded by one of the

reviewers using Adobe. The quality of the trial was then assessed independently by a second reviewer.

Each eligible study was assessed for quality using the 21-Detsky scale.61 The Detsky scale was chosen over

other commonly-used scales, such as the Jadad and Chalmers scales, because there is less emphasis on lack

of blinding which would have penalized the surgical trials included in our review.62 The Detsky scale

focuses on five principle areas: randomization, outcome measures, inclusion and exclusion criteria,

interventions, and statistics (Appendix 2). Assessment of randomization includes the determination of

whether randomization was carried out, whether it was described in sufficient detail, and whether the

randomization was conducted in a concealed manner. The article is then evaluated on whether objective

outcome measures were used, whether the outcome measures were adequately described, and whether the

outcome assessor was blinded to treatment group. With respect to the inclusion and exclusion criteria,

studies will obtain a better score if these criteria are clear and well defined and if an inclusion and exclusion

report was maintained during the course of the trial in order to assess for sampling bias. Description of both

the control and treatment groups was also evaluated. And statistical analysis focuses on whether the

investigators performed a sample size calculation prior to trial initiation and whether the P value was stated

26

and the analysis was conducted properly. A maximum of 20 points could be obtained for positive trials and

21 points for negative trials. In the case of negative trials, subsequent questions regarding post hoc power

calculations or confidence intervals were asked. As was done in previous trials, the scores between positive

and negative trials were converted to percentages for purposes of comparison.63, 64 Studies scoring more

than 75% on the Detsky Scale were designated high quality and those studies scoring less than 75% on the

Detsky scale were designated low quality.

3.4 Statistical analysis

The Kappa coefficient was used as a measure of chance-corrected agreement between raters in the

screening phase of selecting the articles for appraisal.65, 66 Kappa > 0.65 was considered good agreement.67

Analysis of variance was used to compare the differences in mean quality scores among the surgical,

medical and rehabilitation trials (SAS software version 9.1). The mean score per area of the Detsky scale

was determined to identify areas of weaknesses. A chi-squared test was used to evaluate the number of

publications with functional outcomes (Activity and Participation) over time (SAS software version 9.1).

The level of all tests was set at 0.05 and reported with 95% confidence intervals.

3.5 Results

In the electronic databases, 2451 citations were found when limited to randomized controlled trials

performed on subjects with a mean age of 65 (Figure 1). Of these, 551 studies appeared potentially relevant

and were retrieved in full. Application of the criteria for eligibility further eliminated 254 articles: 72 were

not only hip fractures, 57 articles did not focus on post-operative outcomes, 29 were not RCTs, 27 articles

were not in the English language, 18 articles did not include the aged and 24 remaining articles were

duplicate and not original trials. Of the remaining 211 articles, 122 (58.7%) articles only contained

radiographic measures of impairment (Category 1). Eighty-six (41.3%) unique trials contained measures of

Activity and Participation (Category 2) were included and assessed for quality (Appendix 3): 71 (83%)

were identified from computerized database searches; twelve (14%) from bibliography searches and three

27

(3%) from content experts (Appendix 4). The agreement in selecting these articles was very high (kappa,

0.92; 95% confidence interval, 0.87 to 0.98).

28

Figure 1 Search strategy conducted from 1950 to 2007 (week 38)

Initial Literature Search “Hip Fracture”

N=2451

Cross references & Content experts

N=15

Excluded Using Database Search

Filters Not RCT, English, Mean

age > 65 yrs N=2034

Potentially Eligible Trials N=551

Eligible Trials N=211

RCT (Category 1) Trials Radiographic Parameters

N=122

Duplicate Publication N=3

Excluded N=254 72 other fx types

57 prevention-fracture risk 29 not RCTs

27 not English 18 age

24 duplicate

RCT (Category 2) Trials Functional Outcome Measures

N=86 71 Database search 12 Cross references 3 Content experts

29

3.5.1 Study Characteristics

Nine trials had negative outcome results and thus were scored out of 21 points. Fifteen trials included

intention-to-treat analysis (Table 2). Follow-up ranged from 7 days to 13 years. The studies varied in

follow-up periods, the majority of the trials (15.1 %) had a maximum of 2 years follow-up, 17.4% of the

trials had a maximum 6 months follow-up, and 8.1% of the trials had a maximum follow-up of 3mths. The

mean completeness of patient follow-up was 84.1% (20% - 100%). Forty studies concealed randomization

to the investigator with the use of sealed envelopes. The mean patient sample size was 162 (43-455

patients) for surgical trials, 97.7 (18-243 patients) for medical trials, and 118 (24-319 patients) for

rehabilitation trials.

3.5.2 Quality Assessment

86 studies were included for quality assessment (Appendix 3). Twelve of these were retrieved by screening

the references of the included trials. 59 (68.6%) trials scored more than 75% in quality (30 surgical, 11

medical and 18 rehabilitation trials) and 27 (31.4%) trials scored less than 75% in quality (18 surgical, 2

medical and 7 rehabilitation trials). The mean score (and standard error) for the quality of all the

randomized trials was: 75.8% ± 1.76% (95% confidence interval, 72.3-79.3%). Medical trials had a higher

mean quality score than did surgical trials (83.7% compared with 72.7%, p=0.025). Quality assessment

ranged from 25% to 95%. The three areas of poorest quality were: poor reporting of treatment group

concealment to the investigator (62% of the 86 trials), no blinded outcome assessment (63% of the 86

trials) and no inclusion of an a-priori sample size calculation (71% of the 86 trials). The four areas of best

quality was (3a) inclusion/exclusion criteria well defined, (4a) the therapy was fully described for the

treatment group and (4b) the therapy was fully described for the controls and (5b) the statistical analysis

was appropriate.

30

Table 2 Characteristics of the 86 included studies listed in alphabetical order

Eligible Randomization concealment

Blinding Follow-up Withdrawals/ Dropouts/ Excluded patients

ITT Quality

Author (year)

Yes Yes Method Patient Outcome Assessor

% Yrs Reported Yes %

Adams (2001)68 X X Envelope ? ? 80 1 X 95 Ahrengart (2002)69 X X Envelope ? ? 87 3 X X 80 Bach (2004)70 X X ? X ? 78 1 X X 90 Bachrach-Lindstrom (2000)71

X ? ? No No 75 1 X X 75

Baker (2006)72 X X Envelope ? ? 85 3 X 90 Bal (2005)73 X X Shipping of

implants X X 89 2 Not

reported 30

Bannister (1990)74 X No No No No 65 1 X 30 Binder (2004)75 X X ? ? ? 76 6mths X X 80 Blomfeldt (2005)76 X X Envelope X No 95 4 X 90 Blomfeldt (2005b)77 X ? Envelope ? ? 80 2 X 85 Blomfeldt (2007)78 X X Envelope No Yes 82.5 1 X X 85 Boonen (2002)79 X ? ? X ? 99 6mths None 65 Bruce (2003)80 X ? ? ? ? 92 2mths X X 52 Burns (2007)81 X No No No No 90 6mths X 85 Calder (1995)82 X ? ? ? ? 67 6mths X 65 Calder (1996)83 X X ? ? ? 80 2 X 75 Cornell (1998)84 X X Envelope X X 98 6mths X 90 Crotty (2002)85 X X ? No X 100 4mths Not

Reported 81

Crotty (2003)86 X X Envelope ? X 99 1 X 90 Davison (2001)87 X X ? ? Yes 100 3 ? 55 Desjardins (1993)88 X ? ? ? X 85.8 1 X 65 Di (2007)89 X X ? ? No 100 1mth None 45 Dujardin (2001)90 X ? ? No ? 80 6.5mth

s X 65

Efstathopoulos (2007)91

X X Envelope ? ? 96 6mths X 85

El-Abed (2005)92 X ? Admission ? ? 78 3 X 60 Espaulella (2000)93 X X Envelope X ? 93 6mths X 95 Foss (2005)94 X X ? X X 99 5days X 95 Hagsten (2004)95 X X Envelope X X 85 2mths X 95 Hagsten (2006)96 X X Envelope X X 75 2mths X 90 Hallan (2006)97 X X Envelope ? X 70 2 X 86 Hardy (1998)98 X No Even/odd No ? 70 1 X X 65 Hardy (2003)99 X ? ? No ? 80 3 X 70 Harrington (2002)100 X X Envelope ? X 80 1 X 85 Hauer (2002)101 X ? ? X X 80 3mths X 70 Hedstrom (2002)102 X X Envelope No ? 1 X X 75 Huang (2005)103 X X ? X X 89 3mths X 85 Huusko (2002)104 X X Envelope X X 88 3mths X X 95 Janzing (2002)105 X X Envelope No No 72 1 X 85 Johansson (2000)106 X X Envelope No No 90 2 X 75 Johansson (2006)107 X X Envelope ? ? 71 2 X 85 Jonsson (1996)108 X X Envelope No No 75 2 X 45 Karn (2006)109 X No No No No 100 6mths None 55 Keating (2005)110 X X ? No No 85 2 X 95 Keating (2006)111 X X No No X 63 2 X 95 Kim (2005)112 X X ? ? X 100 2 X 85 Krichbaum (2007)113 X X Envelope ? Yes 69 1 X 80 Kuisma (2002)114 X X Envelope X X 92.6 1 X 80 Lamb (2002)115 X X Envelope X X 88 3mths X 95 Lauridsen (2002)116 X ? ? ? Yes 58 ? X X 57 Mangione (2005)117 X X ? ? X 80.4 3mths X 85 Mattsson (2005)118 X X Envelope ? X 83 6mths X X 90

31

Mattsson (2006)119 X X Envelope ? ? 36.4 2 X 85 Miedel (2005)120 X X Envelope ? Yes 97 1 X 90 Moroni (2004)121 X X Envelope ? ? 99 6mths X 70 Naglie (2002)122 X X Envelope X X 99.6 6mths X 80 Neumann (2004)123 X X ? X X 100 3mths None 80 Oldmeadow (2006)124

X ? ? ? ? 100 7 days Not Reported

65

Oude Voshaar (2006)125

X ? ? X X 64 6mths X X 55

Ovesen (2006)126 X X Envelope ? ? 96 6mths X 95 Pajarinen (2005)127 X X Envelope ? ? 70 4mth X 80 Papasimos (2005)128 X ? ? ? ? 85 1 X 65 Parker (2000)129 X X Envelope No No 100 3 X X 80 Parker (2002)130 X X Envelope No ? 99 3 X 90 Peterson (2004)131 X ? ? ? X 99 1 X 75 Peyser (2007)132 X X Envelope No No 82 1 X 85 Raia (2003)133 X X ? X ? 95 1 X 80 Ravikumar (2000)134 X No Day of

week No No 93 13 X 70

Resnick (2007)135 X X ? ? ? 74.5 1 X X 75 Rogmark (2002)136 X X Envelope ? No 100 2 ? X 80 Santini (2005)137 X No Alternate

days ? ? 74.5 1 Not

Reported 25

Sherrington (2003)138

X X ? ? ? 96 2wks X X 86

Sherrington (2004)139

X X Envelope ? No 95 4mth X X 85

Shyu (2005)140 X No ? ? No 80 3mths X 70 Sikorski (1981)141 X No Card from

box ? ? 20 2 Not

Reported 55

Soreide (1976)142 X No Date of Birth

? ? 64 2 X 65

Stenvall (2007)143 X X Envelope ? No 84.2 1 X 85 Tidermark (2003)144 X X Envelope ? X 80 2 X 80 Tidermark (2004)145 X X Envelope X X 88 1 X 95 Tinetti (1999)146 X X ? ? X 100 1 X 85 Tsauo (2005)147 X ? ? ? ? 46 6mths X 55 Utrilla (2005)148 X X Envelope X X 95 1 X 80 Van der Lely (2000)149

X X ? X X 68.5 6mths X 95

Van Dortmont (2000)150

X X Envelope X X 100 2 None X 80

Van Vugt (1993)151 X ? ? ? ? 98 3 X 55 Vidan (2005)152 X ? X ? 97 1 X 76 Vidyadhara (2007)153 No ? ? ? Yes 90 2 Not

Reported 35

? - Unclear

3.5.3 Outcome instruments

Over time there was a trend, although not significant, in the increase in the use of instruments assessing

function in the aged with hip fractures (chi-square = 2.904, df = 1, p=0.09) (Figure 2). We found

considerable variability in the selection of outcome measures across studies (Table 3). Of 86 included

studies, 48 trials assessed radiographic parameters, 29 trials gait pattern, 19 trials pain, 14 trials gait speed,

34 trials activities of daily living. 14 trials at most used the same instrument, the Katz- ADL Index45 and 16

trials at most used the same impairment-based scoring system, the Harris hip score154 and 10 trials used

32

Parker’s New mobility score155 (Figure 3). The utlity measure, EQ-5d, was used in 8 trials and the health

status measure, the SF-36 was used in 10 trials.

The instruments, found in each Category 1 (N=122) and 2 (N=86) randomized trials, were classified

according to the ICF concepts (Figure 2). The 86 randomized controlled clinical trials contained measures

of Body Function and Structure as well as Activities and Participation (Category 2 trials). Seven out of the

86 RCTs were rehabilitation trials which only contained measures of Activities and Participation.

Figure 2 A comparison of Category 1 trials only assessing radiographic parameters versus Category 2

Trials with Measures assessing Activity and Participation over the last two decades

33

Figure 3 The number of times a scale is used in one of the 86 RCTs and showing the most commonly-

used scales.

High quality Randomized Controlled Trials

In the 59 included trials, which scored above 75% on the Detsky scale, 70% of the surgical trials used

composite scores. Eight trials used the Euroqol-5d (EQ-5D) 156 for utility and 8 used the Short-Form 36

(SF-36)157 health status measures. Of the SF-36 health status measures, one additional trial was found

that reported both the chronic and acute form of the SF-36 and one trial used only the physical function

subscale of the SF-36. Three out of the nine trials reported the SF-36 summary scores such as the

Mental Component Score and Physical Component Score. Eight high quality trials at most used the

same impairment-based composite score: 8 (13.6%) trials used the Harris hip score (HHS), the

Charnley hip score158 (6 trials), Oxford hip score159 (1 trial) and the Hip rating questionnaire160 (4

trials). The impairment based composite scores were predominantly used in surgical trials with the

HHS also used in two medical trials. 13 (22.0%) high quality trials used the Katz ADL Index and 7

(11.9%) trials used Parker’s New mobility score. 3(5%) high quality trials adopted investigator-

developed instruments (Figure 5).

1614

10 10 86

0

5

10

15

20

Harris hipscore

ADL-Katz New MobilityScore -Parker

SF-36 EQ5D TUG

Commonly used instruments

Nu

mb

er o

f R

CT

s in

clu

din

g t

he

scal

e (N

=86

)

34

Figure 4 The number of RCTs (N=208) that measure each ICF concept

Low quality randomized controlled trials

27 trials scored less than 75% according to the Detsky scale, and were considered low quality. 5 trials

scored in the intermediate range of 70-75%. Only 6 trials included measures of health status (SF-36,

Nottingham health profile161, Sickness Impact Profile188 and the WHO-QOL BREF162). The majority of

the low quality trials (N=10) used the Harris hip score and 8 (30%) used investigator-developed

instruments (Figure 5). None in the intermediate quality assessment range adopted investigator-

developed instruments.

Number of RCTs measuring ICF concepts

59%

41%

3%0%

10%20%30%40%50%60%70%

Only Body Function &Structure

Both BodyFunction/Structure &Activity Participation

Only ActivityParticipation

ICF concept

Num

ber

of R

CT

s m

eas

urin

g

ICF

con

cep

ts (%

)

35

Figure 5 The prevalence of investigator-developed instruments in low and high quality trials.

36

Table 3 Data extraction of outcome instruments fielded in the included trials that met acceptability criteria.

Outcome categories

Instruments Instrument named/described Method of administration described

Cited reference on psychometric properties/protocol

Assessed but no details given Total number of studies

Radiographic parameters (union, fracture healing, reduction, implant positioning)

X-ray, RSA a)implant positioning [118H,77H,127H,144H,69H]; TAD [118H,99L,100H,68H,121L,153L]; Miedel’s classification mod. [126H];b) Cementing grade [72H]; c)acetabular erosion [72H,87L]; d)avascular necrosis Ficat[118H] e)osteolysis[73L]f)loosening [87L] g)scintimetry [151L]

n/a 19[126Ha,72Hb,118Ha,d,77H,127H,144H,99L, 69H,100H,68H,150H,78H,132H,112H,148H,73L, 121L, 87L,153L]

29 [91H, 148H,72Hc, 107H, 111H,120H,133H,130H,136H,90L,106H, 129H,83H,84H,110H,141L,119H, 134L,145H,101L,109L,137L,98L,74L, 82L,88L,128L,92L,108L]

48

Bone mineral density

a)Dual energy x-ray absorptiometry; b)single energy photon absorption c)Qct d)ultrasound

Singh index [90L, 112H,101L,153L, 105H];4[145Ha,79La,104Hb,102Ha, c,d,75Ha]

n/a n/a n/a 10

Range of motion

Goniometer Merle D’aubigne [119H] 4[76H,69H,119H,140L,147L]

1[119H] 8[76H,69H,129H,84H,141L,140L, 98L,147L]

9

Balance a)step test; b)postural stability c)functional reach d) force plate e)berg balance scale f)activities-specific balance confidence scale g)progressive Romberg test h)timed single-stance-limb stance

8[115Hb,81Hc,131Hc,139Ha-d, 85He,f,75He,g,h, 125Lc,101Lg]

2[115Hb,139Ha-d] 7[115Hb,81Hc,139Ha-d,85He,f,75He,g,h,125Lc,101Lg]

1[101Lg] 8

Gait pattern a) Use of walking aid/walking ability/ Functional ambulation classification, b)walking distance c) New mobility score

16 [127Ha, 91Hc, 48Hc, 99Lc, 130Hc, 105H, 129Hc, 94Hc, 93Ha,c, 152Ha,139Ha,c,114Ha, 101La,99Lc,98Lc,74L]

11[72Hb,76H,130Hc,68H,78H,94H, 104Ha,93Ha,c,152Ha,139Ha,c,108La]

11 [91Hc, 148Hc, 99Lc, 130Hc, 129Hc, 94Hc, 93Ha,c, 152Ha, 139Hc, 99Lc, 98Lc]

17[126H,76H,69Ha, 100Ha,68H, 83H,78H, 104Ha, 114Ha, 101La, 116La, 74L, 124La,b, 88L, 108La, 105H, 151L]

29

Pain Visual Analogue Score b)Mobility pain subscale c) SF-McGill pain

12[118H,119H,132H,104H,102H,115Hb, 81Hc,d, 101L, 125Ld, 140Le, 89L, 105H]

4[118H,99L,132H, 105H] 9[118H,119H,132H,102H,115Hb,81Hc,d,101L, 125Ld, 140Le]

9[99L,69H, 136H, 100H,98L, 74L, 89L, 108L, 151L]

19

37

Outcome categories




questionnaire d)Wong-Baker pain rating scale e)numerical scale

Grip strength Dynanometer (Jamar)

2[145H,79L] 2[145H,79L] 1[145H] 1

Hip muscle strength

Dynanometer; b) Leg extensor Power rig c)1RM d)spring balance e) lateral step-up ability f)MICROFETZ

10[118H,119H,115Hb,117H,131H,139Hd,e,146Hc, 140Le, 101Lc, 147L]

2[139Hd,e,140Le] 7[117H,115H,131H,139Hd,e,146Hc,101Le,147L]

1[140Le] 10

Performance a) Timed up and go test; b) Functional performance battery c)PPME d)modPPT e)POMA f)step height g)chair stand h)stair climbing

11[84H,70Hb,81Ha,131Ha,86Ha,139Hc, 85Ha, 146Hb, 75Hd, 125Lb, 101La,e-h]

n/a 8[84H,81Ha,131Ha,86Ha,139Hc,85Ha, 125Lb, 101La,e-h]

3[70Hb,146H,75Hd] 10

Gait speed 6 minute walk test, 3m, 4m, 6m, 10ft, 50ft, free/maximal, step count, S-COVS-walking ability item, step length

14[84H,79Lb,102H,115H,81H,117H,131H, 86H, 75H, 135H, 143H, 125L, 101L, 147L]

3[84H,86H,135H] 8[79Lb,102H,81H,131H,89L,143H,125L,101L]

3[84H,75H,147L] 14

Activities of Daily Living a) Barthel Index, b)

Katz Index c)OARS-Fillenbaum d)klein bell e)CBI

29[149H, 118H, 76Hb, 77Hb, 120Hb, 144Hb,78Hb,145Hb, 104Hb,102Hb,98Ha,71Hb,93Ha,9a,152Hb, 86Ha, 104Hb, 85Hb, 122H, 146Hc, 75Hc, 143Hb, 95H, 25L, 140Le, 80Lb, 101La, 87La]

15[149H, 118H, 76Hb, 77Hb, 144Hb,78Hb, 145Hb,104Hb,71Hb,93Ha,85Hb,122H, 146Hc,140Le]

25[76Hb,77Hb,120Hb,144Hb,78Hb, 145Hb,104Hb,102Hb,98Ha,71Hb,93Ha, 103, 152H, 86Ha, 104Hb, 85Hb, 122H, 146Hc, 143Hb, 95H, 140Le, 80Lb, 101Le, 87La]

7[129H,150H,119H,112H,75Hc,125L,108L]

34

Instrumental activities of daily living

a)Lawton b)Hamrin c) klein bell

7[104Ha,71Hb,104Ha,122H,95Hc,140La,101La]

4[104Ha,71Hb,122H,140La] 7[104Ha,71Hb,104Ha,122H,95Hc,140La, 101La]

7

Functional Index Measure Mobility subscale [123H]

1[123H] 1[123H] 1[123H] 1

Disability Rating Index Modified

2[96H,95H] 2[96H,95H] 2

Total functional score

1[137L] 1[137L] 1

38

Outcome categories




Musculoskeletal Function Assessment Mobility subset [133H]; ADL subset [133H]

1[133H] 1[133H] 1[133H] 1

WOMAC modified 1[109L] 1[109L] 1[109L] 1 Impairment-based composite scoring systems Oxford hip score 1[72H] 1[72H] 1[72H] 1 Harris hip score;

Modified [109L] 17[68H,106H,83H,78H,134L,112H, 148H,102H, 71H, 109L, 73L, 121L, 87L, 153L, 82L, 89L, 147L]

1[68H] 12[68H,78H,112H,148H,71H,109L, 73L,121L, 87L, 153L, 89L,147L]

5[107H,106H,83H,134L,82L] 18

Hip rating questionnaire; Modified [111H, 110H, 75H]

4[111H,84H,110H,75H] 1[110H] 1[84H] 3[111H,110H,75H] 4

Matta hip functional score

1[92L] 1[92L] 1

Stinchfield’s hip assessment system

1[142L] 1[142L] 1

Functional status index

1[113H] 1[113H] 1[113H] 1

Charnley’s score Pain subscale [130H,129H]

6 [76H, 77H, 120H, 144H, 130H, 129H]

4[76H,77H,144H,130H] 6[76H,77H,120H,144H,130H,129H] 6

Van Vugt score 1[151L] 1 Salvati and Wilson 2[90L, 128L] 2[90L,128L] 2 Generic health-status

SF-36; a) Physical function subscale; b)acute & chronic form

12 [117Ha, 72H, 118H,133H,103H, 131Hb, 86H,85H, 75H, 121L, 140L, 92L]

4[72H, 118H,133H, 140L] 10 [72H,118H, 133H, 103H,117Ha, 86H,85H,75H140L, 92L]

1[131Hb] 12

Sickness Impact Profile; a) Mobility and activity subscales

3 [70H,125La,92L] 3[70H, 125La, 92L] 3

WHO-QOL Bref 1[147L] 1[147L] 1[147L] 1 Functional Status

Questionnaire a) ADL subscale

1[75Ha] 1[75Ha] 1

London Handicap Scale

1[85H] 1[85H] 1[85H] 1

SWED-QUAL 1[96H] 1[96H[ 1 Nottingham Health 1[82L] 1[82L] 1

39

Outcome categories




Profile Utility Euro-QOL 5D 8[111H, 76H, 77H, 120H, 144H,

110H, 78H, 145H] 5[76H,144H, 110H, 78H, 145H]

8[111H, 76H,77H, 120H, 144H, 110H, 78H, 145H]

8

Social role functioning

Social Activity Scale

1[146H] 1[146H] 1[146H] 1

Exercise tolerance

Yale physical activity survey a)Exercise subscale

1[89La] 1[89La] 1[89La] 1

*H – denotes high quality trials scoring > 75% on the Detsky scale, L denotes low quality trials scoring < 75% on the Detsky scale; N.B. Trials with only measures of impairment have been excluded from this systematic review

40

3.6 Discussion

We found 86 studies of which 59 were deemed to be high quality, scoring more than 75% on the Detsky

scale and included measures with domains under Activity and Participation. Furthermore, quality

assessment of the trials revealed that high quality trials (those scoring more than 75% on the Detsky scale)

relied less on investigator-developed instruments. These were more commonly found in low quality trials

which also reflected on their poor study conduct seen in the quality scores. Investigator-developed

instruments typically are those that were designed for the specific purpose of the trial and often lack proper

references to the development of the instrument. The development of the instrument would contain

information regarding its conceptual basis as well as its primary purpose and target population which will

be further discussed in Chapter Four.

Our results indicate that generic health status measures and measures of utility are tested more and

standardized than measures based on performance such as the TUG or impairment such as the HHS or even

single domains such as Activities of Daily Living. Clinical trials, especially those with surgical intervention

as the primary research question, were dominated by impairment-based scoring systems. Impairment-based

scoring systems can be found in rehabilitative trials; whereas, performance-based instruments are not

typically found in surgical trials. This indicates the future possibility of utilizing impairment-based scoring

systems across different trial types. Indeed, we found a positive trend in the use of functional outcome

instruments over time. Interestingly, the SF-36 contrary to our hypothesis was not widely used. An

assessment of practical characteristics defining these instruments in the following Chapter will elucidate the

benefits of these different instrument types.

3.6.1 Randomization

Most patients enrolled in the trials were assigned randomly; however, many trials did not conceal the

treatment group to the investigator. Concealment is achieved by taking adequate steps to ensure those

responsible for assessing patient eligibility for trial enrolment are kept unaware of whether the patient will

be allocated to the control or experimental group. Authors should provide adequate description of the

41

concealment of randomization results. For example, was patient enrolment randomized via a computer-

generated list and was concealed in a sealed envelope just prior to surgery?

Furthermore, in order to preserve the effects of randomization, it is important that patients are followed and

evaluated within the group to which they were initially allocated, regardless of whether they received or

completed the intended treatment. This is an intention-to-treat analysis. If only 17% (15/86) of the trials

explicitly stated an “intention-to-treat” analysis, it leaves us to suspect the possibility of crossover and

drop-out biases.

Data collection often was not blinded, presumably due to obvious radiographic or other evidence indicating

which treatment was received for a given patient. Subjective assessments are always construed as having

some degree of bias when the assessor is not blinded to the treatment allocation. This is challenging but

there are several new studies that suggest ways to overcome this. For instance, leaving personnel unaware

of the study hypotheses or masking the implant on radiographs could ensure that this criterion is met.163

3.6.2 Sample size calculation

We also found that the majority of the trials neglected inclusion of an a-priori sample size calculation. It is

believed that the larger the sample size for the trial, the larger the number of outcome events and the greater

our confidence that the true relative risk reduction is close to what is observed.164 However, the smaller

patient populations typical of medical trials could serve their purpose of providing confident, definitive

answers to important questions. In this scenario, it is the number of events rather than the number of

patients that count.165 Even these small, indeterminate trials (with large confidence intervals) can prompt us

to question conventional but untested therapeutic wisdom. Furthermore, these small trials even when

individually inconclusive can serve as the basis for convincingly conclusive overviews and meta-analyses.

True enough, in order for these trials to be considered they must be registered at their inceptions rather than

left unreported at their conclusions.166 The selection of appropriate population for estimating the sample

size is important and should in principle be made based on the primary endpoint.

42

3.6.3 Identification of outcome

Primary and secondary outcomes should also be clearly defined and when applicable any methods used to

enhance the quality of the measurements. An example of what should be described, are multiple

observations and training of assessors.167

3.6.4 Follow-up

In order for the results to be valid, the study must have a sufficiently long and complete follow-up. The

number of patients lost to follow-up can potentially impact the validity of an article. If the information is

not available on the prognosis of patients lost to follow up and they represent a significant proportion of the

studies’ overall population, it is impossible to estimate the overall success of the intervention.164 It has been

reported that if less than 5% of the patients were lost to follow-up the effect on the outcome is considered

minimal or negligible and if 20% or more of the patients are lost to follow-up the validity of the trial is

significantly threatened.165 However, one should consider all patients lost to follow-up as the worst-case

scenario and re-analyze the results with those patients’ added as failures or adverse events. If this

significantly alters the results, then the validity can be considered poor and it is important to determine the

magnitude of the treatment effect. This is especially important, because the way in which the results are

presented can alter the impact of the intervention making the treatment effect look very small or quite large.

3.6.5 Statistical methods

The majority of trials reported p values and in the case of negative outcomes the confidence interval. These

are two frequently encountered statistical measures of study precision. The p-value is the probability that

the treatment effect can occur in a long run of identical trials as a result of chance alone. If the p-value is

less than or equal to 0.05 it will be statistically significant. Although useful, the p-value does not convey

the range over which the effect can possibly occur.168 The latter is addressed with the concept of the

confidence interval, which is the range of values within which we can be confident that the true value for

the whole population lies. Typically, a standard confidence interval of 95% is reported. This means that

43

there is 95% certainty that the true value of the measured variable lies within the stated range. As the

sample size of a trial increases so does the number of events and subsequently the 95% CI interval should

get smaller.22

3.7 Summary

Consistent with other systematic reviews including surgical trials, there are particular deficiencies in the

published hip fracture literature including poor concealment of allocation, sample-size calculations, and

very few explicitly report an intention-to-treat analysis in the trial. There is also limited information on

patients who withdrew or for whom follow-up was incomplete. Overall, medical trials are more rigorous

and have better quality scores than either rehabilitation or surgical trials. The latter may be due to the

facilitation of blinding following drug treatment in the hip fracture trials. Rehabilitation trials have the

shortest follow-up endpoint.

In the years of publication between 1950 and 2007, there had been an increase in the use of instruments

assessing function in the aged with hip fractures. Based on the evidence gathered from this systematic

review, it is clear that very few randomized clinical trials use the same index or questionnaire. Measures

such as the SF-36, contrary to our hypothesis, were not commonly used in hip fracture RCTs. Furthermore,

the use of investigator-developed instruments are used predominantly in trials scoring less than 75% on the

Detsky scale when compared to those trials scoring more than 75% in the latter. In the next chapter, the

characteristics of the instruments will be explored to elucidate what clinicians have been quantifying as

functional recovery in hip fracture trials.

44

Keypoints from Chapter Three

• An RCT can avoid selection or confounding biases and provide an objective basis for quantifying study outcomes

• 86 unique trials were identified and assessed using the 21-point Detsky quality scale

• Overall, there were poor reporting of treatment group concealment to the investigator, no blinded outcome assessment and no inclusion of an a-priori sample size calculation

• There were more investigator-developed instruments in the low quality trials when compared to the high quality trials

• There was an increase in the use of functional outcome instruments over time, although standardized measures such as the SF-36 was not widely used

45

CHAPTER 4

Instrument Characteristics: Pragmatic considerations

4.1 Introduction

Selection of the most appropriate measurement tools involves pragmatic as well as psychometric

considerations.25, 169 The pragmatic aspect of an instrument is defined as having the advantages of easy

administration to either respondent or examiner in addition to a simple score that can be interpreted in the

aged following a hip fracture. In a recent review of the evidence on measurement tool appraisal in geriatric

rehabilitation, Auger et al coined the term ‘applicability’ of a measurement tool for a given population or in

a specific context, defined by four dimensions: respondent burden, examiner burden, score distribution and

format compatibility.169 We believe that the most commonly used functional outcome instruments found in

randomized hip fracture trials have good pragmatic qualities. The objective of this chapter is to examine the

characteristics of the measurement tools found in RCTs, which in turn could assist clinicians and

researchers in creating proper EBM care maps. EBM care maps can be made once an appropriate

instrument is found to be applicable and psychometrically tested such that treatment effectiveness can be

measured and the burden of a hip fracture followed over time. An instrument which ranks high in

applicability will achieve minimal burden to the respondent and examiner, adequate score distribution and a

format that is compatible with the respondents’ age, culture, language and abilities.169

4.2 Methods

We extracted the measurement tools assessing functional outcomes from 86 randomized clinical trials

identifying the aged undergoing postoperative hip fracture treatment and rehabilitation. Furthermore, the

text and reference lists of the identified articles were screened in order to compile relevant literature on the

instrument used in the RCT. Consequently, those instruments that were developed for the unique purpose

of the trial and not cited in the references were termed as ‘investigator-developed’ and excluded. Should the

instrument not reference the original development article, the reference cited by the authors was still used.

The latter would have been an indication of either poor availability of the instrument to the authors

46

conducting the trial subsequently leading to a poor decision on the part of the clinicians or researchers to

include the instrument without obtaining complete information. We extracted the following characteristics

of the instrument: 1) Source or year of initial publication 2) Purpose or stated purpose for the development

of the instrument and the intended population 2) Description including the number of items in the index,

number of components 3) Response options whether dichotomous or use of a visual analogue score 4)

Adequate scoring distribution 5) Respondent burden 6) Examiner burden and 7) Format compatibility.

Items 4-7 will take into consideration the recommendations by Auger et al169. We also compared the

applicability of the instrument used in the hip fracture trials to note any modifications of the original scale

or index. We then rated overall measures in the categories of performance-based, activities of daily living,

impairment-based and multi-attribute aggregate scoring systems based on their applicability in hip fracture

trials. 10 points were allotted to poor applicability of the measure, 5 points indicated moderate applicability

and 0 points indicated good applicability in each of the four dimensions further described below.

According to Auger,169 the definitions used to assess the pragmatic criteria or applicability of measurement

tools in the hip fracture literature were the following:

Respondent burden components include invasiveness which can be either physical or emotional in origin

towards the respondent. A low respondent burden is optimal for an instrument.169 It takes into account how

long it takes to complete the questionnaire or scale or activity, whether all items are completed, what does

the individual being administered the instrument feel about the questions or tasks in regards to his condition

of having a hip fracture. Another factor to consider is the overall perception of the score by the respondent.

Examiner burden takes into account if specific training is required in administering the instrument to assess

the functional outcome in a patient with hip fracture. Are there requirements for specific props in order to

administer the instrument? How popular is the test and how often is it cited in the hip fracture literature?

Can the examiner interpret the scores and is he given appropriate detailed instructions in a standardized

fashion? Is there a cost in obtaining the instrument and are there copyright restrictions that could limit its

47

use in the particular setting? A tool with minimal examiner burden demonstrates better potential for being

applicable.169

Adequate score distribution is one indication of the applicability of a measurement tool. When the score of

scale represents a wide spectrum of behaviours or performance levels of a target population, it is applicable

to them. For continuous variables, it involves comparing the distribution of scores to the normal curve and

for categorical data, choosing a threshold value at the bottom or top of the scale representing a possible

floor or ceiling effect.169

Format compatibility refers to the appreciation of the compatibility between the target population

characteristics and the assessment format to avoid age, gender, cultural or disability biases.170 Format

compatibility components include the characteristics of the tested population (age, gender, culture,

language), abilities required to accomplish the test and format types available. Abilities required to

accomplish a test such as intellectual, emotional, physical requirements, can be empirically verified. If the

difficulty threshold is too high, there will be a large proportion of missing data. Format types include self-

report, performance-based, clinical judgment scales, as well as alternate formats (assistance, answer cards,

large print, validated translations). The availability of validated translations (language and culture) of

measurement tools further ensures an accurate interpretation of results.

The focus of many therapeutic interventions whether surgical, medical or rehabilitative, is the attention

towards the individual’s ability to perform certain activities as well as they would like to. We will adopt the

ICF scheme to identify those instruments that measure functional outcome. The latter is defined by those

measures used to quantify activities one can perform in their daily life as well as the societal impact one has

to deal with when recovering from the burden of a hip fracture. There are many composite scores that cut

across many different domains. The ICF codes will be used to express in percentages the number of

components in the composite scoring systems and classify them as either Body Function and Structure or

Activity and Participation.36 Impairment-aggregate scores are defined as those composite scores which

consisted solely of components of Body Function and Structure. Multi-attribute aggregate scores are

48

defined as those composite scores which consist of all categories in the ICF (Body Function and Structure

as well as Activities and Participation and External Factors).

The appropriate choice of a functional outcome instrument is fundamental in order to ensure that results are

obtained which reflect the patient. Should the instrument be inapplicable to those enrolled in hip fracture

RCTs, the methodology and results could be replicated but not informative. Furthermore, the difficulty in

choosing an appropriate instrument stems partially from the lack of a taxonomy that describes the practical

issues encountered in choosing these instruments for an RCT.

4.3 Results

Overall, 81 different instruments were identified measuring both Body Function and Structure and Activity

and Participation. 36 instruments were used to measure Body Structure alone such as radiographic

parameters or bone mineral density. 24 instruments measured Body Function, 13 evaluated measures of

Activity and Participation and 8 different composite scoring systems were identified. All the instruments

differ in their items and scales as well as scoring distribution and response options.

4.3.1 Measures of Activity and Participation Mobility measures include changing and maintaining body position, carrying, moving and handling objects,

walking and moving, moving around using transportation. Those measures that were performance-based

and rated by an external observer were predominantly utilized in rehabilitation trials.

Physical Performance Test171

Source: Reuben 1990

Purpose: The PPT assesses multiple domains of physical function using observed performance of tasks

that simulate activities of daily living of various degrees of difficulty. The PPT was designed to assess

strength, mobility, dexterity, and stamina through direct observation of the elderly person’s performance of

specified activities.

Description: Two versions are presented: a nine-item scale that includes writing a sentence, simulated

eating, turning 360 degrees, putting on and removing a jacket, lifting a book and putting it on a shelf,

49

picking up a penny from the floor, a 50-foot walk test, and climbing stairs (scored as two items); and a

seven-item scale that does not include stairs. The PPT was tested in a clinical practice (one Parkinson’s

disease patient), a board-and-care home, and a senior citizens’ apartment.

Response options: The PPT can be completed in less than 10 minutes and requires only a few simple

props. The test was administered by an observer according to a protocol.

Scoring: PPT scores are the sum of the 5-point scales for each of the items, varying from ‘most capable or

fastest’ to ‘unable to do’.

Respondent burden: Physical or emotional invasiveness and the respondent may not feel that the score

actually reflects their capacity to complete a task. For instance, the aged may take longer to completing a

task but end up finishing it given more time. Furthermore, the aged may feel uneasy performing these tasks

not in their home or accustomed setting.

Examiner burden: Training is required and instructions are unclear. The framing of the instructions to

study subjects is not accurately described: Were subjects aware they were being timed? Were they told to

complete the tasks as fast as they could or as carefully and accurately as they could? According to the

author, but unspecified in the listed reference: patients were instructed to complete the tasks at their usual

pace and in their usual manner and were informed that they were being timed.

Availability: environmental requirements

Popularity in hip fracture trials: A physical performance test was used in one study (Boonen 2002) and

modified because stairs were not available.

Adequate score distribution: moderate but may have ceiling effects for those who are unable to complete

all tasks or are typically immobile or inactive.

Format compatibility: Cultural and disability bias. The potential effects of sensory loss (visual and

hearing impairment) and limited education (illiteracy) were not evaluated. The format may not be

applicable to those with cognitive difficulties.

50

The timed-up and go test (TUG)172

Source: Podsiadlo 1991

Purpose: The TUG measures basic mobility skills including a sequence of functional maneuvers used in

everyday life.

Description: The TUG includes basic mobility skills, such as rising from a chair, walking 3 metres, turning

and sitting down on the same chair. The original description of the ‘get up and go’ test by Mathias173 was

later modified by Podsiadlo and Richardson172 who developed a scaling system based on the observer’s

perception of the patient’s fall risk. It describes a realistic mobility assessment including potential fall

situations, such as transfer in and out of a chair, walking and turning. Podsiadlo validated the TUG in 60

elderly patients from a geriatric day hospital.

Response options: The TUG can be completed in less than 12 seconds in community-dwelling versus a

maximum of 50 seconds for those that are institutionalized. This test requires only a few simple props. The

test is administered by an observer according to a protocol. Subjects are observed and timed from the

instant they rise from an armchair, walk 3 metres, navigate an obstacle on the floor (placed at a distance of

4 metres from the chair). Subjects wear their regular footwear and are allowed to use the arms of the chair

to get up. Subjects begin the test on the word ‘go’ and are instructed to ‘walk at a comfortable fast and

secure pace’. Subjects perform the TUG three times on the same day, after one practice trial.

Scoring: The TUG is a staff-completed functional test. The final score is the trial at which the subject is

able to carry out the test the quickest of the three timed trials. Graded on a 5-point scale (a=normal,

5=severely abnormal), a score of 3 or more indicates a person is at risk of falling.

Respondent burden: High: physical or emotional invasiveness

Examiner burden: High: Training required

Availability: environmental requirements include a chair, measuring tape, video camera (to record each

performance), and stopwatch (if using timed scores).

Popularity in hip fracture trials: The TUG was used in five trials (Crotty 2002, 2003, Burns 2007, Peterson

2004, Hauer 2001)

51

Adequate score distribution: Moderate: 3 performance groups are defined depending on duration of test

in seconds (Group 1: < 20 seconds; Group 2: 20-30 seconds; Group 3: >30 seconds)

Format compatibility: Poor: co-morbidity bias.

The 6-minute-walk test (6MWT)174

Source: Guyatt 1985

Purpose: The 6MWT is used to objectively measure the exercise capacity in patients with chronic heart

failure. Previously tested and developed for patients with respiratory disease.175

Description: The walking tests were conducted in an enclosed corridor on a 33m-long course. The patients

were instructed to walk from end to end, covering as much ground as they could during the allotted time.

At the end of the test the supervisor calls out ‘stop’ and the distance walked is recorded.

Response options: Distance covered.

Scoring: The mean score on the last four walks was used to compare with established norms.

Respondent burden: physical or emotional invasiveness. Some patients expressed dissatisfaction with the

study: ‘I felt ill after the walking tests”, “The walks didn’t seem to be beneficial”; “A doctor wasn’t

present” and dropped out.174

Examiner burden: Training is required and instructions are not standardized whether encouragement or no

encouragement should be given by the examiner.

Availability: no environmental requirements

Popularity in hip fracture trials: The 6MWT was used in two trials (Mangione 2005, Peterson 2004).

Because of concerns for patient fatigue, only one trial per patient was conducted (Mangione 2005).

Adequate score distribution: comparison with established norms

Format compatibility: Psychiatric disease bias.

52

4.3.2 Activities of Daily Living

Index of activities of daily living45

Source: Katz 1963

Purpose: The index is used to measure the functional status both in research and in daily care.

Description: The index of independence in Activities of Daily Living (Index of ADL) is a scale whose

grades reflect profiles of behavioral levels of six sociobiological functions: bathing, dressing, toileting,

transfer, continence, and feeding.

Response options: 4-point scale ranging from A) without any difficulty to D) only with the help of others.

Scoring: Each category was scored on a dichotomous rating scale (0=independent, 1=dependent).

Respondent burden: none

Examiner burden: No training required, should the patient have difficulties in reading, interviewer will

call the categories.


Popularity in hip fracture trials: Surgical Trials: Blomfeldt 2005, 2007, Tidermark 2003, 2003b; Medical

Trials: Tidermark 2004, Bachrach-Lindstrom 2000, Huusko 2002, Hedstrom 2002; Rehabilitative Trials:

Vidan 2005, Stenvall 2007.

Adequate score distribution: Each category was scored on a dichotomous rating scale (0=independent,

1=dependent).

Format compatibility: The index was first tested among patients in hospitals, where clinicians rated the

patients’ ability to perform six tasks. It is now currently in use for community dwelling, non-

institutionalized, elderly individuals, and also as a self-report measure instead of an assessment tool for

clinicians. Compatible with respondents’ age, culture, language and abilities

Barthel Index176, Modified Barthel Index177

Source: Mahoney 1965 (Original Barthel Index), Shah 1989 (Modified Barthel Index)

Purpose: The Barthel Index is designed to rate the level of independent functioning for ten ADLs in

individuals with neuromuscular or musculoskeletal disorders or other chronic disorders

53

Description: The Original Barthel Index is a 10-item staff completed generic ADL assessment. The items

include: feeding, chair/bed transfers, grooming, toilet, bathing, walking, stairs, dressing, bowels, and

bladder. The modified Barthel Index has the same ten items with the addition of wheelchair in alternative to

walking.

Response options: Values assigned to each item is based on time and amount of physical assistance

required to perform the activity. Original Barthel Index has a 3-point scale (unable, with help, and

independent). The Modified Barthel Index has a 5-point scale (unable, attempts but unsafe, moderate help,

minimal help, fully independent).

Scoring: In the original Barthel Index there is an ordinal measurement score. Weighting for chair/bed

transfers and walking were scored 0 points for unable, 5-10 with help and 15 for independent. Bathing is 0

points for unable and with help, and 5 points for independent. All other items are scored 0-points unable, 5

points= with help, and 10 points=independent. For the modified Barthel Index, unable is scored 0 points for

all items. 0, 2, 5, 8, 10 points from unable to fully independent are allocated for the items such as feeding,

toilet, stairs, dressing, bowels, bladder. 0, 3, 8, 12, 15 points from unable to fully independent are allocated

for items such as chair/bed transfers and walking. 0, 1, 3, 4, 5 points from unable to fully independent are

allocated for items such as grooming, bathing, wheelchair. A higher score correlates with greater

independence.

Respondent burden: It takes 5 minutes to complete.

Examiner burden: No training required and is designed for use by all professional staff involved with the

patient.


Popularity in hip fracture trials: 2 trials used the Barthel ADL [MT: Espaulella 2000; RT: Huang 2005]. 4

trials used the modified Barthel Index [ST: van der Lely 2000; RT: Crotty 2003, Naglie 2002, and Binder

2004].

Adequate score distribution: More options with the Modified Barthel Index.

Format compatibility: Barthel Index includes several items less likely to be affected by a fracture, such as

“feeding”, “personal toilet”, “bowel” and “bladder control”.

54

OARS scale178

Source: Fillenbaum 1981179

Purpose: The IADL Scale is a generic instrument that can be applied in persons with or without disability

and is constructed to measure instrumental ADL in detail.

Description: The IADL scale provides information about an individual’s level of independence when

performing global ADL tasks. It includes 14 items: seven items assess BADL (eating, dressing and

undressing, grooming, walking, getting in and out of bed, bathing and continence), and seven items that

assess IADL (using the telephone, travel, shopping, meal preparation, housework, taking medicine, and

management of finances).

Response options: Each item is rated on a 3-point scale: performs the activity without help (2) performs

the activity with some help (1), or completely unable to perform the activity (0). The item on continence is

rated as no problem (2), sometimes has trouble getting to the bathroom on time (1), and has catheter or

colostomy (0). Higher scores indicate a lower level of functioning in IADL such as making a phone call or

shopping.

Scoring: Three summary scales are computed by summing the scores of items: (1) a total score, the sum of

all 14 items (range 0-28), (2) a BADL score, the sum of the 7 BADL items (range 0-14) and (3) an IADL

score, the sum of the 7 IADL items (range 0-14). An ordinal score can also be computed based on the

OARS scoring algorithm, with five categories: good-excellent (no impairment), mild, moderate, severe, and

total impairment. In the Tinetti (1999) study, each self-care and home management ADL was scored 0

(does not do), 1 (does with human help), or 2 (does without human help). These scores were aggregated

into a composite self-care ADL score that ranged from 0 to 14, and a home management ADL score that

also ranged from 0 to 14. For both outcomes, a proportion of participants performed all activities without

human help (complete independence) and a proportion scored at least as well at follow-up as before the

fracture (recovered).

Respondent burden: Irrelevant questions

55

Examiner burden: A semi-structured interview is administered to the patient.


Popularity in hip fracture trials: 1 rehabilitation trial used the Fillenbaum ADL [Tinetti 1999].

Adequate score distribution: Good

Format compatibility: Good

Klein-Bell ADL index180,181

Source: Klein 1979180

Purpose: The Klein-Bell ADL scale is a generic instrument that can be applied in persons with or without

disability and is constructed to measure basic ADL in detail.

Description: It can measure a patient’s level of independence in basic ADL with a raw score or with a

weight score in 170 items. The items in the scale are divided into six dimensions: dressing, elimination,

mobility, bathing and hygiene, eating and emergency telephone use. The majority of items (162 items)

measure ADL (dressing, bladder and bowel management, mobility, hygiene, eating and drinking and using

the telephone), whereas eight items measure body function (bladder and bowel emptying, bladder and

bowel incontinence, chewing and swallowing food, swallowing liquids, verbalizing telephone messages).

Response options: Each item was rated in an empirical manner on a five-point scale with four criteria: 1)

How difficult is it for average able-bodied persons? 2) How difficult is it for the average able-bodied

person to perform this activity for someone else (to provide maximum assistance)? 3) How much time does

it take to perform this activity? 4) How injurious to one’s health would it be if the activity could not be

performed?

Scoring: A weight scheme (1 to 3) was used under the assumption that some items are more difficult, time

consuming etc than others was used. Weight 3 is given to the most complex items. Raw sum scores

obtained.




56

Popularity in hip fracture trials: 1 rehabilitation trial used the Klein Bell ADL index [Hagsten 2004]. The

authors chose 75 observation points considered relevant for hip fracture patients within 4 areas: dressing,

toilet visits, mobility and bathing-hygiene. The occupational therapist watched the patient perform each

specified activity and scored it as either failure (assistance required, 0 points) or achieved (activity

performed without verbal or physical assistance from another person, but with technical aids required). The

grading according to Klein-Bell was then used.

Adequate score distribution: A data program called the ADL diagram compiles the raw score to

recommendations in the K-B Scale Manual. Raw sum scores may make it more difficult to interpret results

as patients can get the same scores in spite of different needs of assistance.

Format compatibility: Has been translated into Swedish. Great ceiling effects obtained across different

populations.

Disability Rating Index182

Source: Salén 1994

Purpose: The DRI was originally developed for assessment of patients with back pain. The items selected

were intended to be applicable both to disability due to pain or impairment of hip and knee function and to

impairment of gross body movements due to pain and muscular or neurological deficits. The DRI has been

applied and developed in an academic orthopaedic department and in several rehabilitation units throughout

Sweden.

Description: The DRI covers 12 items concerning physical function only: dressing (without help), outdoor

walks, climbing stairs, sitting for an extended time, standing bent over a sink, carrying a bag, making a bed,

running, light work, heavy work, lifting heavy objects, participating in exercise/sports.

Response options: the DRI is a self-administered form, where patients mark a 100-mm visual analogue

scale in accordance to his/her presumed ability to perform the daily physical activities in question. The

anchor points are without difficulty = 0 and not at all = 100. The 12 items are divided into three sections: 1-

4, common basic activities of daily life; 5-8, more demanding daily physical activities; 9-12, work-related

or more vigorous activities. The questions are arranged in increasing order of physical demand, particularly

with reference to lower-back pain.

57

Scoring: An index is achieved by measuring the distance in mm between the zero points and the patient’s

markings on the VAS. The mean of these measurements provides the DRI expressed in percent of the

highest possible rating. The 12 counts may be presented in a graph as a Disability Rating Profile (DRP) and

the typical average profiles for different conditions may be calculated.

Conceptual basis: The DRI aims to assess disability as defined by the ICIDH – WHO (1980). The DRI

can be used alone or with questionnaires concerning psychological well-being, social health scales and

quality-of-life assessment scales.

Comprehensiveness: The different activities covered by the DRI represent gross movement or static

physical stress and reflect function of the spine, legs and arms. This makes the DRI applicable to common

conditions of the musculoskeletal system, such as back pain, neck pain, impairments of the shoulder, hip or

knee and muscle weakness.

Hip fracture trials: Two rehabilitation trials (Hagsten 2004, 2006) used a modified DRI. Lawton and Brody IADL scale46

Source: Lawton and Brody 1969 Purpose: The Lawton and Brody IADL Scale is an assessment of performance in eight daily activities that

are essential to maintain independence in the community.

Description: The items include telephone use, shopping, transportation use, meal preparation, housework,

laundry, medication use, and finances.

Scoring: Each item was scored either 1 or 0. A maximum score of 8 is generated if they are competent in

all eight tasks.




Popularity in hip fracture trials: 1 rehabilitation trial used a modified version of the IADL (Lawton and

Brody) including seven items instead of eight [Binder 2004].

Adequate score distribution: Dichotomous scale

Format compatibility: Good

58

Functional Status Index182

Source: Jette 1980 Purpose: The functional status index is a self-report of function which is defined as the degree of

dependence, the degree of difficulty, and the amount of pain experienced in performing specific activities

of daily living.

Description: The participant is asked to assess his or her average performance during the past 7 days in

relation to 18 tasks comprising ADLs and IADLs. Participants rate their performance in each task on three

dimensions: the difficulty they had in performing the task (a scale of 1 to 4, with 4 being the greatest

difficulty), the amount of pain they experienced while performing the task (1 to 4, with 4 being the most

pain), and the amount of assistance they needed (1 to 5, with 5 being the most) to complete the task.

Scoring: Using average scores on each dimension to score a given person on that dimension. The ‘average’

difficulty score is based on the difficulty ratings the person provides as his or her assessment of the

difficulty encountered when performing each of the 17 ADLs and/or IADLs.

Respondent burden: Irrelevant questions for hip fracture patients such as the use of public transportation

prior to surgery.

Examiner burden: None.


Popularity in hip fracture trials: 1 rehabilitation trial used the Functional Status Index (FSI) [Krichbaum

2007].


Format compatibility: Used in hip fracture patients and good indicator of cognitive ability and pre-morbid

participation.

4.3.3 Composite scores

Composite scores are composed of a combination of ICF concepts including Impairment, Activity &

Participation as well as External Factors (Figure 4). The majority of the trials used the Harris Hip score,

followed by the Charnley hip score and the Hip rating questionnaire. An individual trial used the Oxford

59

hip score and the Functional status index (Table 1). The advantages of these questionnaires directed at hip

fracture patients are their ability to detect more focused types of changes.

Figure 1 Composite scores contain different ICF concepts

Harris Hip Score154

Source: Harris 1964 Purpose: The primary focus of the score is to measure hip functionality in the clinical outcome of hip

replacement.

Description: It is 100-point scale score questionnaire with 4 major dimensions of functionality

administered by the physician.

Respondent burden: Low.



Adequate score distribution: High score indicates normal range of function. These are the scores for the

following: pain (44 points), function (47), range of motion (5) and absence of deformity (4).

Format compatibility: Widely used in hip fracture studies. However, for those patients who have no pain

prior to surgery and may be more affected in areas of participating in activities and social life, may have a

score that is not representative of them.

60

Table 1 Composite scores

Stinchfield hip assessment system (modified Judet point system)

Harris hip scale

Charnley’s classification

Matta Scoring System

Hip Rating Questionnaire

Salvati and Wilson score

Oxford hip score

Year of initial publication

Judet 1952183

Stinchfield 1957184

1969 1972 1986 1992 1973 1996

Reference to First development

184, 185 154 158 186 160 187 159

Development population

Hip Arthroplasty

Hip arthroplasty

Hip arthroplasty

Fractures of the acetabulum

THR Hip THR

Number of items

3 4 3 3 13 + VAS 4 12

Domains Pain Movement Walking

Pain Function Range of Motion Absence of deformity

Pain Movement Walking ability

Pain Ambulation ROM

Global Pain Walking Ability to perform daily functions

Pain Walking ability Muscle power and motion function

General pain level, bathing, getting in/out of car, putting on socks, shopping, walking, stairs, standing, limp, severe/sudden pain, work, pain at night

Response 6-point scale

Weighted point scale

6-point ordinal scale

6-point scale

100-point VAS for overall impact

6 possible responses for each

5 possible responses (least to most difficult)

Scoring Classified into: excellent (16 points or more), good (12-15 points), fair (9-11 points), poor (8 points or less)

Max 100 (pain 44, function 47, absence of deformity 4, range of motion 5)

Each domain graded on a scale of 1 to 6, with 1 indicating total disability and 6 indicating a normal state

Highest possible score 6, lowest score1. Clinical Grade (Total points): Excellent, 18; Good, 15-17; Fair, 12-14; Poor 3-11

Max 100 Max 40, each item considered separate

Best score, 12 ; Worst score, 60

Administration Physician Physician Physician Physician Patient Physician Patient

4.3.4 Generic measures of health status

Generic measures quantify a patient’s perception of his or her overall health state (Table 4 and 5). Many

have more than one dimension, usually distinguishing physical, emotional, and social functioning. The

primary purpose of these questionnaires is to quantify overall health rather than that related to a specific

condition. They tend to be less sensitive to changes in the specific orthopaedic disorder.

61

Health-status The Short-Form 36 (SF-36)157

Source: Ware 1992

Purpose: The SF-36 measures the effects that could be a direct function of disease and treatment (general

health).

Description: The generic tool most commonly used in the hip fracture literature is the SF-36 item

instrument. The SF-36 is a 36-item survey that measures eight domains of health: physical functioning, role

limitations due to physical health, bodily pain, general health perceptions, vitality, social functioning, role

limitations due to emotional problems, and mental health. It yields scale scores for each of these eight

health domains, and two summary measures of physical and mental health: the physical component

summary (PCS) and mental component summary (MCS). The SF-36 is available in both standard (4 weeks)

and acute (1-week) recall versions. The acute form was developed for cases in which health status would be

measured weekly or biweekly. It was created by changing the recall period for six of the eight scales [Role-

Physical, Bodily Pain, Vitality, Social Functioning, Role-Emotional and Mental Health] from ‘the past four

weeks’ to ‘the past week’. Two scales, physical functioning and General Health do not have a recall period;

the items and instructions for these scales are identical across acute and standard forms.

Scoring: All but one of the 36 items (self-reported health transition) is used to score the eight SF-36 scales.

Each item is used in scoring only one scale

Respondent burden: Irrelevant questions for hip fracture patients and length is problematic



Popularity in hip fracture trials: In the 9 RCT trials that used the SF-36, 4 were used in surgical trials and 5

in rehabilitation trials. 5 trials used the SF36, 2 trials used the summary scores and 1 trial used the chronic

and acute forms of the SF-36. 1 trial used only the physical function subscale from the SF-36.

Adequate score distribution: Good. With the release of version 2 in 1996, the norms were updated using

data from the 1998 National Survey Functional health status and norm-based scoring (NBS) algorithms

were introduced for all eight scales. NBS employs a linear T-score transformation with mean = 50 and

62

standard deviation = 10, which makes it possible to meaningfully compare scores for the eight scale profile

and the physical and mental summary measures in the same graph. Furthermore, the availability of a

scoring software minimizes the bias of incomplete questionnaires for more respondents.

Format compatibility: Used in hip fracture patients. The SF-36 acute form could be more sensitive to

recent changes in health status, but developers feel that higher mean scores resulting from the SF-36 acute

form would have implications on the norm-based interpretation of acute form scores. The benefit of this

instrument is its portability across different disease states.

Functional Status Questionnaire60

Source: Jette 1980

Purpose: The FSQ is a patient-completed questionnaire designed to provide assessment of physical,

psychological, social and role function in ambulatory patients.

Description: There are 34 core items to give a computerized report of one page. It is used for screening

and monitoring.

Scoring: The items are scored to produce 6 summary scale scores (VAS with warning zones to indicate

areas of functional disability) and single-item scores.

Respondent burden: Irrelevant questions for hip fracture patients

Examiner burden: Training implies that the questionnaire is used by doctors in primary care situations but

could also be used by allied health professionals.



Format compatibility: Any person with a chronic disabling condition (musculoskeletal, orthopaedic,

rheumatological) for community, inpatients, rehabilitation and outpatients.

Popularity in hip fracture trials: Physical function subscale of the FSQ was used in one trial Binder (2004).

63

Sickness Impact Profile188

Source: Bergner revised in 1981

Purpose: A generic, functional and quality of life measure used to evaluate the impact of disease on

physical and emotional functioning.

Description: The SIP-NH includes 136 items that patients check in two overall domains (physical and

psychosocial). There are 12 dimensions of daily activity. Physical dimensions are ambulation, mobility,

body care and movement. Psychosocial dimensions are social interaction, communication, alertness

behaviour, emotional behaviour, sleep and rest, eating, home management, recreation and pastimes,

employment.

Scoring: An overall SIP score can be computed.


Examiner burden: Training none



Format compatibility: The SIP is used for many different medical conditions.

Popularity in hip fracture trials: Bach (2004) used the SIP-NH. SWED-QUAL198

Source: Brorsson 1993

Purpose: A subjective patient-based assessment of health-related quality of life

Description: A brief Swedish HRQOL instrument translating selected measures from the Medical

Outcomes Study. It consists of 61 items that form 11 multi-item scales tapping aspects of physical

functioning, role functioning, emotional well-being, pain, sleep, family functioning and general health

perceptions. Physical functioning is assessed by limitations in a variety of physical activities, ranging from

strenuous to basic, due to health (seven items). Also includes one item concerning mobility and one item

concerning satisfaction with level of physical health. Role functioning is defined by limitations in major

role activities due to physical (three items) and mental health problems (three items). Pain (six items) was

included to capture differences in physical discomfort. Emotional well-being is assessed in terms of

64

positive and negative affect (six items are included for each). The overall sleep problems scale (six items)

addresses problems with sleep initiation and maintenance, respiratory problems, sleep adequacy and

somnolence. Family functioning is assessed with regards to satisfaction with family life (four items),

quality of relationship with spouse or partner (six items) and sexual functioning (four items). The measure

of general health perceptions taps into the patient’s own ratings of their current health (three items), prior

health (two items), resistance to illness (two items) and health concern (one item). All items are

administered using a ‘during the last week’ or ‘now’ time frame. Both positive and negative health states

are tapped by these measures.

Scoring: Items were all scored so that a high score indicated better health. The ‘don’t know’response

category was scored as intermediate between the ‘partly agree’ and ‘partly disagree’ categories and

between the ‘fairly satisfied’ and ‘fairly dissatisfied’ categories. The ‘not applicable’ response is coded as

missing data. Scale scores are constructed by summing items measuring the same construct into a single

scale score. These are then transformed linearly to a 0-100 possible range.





Format compatibility:

Popularity in hip fracture trials: 1 rehabilitation trial used the SWED QUAL (Hagsten 2006). Nottingham Health Profile161, 190

Source: Hunt 1980

Purpose: Patient-completed questionnaire to determine and quantify perceived health problems.

Description: Two parts 1) 38 weighted statements covering 6 areas (sleep, mobility, energy, pain,

emotional reactions, and social isolation); 2) 7 statements regarding specific aspects of daily life

(employment, household chores, social life, relationships, sex life, hobbies, holidays).

Scoring: Completed by Yes/No tick boxes.

65





Format compatibility: Older people, the chronically ill, pregnant women, those with limb fractures, minor

non-acute conditions, fit working men, peripheral vascular disease, osteoarthritis and a random community

sample.

London Handicap Scale191

Source: Harwood 1994

Purpose: Patient-complete interval level postal questionnaire for patients experiencing a health-related

handicap.

Description: Based on the ICIDH, it uses part utilities (a matrix of scale weights) relating to different

levels of disadvantage for 6 dimensions.

Scoring: There is a formula for combining these into an overall handicap score. The scale is meant for

comparisons between groups of subjects and problems arise if trying to make individual comparisons.





Format compatibility: All patients with a health-related handicap.

66

Table 2 RCTs measuring health-related quality of life using the SF-36 and EQ5D. Author (year) N Intervention Body function and

structure Activity Limitations and participation restriction

Composite scores

Mattson118 (2005) 112 Surgical Radiographs Global pain: “How is your hip pain today?” VAS- pain

Isometric hip abductor muscle strength ADL (6 parameters) SF-36

Baker72 (2006) 81 Surgical Radiographs

Walking distance SF-36 MCS SF-36 PCS

OHS

Raia133 (2003) 115 Surgical Radiographs Mobility subset (MFA) ADL subset (MFA) SF-36

Tidermark144 (2003) 95 Surgical Radiographs Pain (mod. Charnley)

ADL (Katz) Independently walking without aids EQ5D SF-36

Huang103 (2005) 126 Rehabilitation ADL (Barthel) SF-36

Crotty86 (2003) 66 Rehabilitation TUG MBI SF-36 PCS* SF-36 MCS*

Crotty85 (2002) 66 Rehabilitation TUG Berg Balance Scale Activities-specific Balance confidence scale MBI London Handicap Scale Use of mobility aids Physical dependence Social dependence SF-36

Mangione117 (2005) 33 Rehabilitation Lower-extremity force Mental status

Free gait speed SF36 physical function subscale 6MWT

Peterson131 (2004) 70 Rehabilitation 1RM

Gait 6MWT TUG Transfer in and out of bed SF-36 chronic & acute forms FRT

Miedel120 (2005) 217 Surgical Radiographs EQ5D

Charnley score

Blomfeldt78 (2007) 120 Surgical Radiographs Mobility ADL (Katz) EQ5D

HHS

Keating111 (2006) 296 Surgical Radiographs EQ5D Mod HRQ

Blomfeldt76 (2005) 102 Surgical Radiographs ADL Ability to walk independently EQ5D

Charnley score

Blomfeldt77 (2005b) 60 Surgical Radiographs ADL (proxy) Walking capability (proxy) EQ5D (proxy)

Charnley score

Tidermark145 (2004) 60 Medical DXA ADL (Katz) EQ5D

Tidermark144 (2003) 102 Surgical Radiographs Mobility ADL (Katz) Domestic independence ability to walk with or without walking aids

Charnley score

Keating110 (2005) 207 Surgical Radiographs EQ5D HRQ

67

4.3.4 Measures of Utility EuroQol-5D (EQ-5D)156

Source: The EuroQol Group, 1990

Purpose: A generic instrument for assessing quality of life.

Description: It identifies 243 possible health states. It is a patient-completed instrument for describing and

valuing health-related quality of life, giving a single index score for each health state measured. A 2-part

questionnaire, respondents answer five questions on their health status as measured by mobility, self-care,

main activity (i.e. work), leisure activity, pain and anxiety.

Scoring: There are three possible levels of response for each item. Each state carries a utility value, which

is calculated using time trade-offs. Perfect health and death have utility values of one and zero,

respectively, and states worse than death (<0) are possible. Part 2 is a visual analogue scale- 0 being worse

imaginable and 100 being best imaginable health status.

Respondent burden: Irrelevant questions for hip fracture patients. The questionnaire takes 1 minute to

complete.




Format compatibility: Older people, the chronically ill, pregnant women, those with limb fractures, minor

non-acute conditions, fit working men, peripheral vascular disease, osteoarthritis and a random community

sample.

Popularity in hip fracture trials: Eight trials (13.6%) used EQ-5D for utility (Miedel 2005, Keating 2005,

2006, Blomfeldt 2005, 2005b, 2007, Tidermark 2003, 2003b, 2004). Eight of those trials were surgical and

one was medical (Tidermark 2004). In the RCT by Blomfeldt 2005b, the EQ5D was administered using

proxy. The EQ5D was used in combination with the Charnley hip score (Miedel 2005), the Charnley hip

score, ADL and ability to walk independently (Blomfeldt 2005) and in a similar combination except with

the use of proxy responses for the ADL, walking capability and living status (Blomfeldt 2005b); the Harris

68

hip score, ADL and mobility (Blomfeldt 2007); with a modified Hip Rating Questionnaire (Keating 2006).

The aim of one study (Tidermark 2003) was to evaluate the responsiveness of the EQ5D and SF-36

instruments, i.e. their ability to capture clinically important changes in elderly patients with a displaced

femoral neck fracture.

Figure 2 The applicability of instrument types such as performance-based, Activities of Daily Living,

Impairment-based composite scores and Multi-item composite scores and the trendline of respondent

burden across measures (10 points = poor, 5 points = moderate, 0 points = good).

69

Table 3 Instrument characteristics measuring health-status or utility

Functional Status Questionnaire

Nottingham Health Profile

Short Form-36 WHOQOL-BREF

EQ5D

Year of initial publication

1978 1981 1994 1996 1990

Reference to First development

60 161, 190 157 162 156

Development population

Generic Generic Generic Generic Generic

Number of items 34 38 36 26 5 Domains Basic ADL

Int. ADL Mental health Work performance Social activity Quality of interaction 6 individual questions (work status, bed days, restricted days, sexual relationships, feelings about health, social activity)

Part I: Pain Physical mobility Sleep Energy Social life Emotion Part II: Daily activities

Limitations in physical activities Bodily pain Limitations in usual role activities Limitations in usual role activities because of emotional problems General mental health Limitations in social activities because of physical or emotional problems Vitality General health perceptions

Physical health Psychological health Social relationships Environment

Mobility Self-care Usual activities Pain or discomfort Anxiety/depression

Response 4, 5, 6-point ratings are used, referring to health in the past month, standardized to a 0 to 100 range

Dichotomous scale

Five-point scales and dichotomous scale

5-point response scales cover intensity, capacity, or frequency. For the subjective questions a satisfied-dissatisfied scale was used.

Each dimension in part I has 3 levels: no problems, some problems, severe problems, the resulting 1-digit number expresses the level selected for that dimension creating a 5-digit profile number; part II includes VAS 0 to 100

Scoring 6 summary functional status scores 6 single-item scores

Six scores each out of 100, weighted

Two sets of scores are derived: 8 section scores and two summary scores, one for physical component and one for mental component scores. Each subscore 0 to 100.

Highest domain score 20

EQ5d index score-0 indicates worst possible health state and a value of 1 indicates full health

Administration Patient Patient Patient Patient Patient

70

4.8 Discussion

In the past, the assessment of clinical outcome following a hip fracture relied primarily upon radiographic

parameters and range of motion in hip flexion to measure surgical success or the patient’s impairment.

Functional parameters such as quality of life and satisfaction from the perspective of the patient rather than

the surgeon were overlooked. We believe that in order to select an appropriate instrument for a study it is

crucial to evaluate the pragmatic characteristics as well as the psychometric properties of measurement

tools. Despite the rise over the recent decades in the use of functional outcome instruments in the aged

with hip fractures, the inclusion of hastily developed instruments can compromise the quality of the scales

and its psychometric properties. We believe that our results demonstrate that trialists could be using

measures quantifying activities of daily living because they are on average moderate across the four

dimensions of applicability (Figure 2). Likewise, the use of impairment-based composite scores, such as the

Harris hip score, could be because of its minimal respondent burden and good format compatibility (Figure

2). Multi-attribute scores, which include health status measures such as the SF-36, may be poor in score

distribution; have high examiner burden and poor format compatibility that altogether contribute to their

infrequent use in hip fracture randomized controlled trials.

There are important factors to consider when choosing appropriate measures in hip fracture trials such as

the patient’s perspective. Patient-reported outcomes (PRO’s) are reports coming directly from patients

about how they function or feel in relation to a health condition and its therapy, without interpretation of

the patient’s responses by a physician or anyone else. On the other hand, a performance measure is one in

which an individual is asked to perform a specific task that is evaluated in a standardized manner using

predetermined criteria, such as counting repetitions or timing the activity. The timed-up-and-go test (TUG)

does not focus on a single specific impairment such as the hip joint in itself, but rather on multiple domains

(including balance and strength). These measure the interaction of these factors on the performance of

activities of daily living.192 The TUG, which measures the concept of Activities and Participation, does not

in itself provide a comprehensive evaluation of the many factors involved in functional recovery.172 Several

authors have shown that physical performance measures have not been as thoroughly developed and tested

71

to the extent of the patient-reported outcomes and hence, modest to poor correlations can exist between

performance measures included in a battery of tests.69, 171, 172, 193,195,196 A composite score obtained from a

battery of performance tests may not be measuring the intended concept.

Several trials do not fully report on the specific index or instrument used to measure functional outcomes.

This can further attest to some of the heterogeneous results obtained from the RCTs which make them

incomparable. Our results indicate that poorly-conducted studies also use investigator-developed indices.

Some investigator-developed indices reported a few questions on activities of daily living which can

produce results that are often qualitative and ambiguous, are of unknown reliability and validity, and have

only localized usage.196 In the current RCT hip fracture trials, modifications to existing scales could

undermine their measurement properties. For instance, the word “arthritis” was replaced with “hip fracture”

in the hip rating questionnaire.111, 180 Abbreviated versions also need to be better understood as accuracy

may be sacrificed. Seven out of the twelve items of the Disability Rating Index182 were used because the

remaining 5 items were not considered relevant to elderly hip fracture patients and hence replaced by seven

other items.95,96

The setting and mode of administration is also important in keeping with the conceptual development of the

original scale. For instance, while the BI has been used to obtain detailed information regarding the

patient’s ability to perform ADL in the home, this may not equate with the individual’s recorded ability in

the rehabilitation center.177 The MBI however reflects the patient’s ability in self-care during medical

rehabilitation. This is important because of the influence of factors in the home, and the community

environment e.g. doorway widths, wheelchair access, attitudes of the patient and the caregivers and their

relationship.177 Furthermore, if a measurement is adequately tested, a manual to describe it should be

available. These should include a full description of the purpose of the method, the population for which it

is designed, the populations on which it has been tested and the intended use for the data collected.

72

Composite events pose problems in interpretation.197 The problematic nature of a simple aggregate or

composite index could assume equal weighting of tasks, and does not differentiate the contribution of

specific tasks to cumulative disability (Table 1). For instance, the total BI score is generally not as

significant or meaningful in treatment as the scores on individual items, since these indicate where the

deficiencies are.176, 215 In recent years, the scoring of health measures are also being scrutinized and score

responses that do not represent a breadth of performances are becoming unacceptable.30 The composite

score of some instruments are consistently reported without the relative weighting of specific items within

clusters of roughly equivalent item sets as those of the (I)ADL scales.198 The opposite dilemma can occur

when validated parts are put together in a battery of performance measures such as the FPM battery, thus

making them of unproven validity. In this scenario, Bach claimed that it was difficult to determine how this

composite measure might have contributed to the negative outcome of the trial.70 Indeed, the variability

found in measured function, regardless of the scale, cannot be explained by a single condition or illness and

is a dynamic multi-construct system which warrants future research.

Not only is there a need to adopt reliable methods to assess patient recovery following a hip fracture, but it

is also important to capture all aspects including Body Function and Structure as well as Activity and

Participation. Furthermore, measures that address function in patients with hip fracture can result in good

applicability for the specific population while generic health-status measures can allow for comparisons

across different target populations (Table 2). Consideration of functional outcome measures that include

measures of impairment that are either physician-rated or observer-based, together with measures of

activity limitations and participation restrictions that are either patient self-report or observer-based can

place the effectiveness of the intervention within the context of important and expected outcomes from the

perspective of the patient. Health-status measures cover at least the physical, emotional, and social

dimensions of health but may cover much more (Table 3). In addition to physical, mental, and social well-

being, the Euroqol also covers usual activities. The SF-36 includes work and role performance. The FSQ

overlaps with the SF36 in its measurement of physical function but also encompasses interactions with

family and friends and the WHOQOL additionally asks about spiritual well-being, transportation, and

environmental factors under the umbrella term of health-related quality of life. It is our belief that the SF36,

73

although widely used in clinical studies, may be challenging for the aged with hip fracture who present

various co-morbidities such as dementia and may be difficult to assess with a significant loss to follow-up.

In scoring the EQ5D, a persons’ health state is first summarized by a five-digit profile, indicating the score

on each of the five dimensions including mobility, pain or anxiety-depression, self-care and performance of

usual activities. Subsequently, the scores can be converted into a summary index using a utility-weighted

scoring system. In addition to the self-report version, the EQ5D also exists in observer, proxy versions.

Patient-completed and proxy versions are moderately good but overall, studies have shown a lack of

responsiveness of the EQ5D in those patients found at the upper ends of the health spectrum that may be

attributed in part to the composite scoring system which negates a finer discriminative response scale.30, 199

In contrast, there is good reliability of the well- validated SF-36 in the cognitively impaired.199 However,

the appropriateness of the item wording in the SF36 such as “running and lifting heavy objects” has been

criticized for not being appropriate for the aged.199 In all clinical measures, a ceiling effect may occur when

a subject reaches the maximum score that can be attained within a measure but can still surpass that

score.200

4.5 Summary In examining the practical aspects of the instruments, it is possible to draw some conclusions about their

reported frequency in hip fracture trials. Ideally, an instrument should be low in examiner and respondent

burden as well as have good scoring distributions and format compatibility. However, in addition to the

applicability of instruments it is also important to evaluate the patient’s perspective and to use instruments

that fit the needs of the target population. In the future, we recommend instruments that measure across

Body Function and Structure as well as Activity and Participation. Multiple instruments may be necessary

to address this as composite scores may lack in interpretability of the overall score for the respondent.

Future research should evaluate the psychometric properties of the identified instruments. Finally

composite scores may not fit entirely in the present ICF framework and this is a limitation to the present

study. In the future, we recommend modification of the existing framework to encompass these

instruments. This may include a further breakdown of the components Activity and Participation such to

distinguish between acts that are general things that a person can do independent of context or purpose and

74

Tasks which are done purposefully in their daily life in a specific environment.57 Also societal involvement

may be represented as the context of society and acknowledged areas of human endeavour.57

75

Keypoints in Chapter Four

• Pragmatic as well as psychometric properties should be assessed in the use and development of instruments

• Clinicians in hip fracture trials have mostly adopted measures of ADL. These measures scored on average moderate in all four dimensions of applicability.

• Measures of health-status and utility measures for hip fracture patients were not used consistently in the randomized hip fracture trials. These measures were also ranked high in examiner burden and poor format compatibility.

• We recommend multiple measures that assess both Body Function and Structure as well as Activity and Participation

• An optimal instrument should have minimal examiner and respondent burden and a good score distribution and format compatibility

76

CHAPTER 5

SUBGROUP ANALYSES

5.1 Introduction

Patients diagnosed with dementia represent a substantial proportion of patients with hip fractures that

clinicians will encounter. The prevalence of dementia increases dramatically with age and about one-half of

all nursing home residents have some form of cognitive impairment.200-3 Hip fractures are the second

leading cause of hospitalization in the aged and the added co-morbidity only lengthens hospital stays.19, 20

In a previous systematic review, Wells concluded that frail older rehabilitation candidates with mild to

moderate dementia should not be excluded from rehabilitation and should be screened for cognitive

impairment.20 Table 1 represents the various types of dementia present in the aged.

Older adults with co-morbid chronic conditions are more likely than younger adults to be at a higher risk of

having poor outcomes.204 Besides the increased risk of dislocation; patients with severe cognitive

dysfunction pose significant challenges to the treating surgeon such as a lack of compliance, problems in

completing rehabilitation regimens and further complications with other co-morbidities. Surgeons need

evidence of the impact of different treatment alternatives for this patient population. There is good reason

to believe that relative treatment effectiveness will be different in patients with dementia compared to

patients without dementia. For example, in a rehabilitative trial a patient with dementia will forget not to

weight-bear immediately following hip fracture surgery and without reminders from family or caregivers

the patient may jeopardize early rehabilitation and path to recovery.

Appropriate screening followed by early intervention can promote better informed decision-making. In the

past, a condition such as dementia was a main predictor of mortality; it is now a main predictor of higher

degrees of disability.205-8 Functional assessment is integral to determine the ability of individuals to

complete successfully daily living tasks and live in their usual physical, social, and cultural environment.

Furthermore, traditional mental status screening tools such as the Mini-Mental State Examination (MMSE)

which focus on cognitive functions are not suited for monitoring the progression of the everyday

77

manifestations related to cognitive impairment.209 It is important not to bias the results of hip fracture trials,

by not assessing all patient subtypes i.e. measuring only healthy lucid patients. In a subgroup analysis of

the studies identified in our systematic review, we wanted to identify those trials that potentially included

those with severe cognitive impairment. Our primary objectives were to identify those trials with and

without patients with dementia via the descriptions of inclusion and exclusion criteria reported. We also

wanted to identify the functional outcome instruments in those trials reporting patients with dementia and

hip fracture.

5.2 Methods

In this study, we classified the hip fracture trials into two Groups based on the entry criteria stated in each

trial. Group A trials did not explicitly state exclusion of any patients with co-morbidities. Group B trials

excluded patients diagnosed with one or more co-morbidities or a diagnosis of dementia or those unable to

provide informed consent. To answer our primary question, we examined what instruments measuring

functional outcome were used in Group A vs. Group B trials. In order to ascertain whether the trials

included patients with dementia, the following data was extracted from Group A trials: 1) any terms

including “dementia” and “severe cognitive impairment”, 2) type of diagnostic instrument used, 3)

inclusion and exclusion criteria and 4) functional outcome instruments 5) time points for assessment and 6)

statistical analysis in Group A trials.

Table 1 The spectrum of cognitive impairment

The Spectrum of Cognitive Impairment

Age-related cognitive decline: Decline in cognition consequent to the aging process that is within normal

limits given the person’s age. There may be complaints of mild short-term memory loss manifested by

problems remembering names or appointments; slowed responses; prolonged complex reaction times;

difficulties in solving complex problems; and decreased creativity

Amnestic mild cognitive impairment: Presence of a memory complaint preferably corroborated by an

informant. There is objective memory impairment but otherwise normal general cognitive function.

Activities of daily living should be intact and the patient cannot meet criteria for dementia.

78

Multi-domain mild cognitive impairment: Memory impairment plus deterioration in at least one additional

nonmemory cognitive domain but otherwise normal general cognitive function. Activities of daily living

should be intact and the patient cannot meet criteria for dementia.

Dementia of the Alzheimer’s type: The development of multiple cognitive deficits manifested b (1)

memory impairment and (2) one or more of (a) aphasia (language disturbance), (b) apraxia (impaired motor

skills in absence of primary motor dysfunction), (c) agnosia (failure to recognize objects despite intact

sensory function), and/or (d) disturbance in executive functioning (i.e. initiating, planning, organizing,

sequencing, abstracting). The deficits are severe enough to cause significant loss of social or occupational

functioning and are a decline from a previous level of functioning

Source: American Psychiatric Association210, 2000, Petersen211, 2004

5.3 Results

58 trials were included for data extraction. 1 trial by Hallan (2004) was excluded from the original 59

included studies because of inadequate reporting of inclusion and exclusion criteria.

5.3.1 Group A trials vs. Group B trials

Group A included 15/58 (25.9%) trials, which included patients with dementia and the remaining 43/58

(74.1%) trials excluded patients with dementia (Table 2). One trial included only patients with dementia

and one trial included only hip fracture patients with severe cognitive impairment.77, 150 Two trials explicitly

stated that dementia was not an exclusion criterion.107, 152 2 trials included instruments which accounted for

patients with cognitive impairments or used proxy or caregivers to answer questions.152 The other 9 trials

had mixed cases of patients with different co-morbidities.

Three trials excluded patients with severe co-morbidities that included respiratory or cardiovascular failure

or cerebrovascular disease which could affect mobility (Appendix 4). If patients were dependent in all basic

ADLs or non-ambulatory prior to fracture or unable to walk distance they were excluded from mostly

rehabilitation trials (6 rehabilitation trials, 3 surgical and 1 medical). Terminal illness was considered an

79

exclusion criterion in 6 trials (4 were rehabilitation trials). Living status either independent or alone less

than 4hrs per day or in long-term facility or had pre-fracture hospitalization, was considered an exclusion

criterion in 6 trials.

5.3.2 Diagnosing patients with co-morbidities in RCTs

24 out of 58 (41.3%) trials used diagnostic instruments to assess patients with co-morbidities (Figure 1).

The majority of which used the ASA classification212 (11/58 trials).

19 out of 58 (32.8%) trials diagnosed cognitive impairment (Figure 2). The majority of the trials (7/58)

used the Mini Mental State Examination (MMSE).213-4 Only one trial examined patients both preoperatively

and postoperatively administering the MMSE.112 There was no consistency found in the thresholds used to

exclude patients even within each intervention type (Appendix 4). 7 trials out of 58 did not specify any

cut-off points in the trial methodology. Only two surgical trials used cognitive screening for the purpose of

including patients with dementia.77, 150

5.3.3 Diagnosing emotional status

Only three studies examined the effect of depression and anxiety on rehabilitation in the elderly with hip

fractures using the Geriatric Depression Scale, the Hospital Anxiety and Depression scale and the CES-D.

But no patients with either depression or anxiety were excluded from RCTs.81, 113, 131

5.3.4 Outcome measures Based on the inclusion and exclusion criteria reported in the high quality RCTs, nine types of outcome

measures were commonly used in those trials (Group A) assessing all patient subtypes (Table 2). Upon data

extraction, one trial (Miedel 2005) which used a quality of life or utility measure such as the EQ5D was

subsequently excluded from Group A trials since the authors reported the administration of EQ5D (proxy)

did not include those patients with severe cognitive impairment (Table 3).120 Another trial used the EQ5D

proxy in patients with severe cognitive impairment.77 Group A measures found to assess all patient

subtypes were: Harris hip score, range of movement, pain, mobility, ADL, MBI and IADL. Other outcome

80

measures included: the use of support and accommodation. The use of a proxy or caregiver was utilized in

four trials in Group A and in three trials in Group B.

5.4 Discussion

In our systematic review the majority of the randomized controlled clinical trials excluded patients who

were cognitively impaired. However, in a study by Van Dortmont, with all hip fracture patients diagnosed

with dementia using the DSM-III-R, the authors in accordance with their previous work and other authors,

claimed that mental health state does indeed appear to be a significant prognostic factor for operative

treatment for intracapsular femoral neck fractures.215 In the study by Blomfeldt, the authors used the

Euroqol-5D despite referencing papers that do not support its use in those patients with severe cognitive

impairment. This is because of its poor correlations with outcome and ceiling effect.77 Ceiling effects were

reported to be attributed to the lack of finer discriminative response options found in this utility measure

thus resulting in low responsiveness.199 None of these trials include any measurement of quality of life even

if administered to a proxy or caregiver. Bachrach-Lindstrom utilized a proxy. Using a proxy has been

shown to be a reliable way to collect valuable information even when working with patients with mental

impairment.71, 216 All RCTs that included patients with dementia incorporate outcomes of mobility, living

status, activities of daily living and measures of impairment such as fracture fixation and reduction or other

parameters measured radiographically and measures of pain. The assessment of pain is challenging in

mentally impaired patients and the authors based their findings on anamnestic data in combination with a

physical examination using the Harris Hip Scale.71

Dementia is characterized by losses in cognitive function making it challenging to perform complex tasks

of daily living.205 Consequently, persons with dementia may have significant problems in their social,

occupational, self-care, cognitive and communicative abilities.217-9 However, randomized controlled trials

still need to substantiate their findings regarding patients with significant co-morbidity by capturing all

relevant measures in these patients groups. In one rehabilitation trial, there were no significant differences

81

in functional recovery in the groups.152 A subgroup analysis, revealed that the effect of geriatric

intervention on short-term functional recovery was greater in the subgroup of patients without dementia

(65% and 47% in the intervention and control groups respectively, p=0.006) and in the group of patients

who walked independently before the fracture (62% and 48%, P=0.03) than in those with dementia and

those who did not walk independently before the fracture.152 However, these results must be cautiously

interpreted because the analysis was not previously specified and the number of patients in each subgroup

was small.

Several reports claim that classifying the pre-fracture mental condition based on a postoperative

examination like the Mini Mental State Examination is not reliable due to the high incidence of temporary

postoperative delirium in this group of patients.71, 220 Several papers report that age, sex, location at time of

injury, pre-injury status, preoperative complications, time until surgery, and presence or absence of

dementia are poor prognostic factors.221-3

Likewise, the ASA was designed to categorize physical status in order to create a uniform system for

statistical analysis. Interestingly, it was never intended to represent peri-operative risk of morbidity for any

individual patient considering the many other influential factors such as the surgical procedure, patient

preparation, variability of surgeons and equipment used in different institutions that can affect outcome.224

Ward and colleagues reported that the ASA classification was a good predictor of mortality, and the 3-year

mortality was significantly less for ASA I and II patients than for ASA III, IV, and V patients.225 However,

the ASA classification is subjective and is dependent on the surgeon, internists, and anesthesiologists in

particular to discern classes ASA II, III, III and IV. In a study by Mak, the authors found that the

percentage of agreement was between 31 to 85%.224 Overall correlation was only fair in all groups (Kappa

indices: 0.21 – 0.4). The current pattern of inter-observer inconsistency of classification was similar to that

20 years ago and exaggerated between locally and overseas trained specialists (P <0.05). Overall, we do not

recommend the use of the ASA classification to include or exclude patients. Although it is used frequently

in studies to assess co-morbidity, its usefulness is questionable and a new more precise scoring system is

needed.

82

5.5 Ethical considerations

Cognitive impairment does not lead necessarily to an assessment of incompetence. Cognitively impaired

elderly people may still be able to make morally responsible decisions, based on different degrees of

decision-making and on their personal history.226 The solution is more complex and assessment instruments

need to be used with caution when dealing with multiple interactions between medical personnel, patients

and their family members. With regard to the ethical considerations and care of patients with dementia, the

approach needs to encompass the patient’s feeling of being cared for and approached as competent or free

individuals. As the elderly are regarded as a vulnerable group, it is important to use a method that can

protect those who cannot decide, and provide the opportunity to participate in research for those who are

able to decide for themselves. Especially because the law prohibits scientific research on incompetent

patients, unless special conditions are fulfilled, a closer investigation on the issue of informed consent is

needed, and should be recommended for future research. Patients with mild to moderate dementia still have

moral capacity. In our systematic review, the majority of rehabilitation trials focusing on patients with

dementia in specialized treatment centers have reported that significant improvement in postoperative

functional status can be achieved. Furthermore, protocols that have included these patient subtypes have

clearly relied upon one major concept of disability: activities of daily living and accommodation or living

status and the use of support for walking. These measures, do not provide adequate clinical judgment on

treatment effectiveness even for patients enrolled in the study who are healthy. Proxy and patient responses

are not interchangeable; however, proxy responses can provide an option for assessing function and health

status in patients who are unable to respond on their own behalf. In a prospective longitudinal study,

examining agreement between patient and proxy respondents using the Health Utilities Mark 2 and Mark 3

over time during a 6 month recovery after hip fracture, the authors reported ICC values from 0.50 to 0.85

(p<0.001) for physically based observable dimensions of health status and from 0.32 to 0.66 (P<.01) for

less observable dimensions. Future investigation of the proxy inter-rater agreement in the use of health

status instruments is needed.227

83

5.6 Summary

Although trials in patients with hip fractures can and should include patients with co-morbidities, the

majority of the trials we identified systematically exclude this important group of patients. Furthermore, in

trials that did include patients with co-morbidities the thresholds that defined inclusion were not consistent.

84

Figure 1 Diagnostic tools used to assess co-morbidity in high quality RCTs

0

2

4

6

8

10

12

ASA Ceder Charlson Co-morbiditygrade

Unspecified

Diagnostic tools for comorbidity

** ASA American Anesthesiologist Society

Figure 2 Frequently used measures for diagnosing cognitive status in high quality RCTs

0

1

2

3

4

5

6

7

8

MMSE MTS SPMSQ BDR CAM other

Diagnostic tools for cognitive impairment

**MMSE, Mini Mental status examination; MTS, Mental Test Score; SPMSQ, short portable mental status questionnaire; BDR,

blessed Dementia Rating; CAM, Confusion Assessment Method

85

Table 2 List of measures stratified by group based on inclusion and exclusion criteria alone. Group A trials did not explicitly state exclusion of patients with co-morbidities. Group B trials excluded patients with hip fractures diagnosed with one or more co-morbidities. Group A (N = 14)

Group B (N=27)

EQ5d 1 2 EQ5D (proxy) 2 SIP-NH 1 SWED-QUAL SF-36 3 SF-36 (chronic and acute forms)

1

PCS (SF-36) only 2 PCS (proxy-SF36)only 1 MCS (SF36)only 2 MCS (SF36 proxy)only 1 Charnley hip score 2 Harris hip score 2 2 Hip Rating Questionnaire

1

Mod HRQ 1 Oxford hip score 1 MFA full Yale physical activity survey

1

Outcome expectations for exercise scale

1

Questionnaire about health and functional abilities

1

Global health self-ratings

1

Functional Independence Measure (Mobility subscale)

1

Functional performance battery

1

Physical performance test

2

PPME 1 Physical function subscale (FSQ)

1

Level of satisfaction 1 Level of satisfaction (proxy)

1

ROM 1 3 Pain 5 4 Short form McGill pain questionnaire

1

Wong Baker pain rating scale

1

Mod pain Charnley

86

Pain (Charnley) 1 Balance 3 Balance scale (Berg) 1 Postural stability 2 Strength 6 Functional reach 2 Functional ambulation classification

1

FAC (proxy) 1 Functional Status Index 1 Falls Efficacy Scale 1 Mod falls efficacy scale 1 Mobility 6 15 Mobility (proxy) 1 Gait component (POMA)

Step Activity Monitor 1 ADL 4 6 Mod ADL 1 ADL (proxy) 2 MBI 1 2 IADL 3 1 London Handicap Scale 1 Disability Rating scale (DRI)

DRI modified Use of support 2 5 Caregiver strain index 2 Living situation 5 2 * Excluded 1 medical trial for not clearly reported inclusion and exclusion criteria (Hallan, 2006), ** 1 trial in Group A limited the use of EQ5D to those patients without severe cognitive dysfunction but included all patient types in the trial (Miedel, 2005).

87

Table 3 Group A trials including patients with severe co-morbidities such as dementia

Primary Author

Year Intervention Diagnostic tool, grade (definition of cognitive impairment), N patients

Types of outcomes Time points

Peyser132 2007 Surgical ASA 1(normal healthy)=1 2(patient with mild systemic disease)=46 3(severe but not incapacitating disease)=49 4(severe, incapacitating systemic condition that is a constant threat to life)=7 5(near death)=0

• Stability of fixation

• Pain (VAS) • Weight-bearing

index

6, 12, 24 wks & 24 mths

Stenvall143 2007 Rehab MMSE Dementia= 64 Cancer= 29 Stroke=49 Depression (GDS)=78 Diabetes= 40 Cardiovascular disease = 110 Hearing loss= 76 Vision loss = 64

• Mobility (S-COVS)

• Living conditions • ADL (Katz)

4, 12 mths

Krichbaum113 2007 Rehab MMSE < 24 = 1 • Living conditions • Global health self

ratings • Functional status

index

1, 3, 6 and 12 mths

Ovesen126 2004 Surgical ASA 1 = 39 2 = 39 3 = 51 4 = 17

• Reduction and implant positioning

• Walking ability • Accommodation • Need for walking

aid • Mortality

4 (need for walking aid), 12 mths

Johansson107 2006 Surgical Mod Norton scale 4(fully oriented)= 90 1-3 (no contact, cannot answer adequately, occasionally confused)= 56

• Dislocation • Pain (HHS) • Accommodation • Harris hip score • Costs • Mortality

3mths, 1 & 2 yrs

Kim112 2005 Surgical MMSE = NR ASA = NR

• Quality of reduction

• Bipolar cup migration

• Erosion of acetabular cartilage

• Stem stability • Bone quality

(Singh index)

• Pain (HHS) • MMSE • ADL (Katz) • HHS • Costs

Pre-op, 2yrs

88

Primary Author



Miedel120 2005 Surgical Ceder Co-morbidity scale Group A= 45 (41%) Group B = 48 (44%) SPMSQ > 4 (cognitive function intact, mildly impaired, moderately impaired) At inclusion = 162 At 4mths = 142 At 1 year = 134 NB 66(13%) lost to f-u Absent because of advanced age, illness or dementia

• Fracture reduction

• Screw position • Wound infection • Mobility • ADL (Katz) • Charnley’s

numerical classification

• EQ5D (only presented in those without severe cognitive dysfunction SPMSQ ≥ 3

4, 12 mths

Vidan152 2005 Rehab Chart review or interview of patient’s nurse Cardiovascular disease = 73 Diabetes mellitus = 64 Hypertension = 134 Dementia = 78 Delirium assessed by CAM = NR

• Wound infection, prosthesis luxation

• Functional Ambulation Classification (FAC)

• ADL (Katz) • Discharge

location • Composite

functional status outcome (ADL + FAC)

3, 6, 12 mths

Blomfeldt77 2005 Surgical SPMSQ 0-2 cognitive function severely impaired = 60 Co-morbidities Ceder C, another illness which affected rehabilitation = 60

• Fracture healing • Hip

complications

• Charnley’s numerical classification

• EQ5D • ADL (Katz)

4, 24 mths

Ahrengart69 2002 Surgical ASA 1=76.8 2=172.4 3=147 4=29.4 5=None

• Fracture healing • Leg-length

discrepancy

• Cut-out • Pain around hip • Hip rotation • Use of walking

aids • Living conditions

6mths

Janzing105 2002 Surgical ASA 1=2 2=40 3=39 4=2

• BMD (Singh) • Fracture healing • Pain (VAS) • Need of walking

aids • Living conditions

3, 6 & 12mths

89

Primary Author



• Mortality Naglie122 2002 Rehab SMSQ 3 to 6 = 74

• MBI • IADL (Lawton &

Brody)

3 & 6 mths

Van Dortmont150

2000 Surgical DSM-III-R = 60 Co-existing conditions Neurological = 17 Cardiovascular = 16 Metabolic=11 Pulmonary=7 Rheumatoid=1 Malignancy, treated= 6 Other= 13

• Reduction and fixation

• Blood loss • Wound

complications

• No. of secondary interventions

• ADL • Mortality

4 mths, 1 and 2 yrs

Bachrach-Lindstrom71

2000 Medical Mod Norton Scale 1-3 = mentally impaired = 41

• Body weight (Nutritional status)

• Triceps skinfold thickness (Nutritional status)

• Mid-arm circumference (Nutritional status)

• Serum albumin • ADL (Katz) • IADL (Hamrin) • Arm muscle

circumference (Nutritional status)

• HHS • Mortality

1,3 mths & 1 yr, Pain at 3mths & 1 yr

90

Keypoints from Chapter Five

• ADLs are critical for monitoring disease progression such as dementia, determining the need for health services, and evaluating the efficacy of therapeutic interventions

• The measurement of function using Daily Living activities should be measured as per categories of activities of daily living, instrumental activities of daily living and mobility related to walking and to managing stairs

• The majority of trials excluded patients with cognitive impairment such as dementia

• Use of proxy was underutilized in patients with cognitive impairment

• As a result, assessment of health-status and utility were not adequately captured in these patients

91

Chapter 6

CONCLUSIONS

6.1 Implications of study

There are numerous functional instruments that include an individual’s ability to carry out basic and

instrumental activities of daily living as well as their participation in those activities. Complications arise if

the instruments were developed without a conceptual basis which makes it challenging to apply the

composite score to reflect the functional recovery of the aged with hip fractures. Our systematic review, for

instance demonstrated a wide range of mobility measures applied to this target population. This reflects the

limitation of a time-consuming scale, questions which are redundant, or are either too easy or too difficult

or altogether irrelevant. Furthermore, the different rating scales, scores, different administration and scoring

options make outcomes results across trials difficult to compare. In this particular fragile and aged

population, co-morbidity increases respondent burden and examiner burden. The pragmatic considerations

of an instrument can lead to obvious disability bias as those with dementia are then not included in trials.

Functional outcomes are consequently measured for the most part in healthy lucid individuals. We believe

that a proxy can be utilized to the advantage of trialists in order to obtain information on all patients,

including those with severe cognitive impairment, who otherwise are not represented.

There is not only a need to adopt reliable methods to assess patient recovery following a hip fracture, it is

also important to capture with the use of multiple instruments domains under the categories of Body

Function and Structure as well as Activity and Participation according to the ICF scheme. Consideration of

functional outcome measures that include measures of impairment that are either physician-rated or

observer-based, together with measures of activity limitations and participation restrictions that are either

patient self-report or observer-based can place the effectiveness of the intervention within the context of

important and expected outcomes from the perspective of the patient.

6.2 Limitations

In this study, the quality assessment of trials relied upon the judgment and experience of one reviewer.

However potential bias was minimized since the trials were masked and the criteria for the Detsky-scale

92

were carefully adhered to. Another potential limitation encountered was the choice of the ICF framework

and blending the categories Activities and Participation together. Although we believe that clear

distinctions between these two categories could not be identified, distinct underlying concepts with two

different definitions are presented in the ICF framework. For the purposes of this review, we found

blending the categories of Activities and Participation when evaluating activities of daily living cut across

the different subdomains. The two different concepts can be operationalized using qualifiers of

performance and capacity to aid with the distinction. Perhaps in the future, modifications to the ICF may

address the gaps present in the current framework used. Furthermore, we have not considered in depth the

measurement properties such as internal validity, responsiveness, and reliability that are critical in selecting

the appropriate instruments.25 However, because of the scarcity of literature on many of these instruments,

we believe that assessing the pragmatic qualities of the instruments could provide valuable information as

to which instruments deserve further psychometric testing if needed.

6.3 Future directions

Implementing a universal language such as the ICF and more specifically the domains which can be coded

to three decimal places and specific categorizations of impairment, activity limitation and participation

restriction is worth further investigation. For instance in a recent publication by Jette and Haley, a core set

of 124 Basic Mobility items and 65 Daily Activity items were chosen for inclusion in an Activity Measure

for Post-Acute Care (AM-PAC) item bank based on a comprehensive review of items from existing

instruments, the ICF framework, a review by 10 measurement and content experts and suggestions from

several focus groups of individuals with disabilities.228 In turn, these will be completed in a self-report

tablet computer provided to the patients in the waiting room prior to their therapy visit. This system uses an

algorithm that selects items from the AM-PAC items banks directly tailored to the person, and shortens or

lengthens the test to achieve either the desired precision or a pre-assigned item stopping rule.

93

6.3 Summary

Clinical trials are still dominated by impairment-based scoring systems although our results show a positive

trend over time in the use of functional outcome instruments. Disease, generic and utility measures tend to

be more standardized than measures assessing activities of daily living, performance-base and impairment-

base scoring systems. Although disease, generic and utility measures have undergone more field-testing,

their use in hip fracture clinical trials remains limited. This may be in part due to poor format compatibility

for this specific age group with co-morbidities as well as dealing with the burden of a hip fracture. Low

respondent burden is also critical; however, a good instrument with relevant items pertaining to the hip

fracture may compensate. Furthermore, the use of proxy responders could be useful in identifying treatment

and management gaps in patients with co-morbidities. The results of using PROs in clinical practice have

been inconsistent, and ascertaining the circumstances under which PROs are truly helpful beyond research

settings remains a challenge.

94

References

1. Jaglal SB. Sherry PG. Schatzker J. The impact and consequences of hip fracture in Ontario. Canadian Journal of Surgery. 1996; 39(2):105-11.

2. Papadimitropoulos, E A. Coyte, P C. Josse, R G. Current and projected rates of hip fracture in Canada. CMAJ Canadian Medical Association Journal. 1997; 157(10):1357-63.

3. Brainsky A, Glick H, Lydick E, et al. The economic cost of hip fractures in community-dwelling older adults: a prospective study. J Am Geriatr Soc 1997; 45: 281-7.

4. Coper C, Campion G, Melton LJ. Hip fractures in the elderly—a world-wide projection. Osteoporosis Int 1992; 2: 285-9.

5. Gullberg B, Johnell O, Kanis JA. World-wide projections for hip fracture. Osteoporosis Int. 1997; 7: 407-13.

6. Maggi S, Kelsey JL, Litvak J, Heyse SP. Incidence of hip fractures in the elderly: a cross-national analysis. Osteoporosis Int 1991; 1: 232-41.

7. National Osteoporosis Foundation. Osteoporosis: review of the evidence for prevention, diagnosis, and treatment and cost-effectiveness analysis. Osteoporos Int 1998; 8 (suppl 4): S1-S88.

8. Jaglal SB, Weller I, Mamdani M et al. Population trends in BMD testing, treatment, and hip and wrist fracture rates: Are the hip fracture projections wrong? J Bone Miner Res 2005; 20: 898-905.

9. Jaglal SB. Falling hip fracture rates. J Bone and Min Res 2007; 22(7): 1098.

10. Egan M, Jaglal S, Byrne K et al. Factors associated with a second hip fracture: a systematic review. Clin Rehabil 2008; 22: 272.

11. Shabat S, Gepstein R, Mann G et al. The Second Hip Fracture—An analysis of 84 elderly patients. The Orthop Trauma 2003; 17 (9): 613-17.

12. Schroeder HM, Petersen KK, Erlandsend M. Occurrence and incidence of the second hip fracture. Clin Orthop Rel Res 1993; 289: 166-69.

13. Colon-Emeric CS, Sloane R, Hawkes WG et al. The risk of subsequent fractures in community-dwelling men and male veterans with hip fracture. Am J Med 2000; 109: 324-26?

14. Riley RL, Carnes ML, Gudmundsson A, et al. Outcomes and secondary prevention strategies for male hip fractures. Ann Pharmacother 2002; 36:17-23.

15. Formiga F, Rivera A, Nolla JM et al. Failure to treat osteoporosis and the risk of subsequent fractures in elderly patients with previous hip fracture: a five-year retrospective study. Aging Clin Exp Res 2005; 17: 96-99.

16. Wiktorowicz ME, Goeree R, Papaioannou A, et al. Economic implications of hip fracture: health service use, institutional care and cost in Canada. Osteoporosis International. 2001; 12(4): 271-8.

17. Haentjens P, Lamraski G. Endoprosthetic replacement of unstable, comminuted intertrochanteric fracture of the femur in the elderly, osteoporotic patient: A review. Disability & Rehabilitation 2005; 27 (18 & 19): 1167–1180.

18. Bernardini B, Meinecke C, Pagani M et al. Comorbidity and adverse clinical events in the rehabilitation of older adults after hip fracture. J Am Geriatr Soc 1995; 43: 894-8.

19. Wilkins K. Health care consequences of falls for senior. Health Reports. 1999; 10: 47-5.

95

20. Wells JL, Seabrook J, Stolee P et al. State of the Art in Geriatric Rehabilitation. Part II: clinical challenges. Arch Phys Med Rehab 2003; 84: 898-903.

21. Craik RL. Disability following hip fracture Phys Ther. 1994; 74 (5): 387-398.

22. Egol K, Koval K, Zuckerman J. Functional recovery following hip fracture in the elderly. J Orthop Trauma. 1997; 11(8): 594-99.

23. McDermott W. Absence of indicators of the influence of its physicians on society’s health. Am J of Med 1981; 70: 833-843.

24. Beaton DE, Schemitsch E. Measures of health-related quality of life and physical function. Clin Orthop 2003; 413, 90-105.

25. Swiontkowski MF. Outcomes measurement in orthopaedic trauma surgery. Injury 1995; 26 (10): 653-657.

26. Bombardier C. Outcome assessments in the evaluation of treatment of spinal disorders. SPINE 2000; 15(24): 3097-3099.

27. Guyatt GH, Rennie D. Users guides to the medical literature Essentials of evidence-based clinical practice. Chicago, IL: American Medical Association; 2002.

28. Jette A. Functional disability and rehabilitation of the aged. Topics in Geriatric Rehabilitation 1986; 1: 1-7.

29. Bhandari M, Guyatt G, Montori V. User’s Guide to the Orthopaedic Literature: how to use a systematic literature review. J Bone Joint Surg 2002; 1672-1682.

30. McDowell, I. Measuring Health: A Guide to Rating Scales and Questionnaires. Oxford University Press, Incorporated, UK. 2006

31. Guyatt GH, Feeny DH, and Patrick DL. Measuring Health-related Quality of Life Annals of Int Med 1993; 118 (8): 622-629.

32. Putnam M. Linking aging theory and disability models: increasing the potential to explore aging with physical impairment. Gerontologist 2002; 42(6):799-806.

33. Bickenbach JE, Chatterji S, Badley EM et al. Models of disablement, universalism and the international classification of impairments, disabilities and handicaps. Social Science & Medicine 1999; 48(9): 1173-87.

34. Jette DU, Jette AM. Physical therapy and health outcomes in patients with spinal impairments. Physical Therapy 1996; 76(9): 930-45.

35. Wilson IB, Cleary PD. Linking clinical variable with health-related quality of life: a conceptual model of patient outcomes. JAMA 1995; 273: 59-65.

36. World Health Organization. Towards a common language for functioning, disability, and health—ICF. Geneva: WHO; 2002.

37. Mulrow CD. Rationale for systematic reviews. In: Chalmers I, Altman DG, eds. Systematic Reviews. London. BMJ 1995: 1-8.

38. Scientific Advisory Committee of the Medical Outcomes Trust. Assessing health status and quality-of-life instruments: Attributes and review criteria. Qual Life Res 2002; 11: 193-205.

96

39. Goldstein FC, Strasser DC, Woodard JL et al. Functional outcome of cognitively impaired hip fracture patients on a geriatric rehabilitation unit. J of American geriatrics society 1997; 45 (1): 35-42.

40. World Health Organization International Classification of Functioning, Disability and Health (ICF). Geneva: WHO; 2001.

41. Stucki G. International Classification of Functioning, Disability, and Health (ICF): a promising framework and classification for rehabilitation medicine. American Journal of Physical Medicine & Rehabilitation. 2005; 84(10):733-40.

42. Verbrugge LM, Jette AM. The disablement process. Soc Sci. Med 1994; 38 (1): 1-14.

43. Bowling A. Measuring Health—A review of quality of life Measurement Scales, The concept of functional ability—2nd edition, Open University Press 1997.

44. Buckingham UK, McColl MA, Bikenbach JE (eds). Introduction to Disability. WB Sunders Company Ltd. London, England1998.

45. Katz S, Ford AB, Moskowitz RW et al. Studies of illness in the aged. The index of ADL: A standardized Measure of Biological and psychosocial function. JAMA 1963; 185:914-919.

46. Lawton HP, Brody EM. Assessment of older people: self-maintaining and instrumental activities of daily living. Gerontologist 1969; 9(3): 179-86.

47. Jette AM, Haley SM, Coster WJ, et al. Late life function and disability instrument: I. Development and evaluation of the disability component. J Gerontology Medical Sciences. 2002; 57A (4):M209-221.

48. Hoeymans N, Feskens EJ, van den Bos et al. Measuring functional status: cross-sectional and longitudinal associations between performance and self-report. J Clin Epi 1996; 49, 1102-1110.

49. Young NL, Williams JI, Yoshida KK et al. The context of measuring disability: does it matter whether capability or performance is measured? J Clin Epid 1996; 49 (10): 1097-1101.

50. Nagi SZ. A study in the evaluation of disability and rehabilitation potential: concepts, methods and procedures. Am J Public Health Nations Health 1964; 54 (9): 1568.

51. Granger CV, and Gresham GE, eds. Functional assessment in rehabilitation medicine (1984). Williams and Wilkins. Baltimore, MD.

52. Jette AM. Physical disablement concepts for physical therapy research and practice. Physical therapy 1994; 74 (5).

53. National Institutes of Health, Research Plan for the National Centre for Medical Rehabilitation Research, NIH Publication No. 93-3509. Bethesda, MD: National Institutes of Health, 1993.

54. Pope AM, and Tarlov eds. Disability in America. Towards a National Agenda for Prevention. Washington DC: Institutes of Medicine, National Academy Press. 1991

55. World Health Organization International Classification of Impairments, Disabilities, and Handicaps (ICIDH). Geneva: WHO; 1980.

56. Lezzoni LI, Greenberg MS. Capturing and classifying functional status information in administrative databases. Health Care Finance review 2003; 24 (3): 61-76.

97

57. Badley E. Enhancing the conceptual clarity of the activity and participation components of the International Classification of Functioning, Disability, and Health. Social Science & Medicine 2008; 66:2335-2345.

58. Jette A, Norweg A and Haley S. Achieving meaningful measurements of ICF concepts. Disability and Rehabilitation 2008; 30 (12-13): 963-969.

59. Journal of Bone and Joint Surgery Inc. “Levels of Evidence for Primary Research Question” in Instructions to Authors. Stanford University Libraries Highwire Press. March 14, 2008. www2.ejbjs.org/mis/instrux.dtl#levels.

60. Jette AM, Davies AR, Cleary PD et al. The Functional Status Questionnaire: reliability and validity when used in primary care. Journal of general internal medicine. 1986; 1: 143-149.

61. Detsky AS, Naylor CD, O’Rourke K et al. Incorporating variations in the quality of individual randomized trials into meta-analysis. J Clin Epi 1992; 45 (3): 255-65.

62. Systematic Reviews in Health Care: Meta-analysis in context. Eds.Egger M, Davey Smith G, Altman DG. 2nd edition. BMJ Publishing group. London, UK. 2001.

63. Bhandari M, Richards RR, Sprague S, et al. The quality of reporting of randomized trials in the Journal of Bone and Joint Surgery from 1988 through 2000. J bone Joint Surg Am. 2002; 84-A: 388-396.

64. Sukhdeep KD, Bronwyn LT, Slobogean BA et al. A quality assessment of randomized clinical trials in pediatric orthopaedics. J Pediatr Orthop 2007;27(5):573-581.

65. Donner A, Klar N. The statistical analysis of kappa statistics in multiple samples. J Clin Epidemiol 1996; 49: 1053-8.

66. Fleiss J. The statistical basis of meta-analysis. Stat Methods Med Res. 1993; 2: 121-45.

67. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977; 33:159-174.

68. Adams, CI et al Prospective Randomized controlled trial of an intramedullary nail versus dynamic screw and plate for intertrochnateric fractures of the femur. Journal of orthopaedic trauma 15.6 (2001): 394-400.

69. Ahrengart, L., et al. A Randomized Study of the Compression Hip Screw and Gamma Nail in 426 Fractures. Clinical Orthopaedics & Related Research. 2002; 401: 209-22.

70. Bach, M. A., et al. The Effects of MK-0677, an Oral Growth Hormone Secretagogue, in Patients

with Hip Fracture. Journal of the American Geriatrics Society 2004; 52 (4): 516-23. 71. Bachrach-Lindstrom, M., et al. Nutritional Status and Functional Capacity After Femoral Neck

Fractures: A Prospective Randomized One-Year Follow-Up Study. Aging-Clinical & Experimental Research. 2000; 12(5): 366-74.

72. Baker, R. P., et al. Total Hip Arthroplasty and Hemiarthroplasty in Mobile, Independent Patients

with a Displaced Intracapsular Fracture of the Femoral Neck. A Randomized, Controlled Trial. The Journal of bone and joint surgery. American volume. 2006; 88 (12): 2583-9.

73. Bal, S et al., “Ceramic-on-ceramic versus ceramic-on-polyethylene bearings in total hip arthroplasty: Results of a multicenter prospective randomized study and update of modern ceramic total hip trials in the United States” in Ceramics in Orthopaedics . (2005): 101-108

98

74. Bannister, G. C., et al. The Fixation and Prognosis of Trochanteric Fractures. A Randomized Prospective Controlled Trial. Clinical Orthopaedics & Related Research.254 (1990): 242-6.

75. Binder, E. F., et al. Effects of Extended Outpatient Rehabilitation after Hip Fracture: A Randomized Controlled Trial. JAMA 2004; 292(7): 837-46.

76. Blomfeldt, R., et al. Comparison of Internal Fixation with Total Hip Replacement for Displaced

Femoral Neck Fractures. Randomized, Controlled Trial Performed at Four Years. Journal of Bone & Joint Surgery - American Volume 2005; 87(8): 1680-8.

77. Blomfeldt R. Internal Fixation versus Hemiarthroplasty for Displaced Fractures of the Femoral

Neck in Elderly Patients with Severe Cognitive Impairment. Journal of Bone & Joint Surgery - British Volume 2005; 87(4): 523-9.

78. Blomfeldt, R., et al. A Randomised Controlled Trial Comparing Bipolar Hemiarthroplasty with

Total Hip Replacement for Displaced Intracapsular Fractures of the Femoral Neck in Elderly Patients. Journal of Bone & Joint Surgery - British Volume 2007; 89(2): 160-5.

79. Boonen, S., et al. Musculoskeletal Effects of the Recombinant Human IGF-I/IGF Binding Protein-

3 Complex in Osteoporotic Patients with Proximal Femoral Fracture: A Double-Blind, Placebo-Controlled Pilot Study. Journal of Clinical Endocrinology & Metabolism 2002; 87(4): 1593-9.

80. Bruce, D., et al. Nutritional Supplements After Hip Fracture: Poor Compliance Limits Effectiveness. Clinical Nutrition 2003;22(5): 497-500.

81. Burns A, et al. Treatment and Prevention of Depression After Surgery for Hip Fracture in Older People: Randomized, Controlled Trials. Journal of the American Geriatrics Society 2007; 55 (1): 75-80.

82. Calder SJ, Anderson GH, Harper WM, et al. A subjective health indicator for follow-up: A

randomised trial after treatment of displaced intracapsular hip fractures. J Bone Joint Surg 1995; 77B: 494-496.

83. Calder SJ, et al. Unipolar Or Bipolar Prosthesis for Displaced Intracapsular Hip Fracture in

Octogenarians: A Randomised Prospective Study. Journal of Bone & Joint Surgery - British 1996; 78 (3): 391-4.

84. Cornell, C. N., et al. Unipolar Versus Bipolar Hemiarthroplasty for the Treatment of Femoral

Neck Fractures in the Elderly. Clinical Orthopaedics & Related Research. 1998; 348: 67-71.

85. Crotty, M., et al. Early Discharge and Home Rehabilitation After Hip Fracture Achieves Functional Improvements: A Randomized Controlled Trial. Clinical rehabilitation 2002; 16 (4): 406-13.

86. Crotty, M., et al. Patient and Caregiver Outcomes 12 Months After Home-Based Therapy for Hip

Fracture: A Randomized Controlled Trial. Archives of Physical Medicine and Rehabilitation 2003; 84 (8): 1237-9.

87. Davison, J. N., et al. Treatment for Displaced Intracapsular Fracture of the Proximal Femur. A Prospective, Randomised Trial in Patients Aged 65 to 79 Years." Journal of Bone & Joint Surgery - British Volume 2001;83(2): 206-12.

99

88. Desjardins, A. L., et al. Unstable Intertrochanteric Fracture of the Femur. A Prospective Randomised Study Comparing Anatomical Reduction and Medial Displacement Osteotomy. Journal of Bone & Joint Surgery - British Volume 1993; 75(3): 445-7.

89. Di L et al. Low back pain after unstable extracapsular hip fractures: randomised control trial on a

specific training. Europa Medicophysica 2007;43(3): 349-57.

90. Dujardin FH, et al. Prospective Randomized Comparison between a Dynamic Hip Screw and a Mini-Invasive Static Nail in Fractures of the Trochanteric Area: Preliminary Results. Journal of orthopaedic trauma 2001;15(6): 401-6.

91. Efstathopoulos NE, Nikolaou VS and Lazarettos JT. Intramedullary Fixation of Intertrochanteric Hip Fractures: A Comparison of Two Implant Designs. International orthopaedics 2007; 31(1): 71-6.

92. El-Abed K, et al. Comparison of Outcomes Following Uncemented Hemiarthroplasty and Dynamic Hip Screw in the Treatment of Displaced Subcapital Hip Fractures in Patients Aged Greater than 70 Years. Acta Orthopaedica Belgica 2005; 71(1): 48-54.

93. Espaulella J, et al. Nutritional Supplementation of Elderly Hip Fracture Patients. A Randomized, Double-Blind, Placebo-Controlled Trial. Age & Ageing 2000; 29 (5): 425-31.

94. Foss NB, et al. Effect of Postoperative Epidural Analgesia on Rehabilitation and Pain After Hip

Fracture Surgery: A Randomized, Double-Blind, Placebo-Controlled Trial. Anesthesiology 2005; 102(6): 1197-204.

95. Hagsten B, Svensson O and Gardulf A. Early Individualized Postoperative Occupational Therapy Training in 100 Patients Improves ADL After Hip Fracture: A Randomized Trial. Acta Orthopaedica Scandinavica 2004; 75(2): 177-83.

96. Hagsten B. Health-Related Quality of Life and Self-Reported Ability Concerning ADL and IADL

After Hip Fracture: A Randomized Trial. Acta Orthopaedica 2006; 77(1): 114-9. 97. Hallan G, et al. Palamed G Compared with Palacos R with Gentamicin in Charnley Total Hip

Replacement. A Randomised, Radiostereometric Study of 60 HIPS. Journal of Bone & Joint Surgery - British Volume 2006; 88(9): 1143-8.

98. Hardy DC, et al. Use of an Intramedullary Hip-Screw Compared with a Compression Hip-Screw with a Plate for Intertrochanteric Femoral Fractures. A Prospective, Randomized Study of One Hundred Patients. J Bone Joint Surg - American Volume 1998; 80(5): 618-30.

99. Hardy D, et al. Slotted intramedullary hip screw nails reduce proximal mechanical unloading. Clin Orthop 2003; 406: 176-184.

100. Harrington P et al. Intramedullary Hip Screw Versus Sliding Hip Screw for Unstable

Intertrochanteric Femoral Fractures in the Elderly. Injury 2002; 33(1): 23-8.

101. Hauer K et al. Intensive Physical Training in Geriatric Patients After Severe Falls and Hip Surgery. Age & Ageing 2002; 31(1): 49-57.

100

102. Hedstrom M., et al. Positive Effects of Anabolic Steroids, Vitamin D and Calcium on Muscle Mass, Bone Mineral Density and Clinical Function After a Hip Fracture. A Randomised Study of 63 Women. J Bone & Joint Surg - British Volume 2002; 84(4): 497-503.

103. Huang TT, Liang SH. A Randomized Clinical Trial of the Effectiveness of a Discharge Planning

Intervention in Hospitalized Elders with Hip Fracture due to Falling. Journal of clinical nursing 2005; 14(10): 1193-201.

104. Huusko TM, et al. Intensive Geriatric Rehabilitation of Hip Fracture Patients: A Randomized,

Controlled Trial. Acta Orthopaedica Scandinavica 2002; 73(4): 425-31.

105. Janzing HM, et al. The Gotfried Percutaneous Compression Plate Versus the Dynamic Hip Screw in the Treatment of Pertrochanteric Hip Fractures: Minimal Invasive Treatment Reduces Operative Time and Postoperative Pain. Journal of Trauma-Injury Infection & Critical Care 2002; 52(2): 293-8.

106. Johansson T et al. Internal Fixation Versus Total Hip Arthroplasty in the Treatment of Displaced

Femoral Neck Fractures: A Prospective Randomized Study of 100 Hips. Acta Orthopaedica Scandinavica 2000; 71(6): 597-602.

107. Johansson T, et al. The Total Costs of a Displaced Femoral Neck Fracture: Comparison of Internal

Fixation and Total Hip Replacement. A Randomised Study of 146 Hips. International orthopaedics 2006; 30(1): 1-6.

108. Jonsson B et al. Social Function After Cervical Hip Fracture. A Comparison of Hook-Pins and Total Hip Replacement in 47 Patients. Acta Orthopaedica Scandinavica 1996;67(5): 431-4.

109. Karn NK, et al. Comparison between External Fixation and Sliding Hip Screw in the Management of Trochanteric Fracture of the Femur in Nepal. The Journal of bone and joint surgery. British volume 2006; 88(10): 1347-50.

110. Keating JF, et al. Displaced Intracapsular Hip Fractures in Fit, Older People: A Randomised

Comparison of Reduction and Fixation, Bipolar Hemiarthroplasty and Total Hip Arthroplasty. Health technology assessment (Winchester, England) 2005; 9(41): iii,iv, ix-x, 1-65.

111. Keating JF. Randomized Comparison of Reduction and Fixation, Bipolar Hemiarthroplasty, and

Total Hip Arthroplasty. Treatment of Displaced Intracapsular Hip Fractures in Healthy Older Patients. Journal of Bone & Joint Surgery - American Volume 2006; 88(2): 249-60.

112. Kim SY, Kim YG, and Hwang JK. Cementless Calcar-Replacement Hemiarthroplasty Compared

with Intramedullary Fixation of Unstable Intertrochanteric Fractures. A Prospective, Randomized Study. Journal of Bone & Joint Surgery - American Volume 2005; 87(10): 2186-92.

113. Krichbaum, K. GAPN post-acute care coordination improves hip fracture outcomes. Western

Journal of Nursing Research. 2007; 29(5):523-44. 114. Kuisma RA. Randomized, Controlled Comparison of Home Versus Institutional Rehabilitation of

Patients with Hip Fracture. Clinical rehabilitation 2002; 16 (5): 553-61. 115. Lamb, S. E., et al. Neuromuscular Stimulation of the Quadriceps Muscle After Hip Fracture: A

Randomized Controlled Trial. Archives of Physical Medicine & Rehabilitation 2002; 83(8): 1087-92.

116. Lauridsen UB, et al. Intensive physical therapy after hip fracture: a randomized clinical trial. Dan Med Bull 2002; 49: 70-2.

101

117. Mangione et al. Can elderly patients who have had a hip fracture perform moderate- to high-intensity exercise at home? Phys Ther 2005; 85 (8): 727-739.

118. Mattsson P, et al. Resorbable Cement for the Augmentation of Internally-Fixed Unstable

Trochanteric Fractures. A Prospective, Randomised Multicentre Study. Journal of Bone & Joint Surgery - British Volume 2005; 87(9): 1203-9.

119. Mattsson P, and Larsson S. Calcium Phosphate Cement for Augmentation did Not Improve

Results After Internal Fixation of Displaced Femoral Neck Fractures: A Randomized Study of 118 Patients. Acta Orthopaedica 2006; 77(2): 251-6.

120. Miedel R, et al. The Standard Gamma Nail or the Medoff Sliding Plate for Unstable Trochanteric

and Subtrochanteric Fractures. A Randomised, Controlled Trial. Journal of Bone & Joint Surgery - British Volume 2005; 87(1): 68-75.

121. Moroni A, et al. HA-Coated Screws Decrease the Incidence of Fixation Failure in Osteoporotic Trochanteric Fractures. Clinical Orthopaedics & Related Research. 2004;425: 87-92.

122. Naglie G, et al. Interdisciplinary Inpatient Care for Elderly People with Hip Fracture: A Randomized Controlled Trial. Canadian Medical Association Journal 2002; 167(1): 25-32.

123. Neumann, M., et al. Provision of High-Protein Supplement for Patients Recovering from Hip

Fracture. Nutrition 2004; 20(5): 415-9.

124. Oldmeadow LB, et al. No Rest for the Wounded: Early Ambulation After Hip Surgery Accelerates Recovery. ANZ Journal of Surgery 2006;76(7): 607-11.

125. Oude Voshaar RC, et al. Fear of Falling More Important than Pain and Depression for Functional Recovery After Surgery for Hip Fracture in Older People." Psychological medicine 2006; 36(11): 1635-45.

126. Ovesen S et al. The trochanteric gamma nail versus the dynamic hip screw: A prospective randomised study. One-year follow-up of 146 intertrochanteric fractures. Hip International 2004; 16(4): 293-298.

127. Pajarinen, J., et al. Pertrochanteric Femoral Fractures Treated with a Dynamic Hip Screw Or a

Proximal Femoral Nail. A Randomised Study Comparing Post-Operative Rehabilitation. Journal of Bone & Joint Surgery - British Volume 2005; 87(1): 76-81.

128. Papasimos S, et al. A Randomised Comparison of AMBI, TGN and PFN for Treatment of Unstable Trochanteric Fractures. Archives of Orthopaedic & Trauma Surgery 2005;125(7): 462-8.

129. Parker MJ, and Pryor GA. Internal Fixation Or Arthroplasty for Displaced Cervical Hip Fractures in the Elderly: A Randomised Controlled Trial of 208 Patients. Acta Orthopaedica Scandinavica 2000; 71 (5): 440-6.

130. Parker, M. J., et al. Hemiarthroplasty Versus Internal Fixation for Displaced Intracapsular Hip

Fractures in the Elderly. A Randomised Trial of 455 Patients. Journal of Bone & Joint Surgery - British Volume 2002; 84(8): 1150-5.

102

131. Peterson, MG et al. High-intensity exercise training following hip fracture. Topics in Geriatric Rehabilitation 2004; 20(4): 273-284.

132. Peyser A, et al. A Prospective, Randomised Study Comparing the Percutaneous Compression Plate and the Compression Hip Screw for the Treatment of Intertrochanteric Fractures of the Hip. Journal of Bone & Joint Surgery - British Volume 2007; 89(9): 1210-7.

133. Raia, F. J., et al. Unipolar Or Bipolar Hemiarthroplasty for Femoral Neck Fractures in the Elderly?

Clinical orthopaedics and related research 2003; 414: 259-65.

134. Ravikumar KJ, and Marsh G. Internal Fixation Versus Hemiarthroplasty Versus Total Hip Arthroplasty for Displaced Subcapital Fractures of Femur--13 Year Results of a Prospective Randomised Study. Injury 2000; 31(10): 793-7.

135. Resnick, B. et al. Testing the Effectiveness of the Exercise Plus Program in Older Women Post-Hip Fracture Ann Behav Medicine 2007; 34 (1): 67-76.

136. Rogmark C, et al. A Prospective Randomised Trial of Internal Fixation Versus Arthroplasty for

Displaced Fractures of the Neck of the Femur. Functional Outcome for 450 Patients at Two Years. Journal of Bone & Joint Surgery - British Volume 2002; 84(2): 183-8.

137. Santini S et al., Hip fractures in elderly patients treated with bipolar hemiarthroplasty: Comparison between cemented and cementless implants. Journal of Orthopaedics and Traumatology 2005;6(2): 80-87.

138. Sherrington C, Lord SR, and Herbert RD. A Randomised Trial of Weight-Bearing Versus Non-Weight-Bearing Exercise for Improving Physical Ability in Inpatients After Hip Fracture. Australian Journal of Physiotherapy 2003; 49(1): 15-22.

139. Sherrington C, Lord SR, and Herbert RD. A Randomized Controlled Trial of Weight-Bearing Versus Non-Weight-Bearing Exercise for Improving Physical Ability After Usual Care for Hip Fracture. Archives of Physical Medicine & Rehabilitation 2004; 85(5): 710-6.

140. Shyu YI, et al. A Pilot Investigation of the Short-Term Effects of an Interdisciplinary Intervention Program on Elderly Patients with Hip Fracture in Taiwan. Journal of the American Geriatrics Society 2005; 53(5): 811-8.

141. Sikorski JM, and Barrington R. Internal Fixation Versus Hemiarthroplasty for the Displaced Subcapital Fracture of the Femur. A Prospective Randomised Study. Journal of Bone & Joint Surgery - British Volume 1981; 63(B3): 357-61.

142. Soreide O et al. Internal fixation versus endoprosthesis in the treatment of femoral neck fractures in the elderly. A prospective analysis of the comparative costs and the consumption of hospital resources. Acta Orthop Scand. 1980;51: 827-31.

143. Stenvall M, et al. Improved Performance in Activities of Daily Living and Mobility After a Multidisciplinary Postoperative Rehabilitation in Older People with Femoral Neck Fracture: A Randomized Controlled Trial with 1-Year Follow-Up. Journal of rehabilitation medicine: official journal of the UEMS European Board of Physical and Rehabilitation Medicine 2007; 39(3): 232-8.

103

144. Tidermark J, et al. Internal Fixation Compared with Total Hip Replacement for Displaced Femoral

Neck Fractures in the Elderly. A Randomised, Controlled Trial. Journal of Bone & Joint Surgery - British Volume 2003; 85(3): 380-8.

145. Tidermark J, et al. Effects of Protein-Rich Supplementation and Nandrolone in Lean Elderly

Women with Femoral Neck Fractures. Clinical Nutrition 2004; 23(4): 587-96. 146. Tinetti ME, et al. Home-based multicomponent rehabilitation program for older persons after hip

fracture: a randomized trial. Arch Phys Med Rehabil 1999; 80 (8): 916-922.

147. Tsauo JY, et al. Effects on Function and Quality of Life of Postoperative Home-Based Physical Therapy for Patients with Hip Fracture. Archives of Physical Medicine and Rehabilitation 2005; 86(10): 1953-7.

148. Utrilla, A. L., et al. Trochanteric Gamma Nail and Compression Hip Screw for Trochanteric Fractures: A Randomized, Prospective, Comparative Study in 210 Elderly Patients with a New Design of the Gamma Nail. Journal of orthopaedic trauma 2005; 19 (4): 229-33.

149. Van der Lely A, et al. Use of Human GH in Elderly Patients with Accidental Hip Fracture.

European Journal of Endocrinology 2000; 143: 585-592. 150. van Dortmont LM, et al. Cannulated Screws Versus Hemiarthroplasty for Displaced Intracapsular

Femoral Neck Fractures in Demented Patients. Annales Chirurgiae et Gynaecologiae 2000; 89(2): 132-7.

151. van Vugt, A B, Oosterwijk WM, and Goris RJ. Osteosynthesis vs endoprosthesis in the treatment of unstable intracapsular hip fractures in the elderly. A randomized clinical trial. Archives of Orthopaedic & Trauma Surgery 1993;113(1): 39-45.

152. Vidan M, et al. Efficacy of a Comprehensive Geriatric Intervention in Older Patients Hospitalized for Hip Fracture: A Randomized, Controlled Trial. Journal of the American Geriatrics Society 2005; 53(9): 1476-82.

153. Vidyadhara S. & Rao SK. One and two femoral neck screws with intramedullary nails for unstable trochanteric fractures of femur in the elderly--randomised clinical trial. Injury 2007; 38 (7):806-14.

154. Harris WH. Traumatic arthritis of the hip after dislocation and acetabular fractures: treatment by mold arthroplasty. JBJS 1969; 51-A (4): 737-755.

155. Parker MJ, Palmer CR. A new mobility score for predicting mortality after hip fractures. J Bone

Joint Surg Br 1993; 75B: 797-798.

156. Euroqol group. Euroqol—a new facility for the measurement of health-related quality of life. Health policy 1990, 16: 199-208.

157. Ware J, Sherbourne CD. The MOS 36-item Short-Form Health Survey (SF-36): Conceptual

framework and item selection. 1992: 30 (6): 473-483.

158. Charnley J. The Long-Term Results Of Low-Friction Arthroplasty Of The Hip Performed As A Primary Intervention J Bone Joint Surg 1972; 54—B (1): 61-76

104

159. Dawson J, Fitzpatrick R, Carr A. Questionnaire On The Perceptions Of Patients About Total Hip

Replacement. J Bone Joint Surg 1996; 78-B (2): 185-190.

160. Johanson NA, Charlson ME, Szatrowski TP. A self-administered hip-rating questionnaire for the assessment of outcome after total hip replacement. J Bone Joint Surg 1992; 74-A: 587-97.

161. Hunt SM, McEwen J, McKanna SP. Measuring health status. London, Croom Helm, 1986

162. World Health Organization Quality of Life (WHOQOL). Study protocol for the World Health

Organization project to develop a Quality of life assessment instrument (WHOQOL). Quality of life research, 1993, 2: 153-9.

163. Karanicolas PJ, Bhandari M, Taromi B. Blinding of Outcomes in Trials of Orthopaedic Trauma:

An Opportunity to Enhance the Validity of Clinical Trials. The Journal of Bone and Joint Surgery (American). 2008; 90:1026-1033.

164. Bhandari M, Morrow F, Kulkarni AV, Meta-Analyses in Orthopaedic Surgery: A Systematic

Review of Their Methodologies The Journal of Bone and Joint Surgery 2001; 83:15.

165. Haynes RB, Strauss SE. Evidence-based medicine: how to practice and teach EBM. Edinburgh, New York: Elsevier/Churchill Livingstone; 2005.

166. Sackett DL, Cook DJ. Can we learn anything from small trials? Ann NY Acad Sci 1993; 703: 25-

31.

167. Begg CB, Cho MK, Eastwood S et al. Improving the quality of reporting of randomized controlled trials: the CONSORT statement. JAMA 1996; 276: 637-639.

168. Montori VM, Kleinbart J, Newman TB, et al. Tips for learners of evidence-based medicine 2.

Measures of precision (confidence intervals). CMAJ 2004; 171: 611-5.

169. Auger C, Demers L, Swaine B. Making sense of pragmatic criteria for the selection of geriatric rehabilitation measurement tools. Arch Gerontol Ger 2006; 43: 65-83.

170. Andresen EM. Criteria for assessing the tools of disability outcomes research. Arch Phys Med

Rehabil 2000; 81, S15-S20.

171. Reuben DB. An objective measure of physical function of elderly outpatients. The physical performance test. J American geriatrics society. 1990; 38 (10): 1105-12.

172. Podsiadlo D. The timed “up and go” test: a test of basic functional mobility for frail elderly

persons. J American Geriatrics Society. 1991; 39: 142-148. 173. Mathias S, Nayak USL, Isaacs B. Balance in the elderly patient: the get-up and go test. Arch Phys

Med Rehabil 1986; 67: 387.

174. Guyatt GH, Sullivan MJ, Thompson PJ et al. The 6-minute walk: a new measure of exercise capacity in patients with chronic heart failure. C Med Assoc 1985;132 (8): 919-923.

175. Butland RJA, Pang J, Gross ER et al. Two-, six-, and twelve-minute walking tests in respiratory

disease. Br Med J 1982; 284: 1607-8

176. Mahoney FI, Barthel DW. Functional evaluation; the Barthel Index. Md State Med J 1965; 14: 61-5.

105

177. Shah S, Vanclay F, Cooper B. Improving the sensitivity of the Barthel Index for stroke rehabilitation. J Clin Epidemiol, 1989; 42(8):703-9

178. Older Americans Resources and Services, Duke Univesity. The OARS Multidimensional

Functional Assessment Questionnaire. Durham NC: Duke University Press; 1975, revised 1988

179. Fillenbaum GG, Smyer MA. The development, validity, and reliability of the OARS multidimensional functional assessment questionnaire. J Gerontol 1981; 36: 428-434.

180. Klein RM, Bell B. The Klein-Bell ADL scale manual. Seattle, WA, University of Washington

Medical School, Health Sciences Resource Center, 1979. 181. Klein RM, Bell B. Self-care skills: Behavioral measurement with Klein-Bell ADL scale. Arch

Phys Med Rehabil 1982; 63: 335-8. 182. Salén BA, Spangfort EV, Nygren AL et al. The disability rating index: an instrument for the

assessment of disability in clinical settings. J clin Epidemiol 1994; 47: 1423-34.

183. Jette AM. Functional Status Index. Reliability of a chronic disease evaluation instrument. Arch Phys Med Rehabil 1980; 61: 395-401.

184. Judet R, Judet J. Technique and results with acrylic femoral head prosthesis. J Bone Joint Surg,

1952; 34-B, 173-80 185. Stinchfield FE, Cooperman B, Sheaj CE. Replacement of the Femoral Head by Judet or Austin

Moore Prosthesis. J Bone Joint Surg 1957 39-A (5), 1043-58.

186. Matta J. Fractures of the Acetabulum: Accuracy of Reduction and Clinical Results in Patients Managed Operatively within Three Weeks after the Injury. Clin Orthop Relat Res 1986, 205: 241-50.

187. Salvati EA, Wilson PD Jr. Long-Term Results of Femoral-Head Replacement. J Bone Joint Surg

1973 55A: 516-524

188. Bergner M et al. The Sickness Impact Profile: development and final revision of a health status measure. Medical care, 1981, 19: 787-805.

189. Brorsson B, Ifver J, Hays RD. The Swedish Health-Related Quality of Life Survey (SWED-QAL).

Quality of life research, 1993, 2: 33-45.

190. Hunt SM, McEwen J. Measuring health status: a tool for clinicians and epidemiologists. J Royal college of general practitioners 1985 35, 185-88

191. Harwood RH, Rogers A, Dickenson E et al. Measuring handicap: The London handicap scale, a new outcome measure for chronic disease. Quality in Health Care 1994; 3: 11-16.

192. Bischoff HA, Stahelin HB, Monsch AU et al. Identifying a cut-off point for normal mobility: a comparison of the timed ‘up and go’ test in community-dwelling and institutionalized elderly women. Age and Ageing 2003; 32: 315-20

193. Fox KM, Feisenthal G, Hebel JR et al. A portable neuromuscular functionl assessment for

studying recovery from hip fracture. Arch Phys Med Rehab 1998; 77: 171-176.

194. Guralnik JM. Lower-extremity function in persons over age of 70 years as a predictor of subsequent disability. 1995; 332: 556-60.

106

195. Binder EF, Schechtman KB, Ehsani AA. Effects of exercise training on frailty in community-dwelling older adults: Results of a randomized, controlled trial. J Am Geriatr Soc 2002; 50: 1921-8.

196. Stratford PW, Kennedy D. The relationship between self-report and performance-related

measures: Questioning the content validity of timed tests Arthritis Care & Research 2003; 49 (4):535 – 540

197. Freemantle N, Calvert M, Wood J, et al. Composite outcomes in randomized trials: greater

precision but with greater uncertainty? JAMA 2003; 289: 2554-2559.

198. Thomas VS, Rockwood K, McDowell I. Multidimensionality in Instrumental and Basic Activities of Daily Living. J Clin Epidemiol 1998; 51(4): 315-321.

199. Coucill W, Bryan S, Bentham P et al. EQ-5D in patients with dementia: an investigation of inter-

rater agreement. Med Care 2001; 39: 760-71

200. Nagano K. Appropriate outcome measures for evaluating change in activities of daily living of elderly residents. Nippon Koshu Eisei Zasshi 2002; 49: 76-87.

201. Teresi J, Lawton P, Ory M et al. Measurement issues in chronic care populations: dementia special

care. Alzheimer’s dis Assoc Disord 1994; 8 (Suppl): S144-S183.

202. Anderson SW, Tranel D. Awareness of disease states following cerebral infarction, dementia, and

head trauma: standardized assessment. Clin neuropsychol 1989; 3: 327-339. 203. Givens JL, Sanft TB, Marcantonio ER. Functional recovery after hip fracture: the combined

effects of depressive symptoms, cognitive impairment, and delirium. J Am Geriatr Soc 2008; 1075-79.

204. Fried LP, Wallace RB. The complexity of chronic illness in the elderly: from clinic to community.

In: Wallace RB, Woolson RF, eds. The epidemiologic study of the elderly. New York: Oxford University Press, 1992: 10.

205. Bargerger-Gateau P, Alioum A, Peres K et al. The contribution of dementia to the disablement

process and modifying factors. Dement Geriatr Cogn Disord 2004; 18: 330-337.

206. Aguero Torres H, Thomas VS Winbald B et al. The impact of somatic and cognitive disorders on the functional status of the elderly. J Clin Epidemiol 2002; 55: 1007-1012.

207. Njegovan V, Man Son Hing M, Mitchell SI et al. The hierarchy of functional loss associated with

cognitive decline in older persons. J Gerontol A Biol Sci Med Sci 2001; 56: M638-M643. 208. Mortiz DJ, Kasl SV, Berkman LF. Cognitive functioning and the incidence of limitations in

activities of daily living in an elderly community sample. Am J Epidemiol 1995; 141: 41-49.

209. Diagnosis and management of dementia: a manual for memory disorders. Eds. Wilcock GK, Buck RS, Rockwood K. Oxford Medical Publications, London, UK. 1999.

210. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders

(Revised 4th edition). Washington, DC; APA, 2000.

211. Petersen RC. Mild cognitive impairment as a diagnostic entity. J Intern Med 2004; 256: 183-94.

107

212. American Society of Anesthesiologists. New classification of physical status. Anesthesiology 1963; 24:111.

213. Folstein MF. The Mini-Mental State Examination. In: Crook T. Ferris S, Bartus R, eds.

Assessment in geriatric psychopharmacology. New Canaan, Connecticut: Mark Powley, 1983: 50-51.

214. Folstein MF, Folstein SE, McHugh PR. Mini mental state: a practical method for grading the

cognitive state of patients for the clinician. J Psychiatr Res 1975; 12: 189-198.

215. Van Dortmont LMC, Oner FC, Wereldsma JCJ et al. Effect of mental state on mortality after hemi-arthroplasty for fracture of the femoral neck. Eur J Surg 1994; 160: 203-208.

216. Ostbye T, Tyas S, McDowell et al. Reported activities of daily living: agreement between elderly

subjects with and without dementia and their caregivers. Age Ageing 1997; 26: 99-106.

217. Larson EB, Kukull WA, Katzman RL. Cognitive impairment: dementia and Alzheimer’s disease. Annu Rev Public Health 1992; 13: 431-439.

218. Whitehouse PJ. Dementia: the medical perspective. In: Binstock RH, Post SG, Whitehouse PJ,

eds. Dementia and aging. Baltimore: Johns Hopkins University Press, 1992: 21.

219. Mahurin RK, DeBettignies BH, Pirozzolo FJ. Structured assessment of independent living skills: preliminary report of performance measures of functional abilities in dementia. J Gerontol Psychol Sci 1991; 46: P58-P66.

220. Lundstrom M, Edlund A, Lundstrom G et al. Reorganization of nursing and medical care to

reduce the incidence of postoperative delirium and improve rehabilitation outcome in elderly patients treated for femoral neck fractures. Scand J Caring Sci 1999; 13: 193-200.

221. Homberg S, Conradi P, Kalen R et al. Mortality after cervical hip fracture: 3002 patients followed

for 6 years. Acta Orthop Scand 1986; 57: 8-11.

222. Ions GK, Stevens J. Prediction of survival in patients with femoral neck fractures. J Bone Joint Surg Br 1987; 218: 32-41.

223. Kyo T, Takaoka K, and Ono K. Femoral neck fracture. Factors related to ambulation and

prognosis Clin Orthop 1993; 292: 215-22.

224. Mak PH, Campbell RCH, Irwine M. The ASA physical status classification: inter-observer consistency 2002; 30: 633-640.

225. Ward PH, Jay RL, Eric LF et al. Influence of health status and the timing of surgery on mortality

in hip fracture patients. Am J Orthop 1997; 26 (9): 621-7. 226. Vellinga A. PhD thesis, To Know or Not to be: Development of an instrument to assess decision-

making capacity of cognitively impaired elderly patients. Vrije University Amsterdam 2006 227. Jones CA, Feeny DH. Agreement between patient and proxy responses during recovery after hip

fracture: Evidence for the FIM instrument. Arch Phy med and Rehab 2006; 87 (10): 1382-7.

228. Jette A, Haley S, Ni P et al. Adaptive short forms for outpatient rehabilitation outcome assessment. Am J Phys Med and Rehab 2008: 842-852.

108

APPENDICES

109

Appendix 1 Search strategy Ovid MEDLINE(R) In-Process & Other Non-Indexed Citations and Ovid MEDLINE(R) 1950 to Present

# Searches 1 exp hip fractures/ 2 (exp fractures, bone/ and femur$.mp.) or femoral$.mp. [mp=title, original title, abstract, name

of substance word, subject heading word] 3 exp fractures, bone/ and trochant$.mp. [mp=title, original title, abstract, name of substance

word, subject heading word] 4 exp fractures, bone/ and intertrochant$.mp. [mp=title, original title, abstract, name of substance

word, subject heading word] 5 exp fractures, bone/ and subtrochant$.mp. [mp=title, original title, abstract, name of substance

word, subject heading word] 6 exp fractures, bone/ and intracapsul$.mp. [mp=title, original title, abstract, name of substance

word, subject heading word] 7 exp fractures, bone/ and pertrochant$.mp. [mp=title, original title, abstract, name of substance

word, subject heading word] 8 exp fractures, bone/ and extracapsul$.mp. [mp=title, original title, abstract, name of substance

word, subject heading word] 9 exp fractures, bone/ and exp hip/ 10 exp hip/ and (pin$ or screw$ or plate$ or arthroplast$ or internal fix$ or fracture fix$).mp. 11 or/1-10 12 exp Randomized Controlled Trials as Topic/ 13 exp Prospective Studies/ or exp Treatment Outcome/ 14 15 exp Clinical Trials as Topic/ 16 exp "Recovery of Function"/ 17 Range of motion.mp. or exp "Range of Motion, Articular"/ 18 exp Mobility Limitation/ 19 exp Disability Evaluation/ 20 musculoskeletal evaluation.mp. 21 exp Health Status Indicators/ 22 exp Cross-Over Studies/ 23 exp "Activities of Daily Living"/ 24 exp Diagnostic Errors/ 25 exp questionnaires/ 26 11 and (or/12-25) 27 limit 26 to (english language and ("all aged (65 and over)" or "aged (80 and over)")) 28 limit 27 to randomized controlled trial

Ovid Healthstar

# Searches 1 exp hip fractures/ 2 (exp fractures, bone/ and femur$.mp.) or femoral$.mp. [mp=title, original title, abstract, name

of substance word, subject heading word]

110

3 exp fractures, bone/ and trochant$.mp. [mp=title, original title, abstract, name of substance word, subject heading word]

4 exp fractures, bone/ and intertrochant$.mp. [mp=title, original title, abstract, name of substance word, subject heading word]

5 exp fractures, bone/ and subtrochant$.mp. [mp=title, original title, abstract, name of substance word, subject heading word]

6 exp fractures, bone/ and intracapsul$.mp. [mp=title, original title, abstract, name of substance word, subject heading word]

7 exp fractures, bone/ and pertrochant$.mp. [mp=title, original title, abstract, name of substance word, subject heading word]

8 exp fractures, bone/ and extracapsul$.mp. [mp=title, original title, abstract, name of substance word, subject heading word]

9 exp fractures, bone/ and exp hip/ 10 exp hip/ and (pin$ or screw$ or plate$ or arthroplast$ or internal fix$ or fracture fix$).mp. 11 or/1-10 11 exp Treatment Outcome/ 12 exp Prospective Studies/ or 14 exp Double-Blind Method/ or double blind.mp. 15 16 exp "Recovery of Function"/ 17 Range of motion.mp. or exp "Range of Motion, Articular"/ 18 exp Mobility Limitation/ 19 exp Disability Evaluation/ 20 musculoskeletal evaluation.mp. 21 exp Health Status Indicators/ 22 exp Cross-Over Studies/ 23 exp "Activities of Daily Living"/ 24 exp Diagnostic Errors/ 25 exp questionnaires/ 26 11 and (or/12-25) 27 limit 26 to (english language and ("all aged (65 and over)" or "aged (80 and over)")) 28 limit 27 to randomized controlled trial

CINAHL

# Searches 1 exp hip fractures/ 2 exp fractures/ and (femur$ or femoral$).mp. [mp=title, subject heading word, abstract,

instrumentation] 3 exp fractures/ and trochant$.mp. [mp=title, subject heading word, abstract, instrumentation] 4 exp fractures/ and intertrochant$.mp. 5 (exp fractures,/ and pertrochant$.mp.) or subtrochant$.mp. 6 exp fractures/ and intracapsul$.mp. 7 exp fractures/ and extracapsul$.mp. [mp=title, subject heading word, abstract, instrumentation] 8 exp fractures,/ and exp hip/

111

9 hip fracture$.mp. 10 exp hip/ and (pin$ or screw$ or plate$ or arthroplast$ or internal fix$ or fracture fix$).mp. 11 or/1-10 12 Prospective Studies/ or exp Treatment Outcome/ 13 exp Clinical Nursing Research/ 14 exp Functional Status/ 15 Recovery of Function.mp. 16 Range of Motion.mp. or exp "Range of Motion"/ 17 exp Physical Mobility/ or Mobility Limitation.mp. 18 exp Disability Evaluation/ 19 musculoskeletal evaluation.mp. 20 exp Health Status Indicators/ 21 exp "Activities of Daily Living"/ 22 21 or 17 or 12 or 20 or 15 or 14 or 18 or 13 or 16 or 19 23 22 and 11 24 limit 23 to (research and english) 25 limit 24 to (aged <65 to 79 years> or "aged <80 and over>")

EMBASE 1973-1979, 1980 to 2007 Week 38

# Searches 1 exp hip fractures/

2 exp fracture/ and ((femur$ or femor$ or trochant$ or intertrochant$ or pertorchant$ or extracap$ or intercap$).mp. or exp/hip) [mp=ti, ab, sh, hw, tn, ot, dm, mf]

3 1 or 2

4 exp Prospective Study/

5 exp Outcomes Research/ or exp Treatment Outcome/ or outcome study.mp.

6 Functional Status.mp. or exp Functional Assessment/ or exp Functional Status/ or exp Scoring System/ or exp Prognosis/ or exp Health Status/

7 Recovery of Function.mp. or exp Convalescence/

8 exp PHYSICAL MOBILITY/ or exp JOINT MOBILITY/

9 Disability Evaluation.mp. or exp Disability/

10 Activities of Daily Living.mp. or exp Daily Life Activity/

11 musculoskeletal evaluation.mp.

12 3 and (or/4-11)

13 limit 12 to (human and english language and aged <65+ years>)

14 Randomized Controlled Trial/

15 13 and 14

112

Appendix 2 The 21-Detsky scale used to assess quality in the randomized controlled clinical trials.

Randomization 1a) Were the patients assigned randomly?

Yes-1 No-0

1b) Was randomization adequately described?

Yes-2 Partly-1 No-0

1c) Was treatment group concealed to investigator?

Yes-1 No-0

Description of outcome measurement 2a) Was the description of outcome measurement adequate?

Yes-1 No-0

2b) Was the outcome measurement objective?

Yes-2 Partly-1 No-0

2c) Were the assessors blind to treatment?

Yes-1 No-0

Inclusion/Exclusion criteria 3a) Were inclusion/exclusion criteria well defined?

Yes-2 Partly-1 No-0

3b) Were the number of excluded patients and reasons for exclusion provided?

Yes-2 Partly-1 No-0

Description of treatment 4a) Was the therapy fully described for the treatment group?

Yes-2 Partly-1 No-0

4b) Was the therapy fully described for the controls?

Yes-2 Partly-1 No-0

Statistics 5a) Was the test stated and a p value given?

Yes-1 No-0

5b) Was the statistical analysis appropriate?

Yes-2 Partly-1 No-0

5c) If the trial was negative, were confidence intervals of post hoc power calculations performed?

Yes-1 No-0

5d) Was the sample size calculated before the study?

Yes-1 No-0

Total Negative trial- 21 Positive trial- 20 Percentage

113

Appendix 3 Quality assessment scores according to Detsky Author 1a 1b 1c 2a 2b 2c 3a 3b 4a 4b 5a 5b 5c 5d Total % Neg

Adams (2001) 1 2 1 1 2 1 2 2 2 2 1 2 1 0 20 95 X Ahrengart (2002)

1 1 0 1 1 1 2 2 2 2 1 2 0 0 16 80

Bach (2004) 1 1 1 1 1 1 2 2 2 2 1 2 1 1 19 90 X Bachrach-Lindstrom (2000)

1 1 0 1 1 0 2 2 2 2 1 2 - 0 15 75

Baker (2006) 1 2 1 1 1 0 2 2 2 2 1 2 - 1 18 90 Bal (2005) 0 0 1 0 1 1 0 0 1 1 0 1 - 0 6 30 Bannister (1990)

1 0 0 1 2 0 0 1 0 0 1 0 0 0 6 30

Binder (2004) 1 1 0 1 2 0 2 2 2 2 1 2 0 0 16 80 Blomfeldt (2005)

1 2 0 1 2 1 2 2 2 2 1 2 0 0 18 90

Blomfeldt (2005b)

1 2 0 1 2 0 2 2 2 2 1 2 0 0 17 85

Blomfeldt (2007)

1 2 1 1 1 1 2 2 2 2 1 2 0 1 17 85

Boonen (2002) 1 0 0 1 2 0 2 0 2 2 1 2 - 0 13 65 Bruce (2003) 0 1 0 1 1 0 2 0 2 2 1 1 0 0 11 52 X Burns (2007) 1 1 0 1 1 0 2 2 2 2 1 2 1 1 17 85 Calder (1995) 1 0 0 1 2 0 2 2 1 1 1 2 0 0 13 65 Calder (1996) 1 1 0 1 1 0 2 2 2 2 1 2 0 0 15 75 Cornell (1998) 1 2 1 1 2 1 2 2 2 2 1 2 0 0 19 90 X Crotty (2002) 1 1 1 1 1 1 2 0 2 2 1 2 1 1 17 81 X Crotty (2003) 1 2 1 1 2 0 2 2 2 2 1 2 0 0 18 90 Davison (2001) 1 0 0 0 1 0 2 0 2 2 1 2 - 0 11 55 Desjardins (1993)

1 1 0 1 2 1 2 0 2 2 1 0 0 0 13 65

Di Lorenzo (2007)

1 1 0 1 1 0 1 0 1 1 1 1 0 0 9 45

Dujardin (2001)

1 0 0 1 1 0 2 2 2 2 1 1 0 0 13 65

El-Abed (2005)

0 0 0 0 0 0 2 2 1 2 1 2 0 1 12 60

Espaulella (2000)

1 2 1 1 1 1 2 2 2 2 1 2 1 1 20 95 X

Efstathopoulos (2007)

1 2 0 1 2 0 2 2 2 2 1 1 0 1 17 85

Foss (2005) 1 2 1 1 2 1 2 2 2 2 1 2 - 1 20 95 Hagsten (2004)

1 2 1 1 1 1 2 2 2 2 1 2 - 1 19 95

Hagsten (2006)

1 2 1 1 1 1 2 2 2 2 1 2 0 0 18 90

Hallan (2006) 1 2 1 1 2 1 0 2 2 2 1 2 1 0 18 86 X Hardy (1998) 0 1 0 1 2 0 1 2 2 2 1 2 - 0 13 65 Hardy (2003) 1 0 0 1 1 0 2 2 2 2 1 2 0 0 14 70 Harrington (2002)

1 2 0 1 2 0 2 2 2 2 1 2 1 0 18 85 X

Hauer (2002) 1 1 0 1 1 1 2 2 1 1 1 2 0 0 14 70 Hedstrom (2002)

1 1 0 1 2 0 2 2 2 2 1 1 0 0 15 75

Huang (2005) 1 2 0 1 2 0 2 2 2 2 1 2 0 0 19 85 Huusko (2002) 1 2 1 1 1 1 2 2 2 2 1 2 1 1 20 95 Janzing (2002) 1 2 1 1 1 0 2 2 2 2 1 2 0 0 17 85 Johansson (2000)

1 1 0 1 1 0 2 2 2 2 1 2 0 0 15 75

Johansson (2006)

1 1 0 1 2 1 2 2 2 2 1 2 0 0 17 85

Jonsson (1996) 1 2 0 0 1 0 1 0 1 2 1 0 - 0 9 45 Karn (2006) 1 0 0 1 1 0 1 0 2 2 1 2 0 0 11 55 Keating (2005) 1 2 1 1 2 0 2 2 2 2 1 2 0 1 19 95 Keating (2006) 1 2 1 1 1 1 2 2 2 2 1 2 1 1 20 95

114

Kim (2005) 1 1 1 1 1 1 2 2 2 2 1 2 - 0 17 85 Krichbaum (2007)

1 2 1 1 1 1 1 2 2 1 1 2 0 0 16 80

Kuisma (2002) 1 2 1 1 1 0 2 0 2 2 1 2 - 1 16 80 Lamb (2002) 1 2 1 1 2 1 2 2 2 2 1 2 0 0 19 95 Lauridsen (2002)

1 0 0 0 0 0 2 2 2 2 1 2 0 0 12 57 X

Mangione (2005)

1 1 1 1 1 1 2 2 2 2 1 2 0 0 17 85

Mattson (2005)

1 2 0 1 2 1 2 2 2 2 1 2 0 0 18 90

Mattsson (2006)

1 2 0 1 1 0 2 2 2 2 1 2 0 1 17 85

Miedel (2005) 1 2 0 1 1 1 2 2 2 2 1 2 0 1 18 90 Moroni (2004) 1 1 0 1 2 0 2 0 2 2 1 2 0 0 14 70 Naglie (2002) 1 2 1 1 2 1 2 2 2 2 1 2 0 0 16 80 Neumann (2004)

1 0 1 1 2 1 2 0 2 2 1 2 0 1 16 80

Oldmeadow (2006)

1 1 0 1 1 0 2 0 2 2 1 1 0 1 13 65

Oude Voshaar (2006)

1 0 0 1 0 0 2 0 2 2 1 2 0 0 11 55

Ovesen (2006) 1 2 0 1 2 0 2 2 2 2 1 2 0 0 19 95 Pajarinen (2005)

1 1 0 1 2 0 2 2 2 2 1 2 0 0 16 80

Papasimos (2005)

1 0 0 1 1 0 2 1 2 2 1 2 0 0 13 65

Parker (2000) 1 2 0 1 1 0 2 2 2 2 1 2 1 0 17 80 Parker (2002) 1 2 1 1 2 0 2 2 2 2 1 2 0 0 18 90 Peterson (2004)

1 0 1 1 1 1 2 1 2 2 1 2 - 0 15 75

Peyser (2007) 1 2 0 1 1 0 2 2 2 2 1 2 0 1 17 85 Raia (2003) 1 1 0 1 2 0 2 2 2 2 1 2 0 0 16 80 Ravikumar (2000)

1 0 0 1 2 0 2 1 2 2 1 2 0 0 14 70

Resnick (2007) 1 0 1 1 1 1 1 2 2 2 1 2 0 0 15 75 Rogmark (2002)

1 2 0 1 2 0 2 2 1 1 1 2 0 1 16 80

Santini (2005) 0 0 0 0 1 0 2 0 1 1 0 0 - 0 5 25 Sherrington (2003)

1 2 0 1 1 0 2 2 2 2 1 2 1 1 18 86

Sherrington (2004)

1 2 1 1 1 0 2 2 2 2 1 2 1 0 18 85 X

Shyu (2005) 1 0 0 1 1 0 2 2 2 2 1 2 - 0 14 70 Sikorski (1981)

1 1 0 1 2 0 1 2 1 1 1 0 - 0 11 55

Soreide (1976) 1 2 0 0 1 0 1 0 1 2 1 0 - 0 9 45 Stenvall (2007) 1 2 1 1 1 0 2 2 2 2 1 2 1 0 18 85 Tidermark (2003)

1 1 0 1 2 0 2 2 2 2 1 2 0 0 16 80

Tidermark (2004)

1 2 1 1 2 1 2 2 2 2 1 2 0 0 19 95

Tinetti (1999) 1 0 1 1 1 1 2 2 2 2 1 2 0 1 17 85 Tsauo (2005) 1 0 0 1 1 0 0 1 2 2 1 2 0 0 11 55 Utrilla (2005) 1 1 0 1 2 0 2 2 2 2 1 2 0 0 16 80 Van der Lely (2000)

1 2 1 1 1 1 2 2 2 2 1 2 1 1 20 95 X

Van Dortmont (2000)

1 2 1 1 1 0 2 2 2 2 1 2 - 0 17 80 X

Van Vugt (1993)

1 0 0 1 1 0 1 1 2 1 1 2 0 0 11 55

Vidan (2005) 1 0 1 1 1 1 2 2 2 1 1 2 0 1 16 76 X Vidyadhara (2007)

1 0 0 1 1 0 0 0 2 2 0 0 0 0 7 35

*b- denotes British journal

115

Appendix 4. Included trials

Adams 2001

Adams CI. Prospective randomized controlled trial of an intramedullary nail versus dynamic screw and plate for intertrochanteric fractures of the femur. J Orthop Trauma 2001; 15(6):394-400.

Ahrengart 2002

Ahrengart L. A randomized study of the compression hip screw and gamma nail in 426 fractures. Clin Orthop Relat Res 2002; 401:209-22.

Bach 2004

Bach MA. The Effects of MK-0677, an oral growth hormone secretagogue, in patients with hip fracture. J American Geriatrics Soc 2004; 52(4):516-23.

Baker 2006

Baker RP. Total Hip Arthroplasty and Hemiarthroplasty in Mobile, Independent Patients with a Displaced Intracapsular Fracture of the Femoral Neck. A Randomized, Controlled Trial. J Bone Joint Surg 2006; 88(12):2583-9.

Bal 2005

Bal, S. Ceramic-on-ceramic versus ceramic-on-polyethylene bearings in total hip arthroplasty: Results of a multicenter prospective randomized study and update of modern ceramic total hip trials in the United States. Ceramics in Orthopaedics 2005 ;( 101-8).

Bannister 1990

Bannister, G. C. The Fixation and Prognosis of Trochanteric Fractures. A Randomized Prospective Controlled Trial. Clinical Orthopaedics & Related Research 1990; 254:242-6.

Binder 2004

Binder EF. Effects of Extended Outpatient Rehabilitation After Hip Fracture: A Randomized Controlled Trial. JAMA 2004; 292(7):837-46.

Blomfeldt 2005b

Blomfeldt R. Internal Fixation Versus Hemiarthroplasty for Displaced Fractures of the Femoral Neck in Elderly Patients with Severe Cognitive Impairment. JBJS Br 2005; 87(4):523-9.

116

Blomfeldt 2007

Blomfeldt R. A Randomised Controlled Trial Comparing Bipolar Hemiarthroplasty with Total Hip Replacement for Displaced Intracapsular Fractures of the Femoral Neck in Elderly Patients. JBJS -Br 2007; 89(2):160-5.

Boonen 2002

Boonen S. Musculoskeletal Effects of the Recombinant Human IGF-I/IGF Binding Protein-3 Complex in Osteoporotic Patients with Proximal Femoral Fracture: A Double-Blind, Placebo-Controlled Pilot Study. J Clin Endocrinology & Metabolism 2002; 87(4):1593-9.

Bruce 2003

Bruce, D. Nutritional Supplements After Hip Fracture: Poor Compliance Limits Effectiveness. Clinical Nutrition 2003; 22(5):497-500.

Burns 2007

Treatment and Prevention of Depression After Surgery for Hip Fracture in Older People: Randomized, Controlled Trials. J American Ger Soc 2007; 55(1):75-80.

Calder 1996

Calder SJ. Unipolar Or Bipolar Prosthesis for Displaced Intracapsular Hip Fracture in Octogenarians: A Randomised Prospective Study. JBJS Br 1996; 78(3):391-4.

Cornell 1998

Cornell CN. Unipolar Versus Bipolar Hemiarthroplasty for the Treatment of Femoral Neck Fractures in the Elderly. Clin Orthop Relat Res 1998; 348:67-71.

Crotty 2002

Crotty M. Early Discharge and Home Rehabilitation After Hip Fracture Achieves Functional Improvements: A Randomized Controlled Trial. Clin Rehab 2002; 16(4):406-13.

Crotty 2003

Crotty M. Patient and Caregiver Outcomes 12 Months After Home-Based Therapy for Hip Fracture: A Randomized Controlled Trial. Archives of Physical Medicine and Rehabilitation 2003; 84(8):1237-9.

117

Davison 2001

Davison, J. N. Treatment for Displaced Intracapsular Fracture of the Proximal Femur. A Prospective, Randomised Trial in Patients Aged 65 to 79 Years. Journal of Bone & Joint Surgery - British Volume 2001; 83(2):206-12.

Calder, S. J. A Subjective Health Indicator for Follow-Up. A Randomised Trial After Treatment of Displaced Intracapsular Hip Fractures. Journal of Bone & Joint Surgery - British Volume 1995; 77(3):494-6.

Desjardins 1993

Desjardins, A. L. Unstable Intertrochanteric Fracture of the Femur. A Prospective Randomised Study Comparing Anatomical Reduction and Medial Displacement Osteotomy. Journal of Bone & Joint Surgery - British Volume 1993; 75(3):445-7.

Di 2007

Di L. Low back pain after unstable extracapsular hip fractures: randomised control trial on a specific training. Europa Medicophysica 2007; 43(3):349-57.

Dujardin 2001

Dujardin, F. H. Prospective Randomized Comparison between a Dynamic Hip Screw and a Mini-Invasive Static Nail in Fractures of the Trochanteric Area: Preliminary Results. Journal of orthopaedic trauma 2001; 15(6):401-6.

Efstathopoulos 2007

Efstathopoulos NE, Nikolaou V. S, Lazarettos J. T. Intramedullary Fixation of Intertrochanteric Hip Fractures: A Comparison of Two Implant Designs. International orthopaedics 2007; 31(1):71-6.

El-Abed 2005

El-Abed, K. Comparison of Outcomes Following Uncemented Hemiarthroplasty and Dynamic Hip Screw in the Treatment of Displaced Subcapital Hip Fractures in Patients Aged Greater than 70 Years. Acta Orthopaedica Belgica 2005; 71(1):48-54.

Espaulella 2000

Espaullella J. Nutritional Supplementation of Elderly Hip Fracture Patients. A Randomized, Double-Blind, Placebo-Controlled Trial. Age & Ageing 2000; 29(5):425-31.

118

Foss 2005

Foss NB. Effect of Postoperative Epidural Analgesia on Rehabilitation and Pain After Hip Fracture Surgery: A Randomized, Double-Blind, Placebo-Controlled Trial. Anesthesiology 2005; 102(6):1197-204.

Hagsten 2004

Hagsten, B., Svensson O., and Gardulf A. Early Individualized Postoperative Occupational Therapy Training in 100 Patients Improves ADL After Hip Fracture: A Randomized Trial. Acta Orthopaedica Scandinavica 2004; 75(2):177-83.

Hagsten 2006

Hagsten B. Health-Related Quality of Life and Self-Reported Ability Concerning ADL and IADL After Hip Fracture: A Randomized Trial. Acta Orthopaedica 2006; 77(1):114-9.

Hallan 2006

Hallan G. Palamed G Compared with Palacos R with Gentamicin in Charnley Total Hip Replacement. A Randomised, Radiostereometric Study of 60 HIPS. Journal of Bone & Joint Surgery - British Volume 2006; 88(9):1143-8.

Hardy 1998

Hardy, D. C. Use of an Intramedullary Hip-Screw Compared with a Compression Hip-Screw with a Plate for Intertrochanteric Femoral Fractures. A Prospective, Randomized Study of One Hundred Patients. Journal of Bone & Joint Surgery - American Volume 1998; 80(5):618-30.

Hardy 2003

Hardy D. Slotted intramedullary hip screw nails reduce proximal mechanical unloading. Clin Orthop 2003; 406:176-184.

Harrington 2002

Harrington P. Intramedullary Hip Screw Versus Sliding Hip Screw for Unstable Intertrochanteric Femoral Fractures in the Elderly. Injury 2002; 33(1):23-8.

Hauer 2002

Hauer, K. Intensive Physical Training in Geriatric Patients After Severe Falls and Hip Surgery. Age & Ageing 2002; 31(1):49-57.

119

Hedstrom 2002

Hedstrom M. Positive Effects of Anabolic Steroids, Vitamin D and Calcium on Muscle Mass, Bone Mineral Density and Clinical Function After a Hip Fracture. A Randomised Study of 63 Women. Journal of Bone & Joint Surgery - British Volume 2002; 497-503.

Huang 2005

Huang TT, Liang SH. A Randomized Clinical Trial of the Effectiveness of a Discharge Planning Intervention in Hospitalized Elders with Hip Fracture due to Falling. J of clin nursing 2005; 14(10):1193-201.

Huusko 2002

Huusko TM. Intensive Geriatric Rehabilitation of Hip Fracture Patients: A Randomized, Controlled Trial. Acta Orthopaedica Scandinavica 2002; 73(4):425-31.

Huusko TM, Karppi P, Avikainen V et al. Randomised, clinically controlled trial of intensive geriatric rehabilitation in patients with hip fracture: Subgroup analysis of patients with dementia. BMJ 2000; 321: 1107-1111.

Janzing 2002

Janzing HM. The Gotfried PerCutaneous Compression Plate Versus the Dynamic Hip Screw in the Treatment of Pertrochanteric Hip Fractures: Minimal Invasive Treatment Reduces Operative Time and Postoperative Pain. Journal of Trauma-Injury Infection & Critical Care 2002; 52(2):293-8.

Johansson 2006

Johansson T. The Total Costs of a Displaced Femoral Neck Fracture: Comparison of Internal Fixation and Total Hip Replacement. A Randomised Study of 146 Hips. International orthopaedics 2006; 30(1):1-6.

Bachrach-Lindstrom, M. Nutritional status and functional capacity after femoral neck fractures: a prospective randomized one-year follow-up study. Aging-Clinical & Experimental Res 2000; 12(5):366-74.

Johansson T, et al. Internal fixation versus total hip arthroplasty in the treatment of displaced femoral neck fractures: a prospective randomized study of 100 hips. Acta Orthopaedica Scandinavica 2000; 71 (6): 597-602

Jonsson 1996

Jonsson, B. Social Function After Cervical Hip Fracture. A Comparison of Hook-Pins and Total Hip Replacement in 47 Patients. Acta Orthopaedica Scandinavica 1996; 67(5):431-4.

120

Karn 2006

Karn, N. K. Comparison between External Fixation and Sliding Hip Screw in the Management of Trochanteric Fracture of the Femur in Nepal. The Journal of bone and joint surgery. British volume 2006; 88(10):1347-50.

Keating 2006

Keating JF et al. Randomized Comparison of Reduction and Fixation, Bipolar Hemiarthroplasty, and Total Hip Arthroplasty. Treatment of Displaced Intracapsular Hip Fractures in Healthy Older Patients. Journal of Bone & Joint Surgery - American Volume 2006; 88(2):249-60. Keating JF et al. Displaced intracapsular hip farctures in fit, older people: a randomized comparison of reduction and fixation, bipoar hemiarthroplasty and total hip arthroplasty. Health technology assessment (Winchester, England) 2005; 9(41): 1-65

Kim 2005

Kim S Y, Kim Y G, and Hwang JK.. Cementless Calcar-Replacement Hemiarthroplasty Compared with Intramedullary Fixation of Unstable Intertrochanteric Fractures. A Prospective, Randomized Study. J of Bone & Joint Surgery - American Volume 2005; 87(10):2183-92.

Krichbaum 2007

Krichbaum, K. GAPN postacute care coordination improves hip fracture outcomes. Western Journal of Nursing Research 2007; 29(5):523-44.

Kuisma 2002

Kuisma, R. A Randomized, Controlled Comparison of Home Versus Institutional Rehabilitation of Patients with Hip Fracture. Clinical rehabilitation 2002; 16(5):553-61.

Lamb 2002

Lamb SE. Neuromuscular Stimulation of the Quadriceps Muscle After Hip Fracture: A Randomized Controlled Trial. Archives of Physical Medicine & Rehabilitation 2002; 83(8):1087-92.

Lauridsen 2002

Lauridsen UB. Intensive physical therapy after hip fracture: a randomized clinical trial. Dan Med Bull 2000; 49:70-2.

121

Mangione 2005

Can elderly patients who have had a hip fracture perform moderate- to high-intensity exercise at home. Phys.Ther 2005; 85(8):727-39.

Mattsson 2005

Mattsson, P. Resorbable Cement for the Augmentation of Internally-Fixed Unstable Trochanteric Fractures. A Prospective, Randomised Multicentre Study. Journal of Bone & Joint Surgery - British Volume 2005; 87(9):1203-9.

Mattsson 2006

Mattsson P, and Larsson S. Calcium Phosphate Cement for Augmentation did Not Improve Results after Internal Fixation of Displaced Femoral Neck Fractures: A Randomized Study of 118 Patients. Acta Orthopaedica 2006; 77(2):251-6.

Miedel 2005

Miedel, R. The Standard Gamma Nail or the Medoff Sliding Plate for Unstable Trochanteric and Subtrochanteric Fractures. A Randomised, Controlled Trial. Journal of Bone & Joint Surgery - British Volume 2005; 87(1):68-75.

Moroni 2004

Moroni, A. HA-Coated Screws Decrease the Incidence of Fixation Failure in Osteoporotic Trochanteric Fractures. Clinical Orthopaedics & Related Research 2004; 425:87-92.

Naglie 2002

Naglie, G. Interdisciplinary Inpatient Care for Elderly People with Hip Fracture: A Randomized Controlled Trial. Canadian Medical Association Journal 2002; 167(1):25-32.

Neumann 2004

Neumann, M. Provision of High-Protein Supplement for Patients Recovering from Hip Fracture. Nutrition 2004; 20(5):415-9.

Oldmeadow 2006

Oldmeadow, L. B. No Rest for the Wounded: Early Ambulation After Hip Surgery Accelerates Recovery. ANZ Journal of Surgery 2006; 76(7):607-11.

122

Oude Voshaar 2006

Oude Voshaar, R. C. Fear of Falling More Important than Pain and Depression for Functional Recovery After Surgery for Hip Fracture in Older People. Psychological medicine 2006; 36(11):1635-45.

Ovesen 2006

Ovesen O. The trochanteric gamma nail versus the dynamic hip screw: A prospective randomised study. One-year follow-up of 146 intertrochanteric fractures. Hip International 2006; 16(4):293-8.

Pajarinen 2005

Pajarinen, J. Pertrochanteric Femoral Fractures Treated with a Dynamic Hip Screw or a Proximal Femoral Nail. A Randomised Study Comparing Post-Operative Rehabilitation. Journal of Bone & Joint Surgery - British Volume 2005; 87(1):76-81.

Papasimos 2005

Papasimos, S. A Randomised Comparison of AMBI, TGN and PFN for Treatment of Unstable Trochanteric Fractures. Archives of Orthopaedic & Trauma Surgery 2005; 125(7):462-8.

Parker 2002

Parker MJ. Hemiarthroplasty versus Internal Fixation for Displaced Intracapsular Hip Fractures in the Elderly. A Randomised Trial of 455 Patients. Journal of Bone & Joint Surgery - British Volume 2002; 84(8):1150-5.

Parker MJ and GA Pryor. Internal fixation or arthroplasty for displaced cervical hip fractures in the elderly: A randomized controlled trial of 208 patients. Acta Orthopaedica Scandinavica 2000; 71 (5): 440-6

Peterson 2004

Peterson, MG. High-intensity exercise training following hip fracture. Topics in Geriatric Rehabilitation 2004; 20(4):273-284.

Peyser 2007

A Prospective, Randomised Study Comparing the Percutaneous Compression Plate and the Compression Hip Screw for the Treatment of Intertrochanteric Fractures of the Hip. Journal of Bone & Joint Surgery - British Volume 2007; 89(9):1210-7.

Raia 2003

Raia FJ. Unipolar or Bipolar Hemiarthroplasty for Femoral Neck Fractures in the Elderly? Clin Orthop Res 2003; 414:259-65.

123

Ravikumar 2000

Ravikumar KJ, Marsh G. Internal Fixation versus Hemiarthroplasty Versus Total Hip Arthroplasty for Displaced Subcapital Fractures of Femur--13 Year Results of a Prospective Randomised Study. Injury 2000; 31(10):793-7.

Resnick 2007

Resnick, B. Testing the Effectiveness of the Exercise Plus Program in Older Women Post-Hip Fracture. Ann Behav Medicine 2007; 34(1):67-76.

Rogmark 2002

Rogmark C. A Prospective Randomised Trial of Internal Fixation versus Arthroplasty for Displaced Fractures of the Neck of the Femur. Functional Outcome for 450 Patients at Two Years. Journal of Bone & Joint Surgery - British Volume 2002; 84(2):183-8.

Santini 2005

Santini S. Hip fractures in elderly patients treated with bipolar hemiarthroplasty: Comparison between cemented and cementless implants. Journal of Orthopaedics and Traumatology 2005; 6(2):80-7.

Sherrington 2004

Sherrington C, Lord SR, Herbert RD. A Randomized Controlled Trial of Weight-Bearing versus Non-Weight-Bearing Exercise for Improving Physical Ability after Usual Care for Hip Fracture. Archives of Physical Medicine & Rehabilitation 2004; 85(5):710-6.

Sherrington, C., Lord, S. R.and Herbert R. D. A Randomised Trial of Weight-Bearing versus Non-Weight-Bearing Exercise for Improving Physical Ability in Inpatients after Hip Fracture. Australian Journal of Physiotherapy 2003; 49(1):15-22.

Shyu 2005

Shyu YI. A Pilot Investigation of the Short-Term Effects of an Interdisciplinary Intervention Program on Elderly Patients with Hip Fracture in Taiwan. Journal of the American Geriatrics Society 2005; 53(5):811-8.

Sikorski 1981

Sikorski, J. M., and Barrington R. Internal Fixation versus Hemiarthroplasty for the Displaced Subcapital Fracture of the Femur. A Prospective Randomised Study. Journal of Bone & Joint Surgery - British Volume 1981; 63(3):357-61.

124

Soreide 1980

Soreide O. Internal fixation versus endoprosthesis in the treatment of femoral neck fractures in the elderly. A prospective analysis of the comparative costs and the consumption of hospital resources. Acta Orthop Scand 1980; 51:827-31.

Stenvall 2007

Stenvall, M. Improved Performance in Activities of Daily Living and Mobility after a Multidisciplinary Postoperative Rehabilitation in Older People with Femoral Neck Fracture: A Randomized Controlled Trial with 1-Year Follow-Up. Journal of rehabilitation medicine 2007; 232-8.

Tidermark 2003

Tidermark, J. Internal Fixation Compared with Total Hip Replacement for Displaced Femoral Neck Fractures in the Elderly. A Randomised, Controlled Trial. Journal of Bone & Joint Surgery - British Volume 2004; 23(4):587-96.

Blomfeldt R. Comparison of Internal Fixation with Total Hip Replacement for Displaced Femoral Neck Fractures. Randomized, Controlled Trial Performed at Four Years. 2005; 87(8):1680-8.

Tidermark 2004

Tidermark, J. Effects of Protein-Rich Supplementation and Nandrolone in Lean Elderly Women with Femoral Neck Fractures. Clinical Nutrition 2004; 23(4):587-96.

Tinetti 1999

Tinetti ME. Home-based multicomponent rehabilitation program for older persons after hip fracture: a randomized trial. Arch Phys Med Rehabil 1999; 80(8):916-22.

Tsauo 2005

Tsauo, J. Y., Effects on Function and Quality of Life of Postoperative Home-Based Physical Therapy for Patients with Hip Fracture. Archives of Physical Medicine and Rehabilitation 2005; 86(10):1953-7.

Utrilla 2005

Utrilla, A. L. Trochanteric Gamma Nail and Compression Hip Screw for Trochanteric Fractures: A Randomized, Prospective, Comparative Study in 210 Elderly Patients with a New Design of the Gamma Nail. Journal of orthopaedic trauma 2005; 19(4):229-33.

Van Der Lely 2000

Van der Lely, A. Use of Human GH in Elderly Patients with Accidental Hip Fracture. European Journal of Endocrinology 2000; 143:585-92.

125

Van Dortmont 2000

van Dortmont, L. M. Cannulated Screws Versus Hemiarthroplasty for Displaced Intracapsular Femoral Neck Fractures in Demented Patients. Annales Chirurgiae et Gynaecologiae 2000; 89(2):132-7.

van Vugt 1993

van Vugt, A. B., Oosterwijk, W. M and Goris R. J. Osteosynthesis vs endoprosthesis in the treatment of unstable intracapsular hip fracture in the elderly. A randomized clinical trial. Archives of Orthopaedic & Trauma Surgery 1993; 113(1):39-45.

Vidan 2005

Vidan, M. Efficacy of a Comprehensive Geriatric Intervention in Older Patients Hospitalized for Hip Fracture: A Randomized, Controlled Trial. Journal of the American Geriatrics Society 2005; 53(9):1476-82.

Vidyadhara 2007

Vidyadhara, S. and Rao, S. K. One and two femoral neck screws with intramedullary nails for unstable trochanteric fractures of femur in the elderly--randomised clinical trial. Injury 2007; 38(7):806-14.

Other studies used as reference

1. Bhandari M, Devereaux PJ, Swiontkowski MF et al. Internal fixation compared with arthroplasty for displaced fractures of the femoral neck. A meta-analysis. J Bone Joint Surg- American volume 2003; 85-A (9): 1673-81.

2. Lu-Yao GL, Keller RB, Littenberg B et al. Outcomes after displaced fractures of the femoral neck. A meta-analysis of one hundred and six published reports. J Bone Joint Surg American Volume 1994; 76 (1): 15-25

3. Parker MJ, Gurusamy K. Internal fixation versus arthroplasty for intracapsular proximal

femoral fractures in adults. 2008; Cochrane Library.

4. Parker MJ, Gurusamy K. Arthorplasties (with and without bone cement) for proximal femoral fractures in adults. Cochrane Database of Systematic Reviews 2004.

5. Parker MJ, Stockton G. Internal fixation implants for intracapsular proximal femoral fractures in adults. Cochrane Database of Systematic Reviews 2001.

6. Tidermark J. Quality of life and femoral neck fractures. Acta Orthopaedica. 2003; 74(2):1-62.

7. Handoll HH, Sherrington C. Mobilisation strategies after hip fracture surgery in adults. Cochrane Database Syst Rev. 2007 Jan 24;(1)

126

Appendix 5 Study characteristics and patient demographics meeting inclusion and exclusion criteria.

Characteristic Surgical trials, n Medical Trials, n Rehabilitation Trials, n

Number of trials 30 11 17

Sample size population 1-50 1 2 2 51-100 6 4 8 101-500 23 5 7 Inclusion Criteria No age limitations specified

5 1

Minimum age (yrs) 80 1 75 3 1 1 70 9 2 2 65 6 4 11 60 5 3 2 50 1 Maximum age (yrs) 90 1 Hip fracture type Extracapsular fractures 11 2 2 Intracapsular fractures 19 1 2 Both 1 Unspecified 7 13 Mechanism of Injury Falling 2 Residence Living inside region of hospital

1

Independent living status

4 3 3

Community dweller 4 Nursing home 1 Discharge destination Community 1 Nursing home 1 Mobility Ability to walk ≥ 0.8 km 1 Walking without support

5 1 4

New mobility score ≥ 3 1 PPT 12-28 1 Previously ambulatory 1 2 1 Mental score DSM-III-R 1 Abbrev. MMTS 1 MMTS > 6 1 MMT ≥ 7 1 MTS ≥ 5/13 1

127

MTS ≥ 6/13 1 SPMSQ 3/10 correct 2 2 MMSE unspecified 1 MMSE > 24 3 MMSE > 20 1 1 MMSE > 15 1 Lucid 1 Blessed Dementia Rating ≥ 4

1

Mental & physical capacity to participate in rehabilitation program

2

Weight Too frail for any operative treatment

1

BMI ≤ 24 kg2 1 Other Home environment suitable for rehab

2

Completion of PT 1 Needed a formal rehab program

2

Exclusion criteria Hepatic dysfunction 2 Renal dysfunction 2 3 1 Uncontrolled diabetes mellitus

1

Abnormal thyroid function

1

Systolic BP > 200 or diastolic BP > 100

1

Malignancy 4 2 Stroke 1 Parkinson’s 1 1 1 Clotting disorder 2 Rheumatoid arthritis 11 3 Osteoarthritis 11 1 Paget’s disease 3 Diabetes 1 Uncontrolled hypertension

3

Cardiac heart failure 3 1 Hearing impairment 1 1 Visual impairment 1 ASA score V 2 Major medical conditions

2 2

Co-morbidities 1 Taking salmon calcitonin therapy

1

Bone-active therapy 1 Corticosteroid treatment 3 2 Needing intravenous 1

128

nutrition Non-ambulatory prior to fx

3 1 3

Dependence on person for ambulation

1

Inability to walk 50 feet 1 Dependency in all basic ADLs

1

Pre-fracture hospitalization

1

Long-term care facility 2 2 ICU 2 Independent or alone > 4hrs per day

1

Respiratory or cardiovascular failure preventing them to walk 50 ft

1

Cerebrovascular disease affecting mobility

1

Co-morbidity precluding ability to exercise

1

Neurosensory, neuromuscular or musculoskeletal deficiency

2 1 1

Mental or physical co-morbidities limiting walking distance to 0.8 km

1

Taking hypnotics, sedatives, muscle relaxants

1

Non-weight bearing or touch weight bearing

1

Clinical depression 1 Acute mental illness 1 Dementia 4 2 1 Advanced dementia 1 Mental instability 2 Cognitive impairment 1 1 Confusion 1 Cognitive impairment determined by first author

1

Terminal illness life expectancy < 12

1 3

Terminal illness < 6 mths

1

Terminal illness 1 Inability to provide informed consent

1

Alcohol or substance abuse

1 2

129

Smokers 1 *PPT denotes physical performance test; MMTS, Mini mental test score; SPMSQ Short portable mental status questionnaire; MMSE, mini mental status examination; BMI, Body Mass Index; PT, Physical therapy

Download - FUNCTIONAL OUTCOMES IN THE AGED WITH HIP FRACTURE: A … · iii Acknowledgements I would like to acknowledge and thank my mentor, Dr. Emil Schemitsch (Supervisor), and my Program

Top Related