measuring the educational environment in health professions studies: a systematic review

2010; 32: 947–952

BEME RAPID REVIEW

Measuring the educational environment inhealth professions studies: A systematic review

DIANTHA SOEMANTRI1, CRISTIAN HERRERA2,3 & ARNOLDO RIQUELME2,4

1Department of Medical Education, Faculty of Medicine, Universitas Indonesia, Indonesia, 2School of Medicine, PontificiaUniversidad Catolica de Chile, Chile, 3Evidence Based Medicine Unit, Pontificia Universidad Catolica de Chile, Chile,4Department of Gastroenterology, Pontificia Universidad Catolica de Chile, Chile

Abstract

Background: One of the determinants of the medical student’s behaviour is the medical school learning environment.

Aim: The aim of this research was to identify the instruments used to measure the educational environment in health professions

education and to assess their validity and reliability.

Methods: We performed an electronic search in the medical literature analysis and retrieval system online (MEDLINE) and Timelit

(Topics in medical education) databases through to October 2008. The non-electronic search (hand searching) was conducted

through reviewing the references of the retrieved studies and identifying the relevant ones. Two independent authors read, rated

and selected studies for the review according to the pre-specified criteria. The inter-rater agreement was measured with Kappa

coefficient.

Results: Seventy-nine studies were included with the Kappa coefficient of 0.79, which demonstrated a reliable process, and 31

instruments were extracted. The Dundee Ready Education Environment Measure, Postgraduate Hospital Educational Environment

Measure, Clinical Learning Environment and Supervision and Dental Student Learning Environment Survey are likely to be the

most suitable instruments for undergraduate medicine, postgraduate medicine, nursing and dental education, respectively.

Conclusions: As a valid and reliable instrument is available for each educational setting, a study to assess the educational

environment should become a part of an institution’s good educational practice. Further studies employing a wider range of

databases with more elaborated search strategies will increase the comprehensiveness of the systematic review.

Introduction

One determinant of a medical student’s behaviour is the

medical school learning environment (Genn 2001a). The

environment related to the student’s behaviour is as perceived

by the student. Genn and Harden (1986) suggested that

although the concept is rather intangible, the effects of an

educational environment are substantial, real and influential.

The climate makes a notable contribution to the prediction of

student achievement, satisfaction and success (Genn 2001b).

The relationship between the educational environment as

perceived or experienced by the students and their achieve-

ment, satisfaction and success in medical school has brought

into awareness the need to study or evaluate the medical

school’s environment. The World Federation for Medical

Education (1998) has considered the learning environment as

one of the targets for the evaluation of medical education

programmes. Based on all of the above, a study of the

environment should become an inseparable part of the study

of a medical school’s curriculum.

Genn (2001a) provides a definition for educational envi-

ronment as the manifestation of a curriculum. Rothman and

Ayoade (1970) also consider the learning environment as ‘a

manifestation of the effects on students of the various parts of

the curriculum’. Maudsley (2001) states that ‘a learning

environment exists wherever and whenever students

gather . . . ’ and it includes various factors, which contribute

to an effective education and becomes the background in

which the curriculum exists.

Pace and Stern (1958) suggest that an educational environ-

ment is characterized by the pressures, stresses, practices,

policies, rewards and values within the classroom or school of

which they refer to as ‘environmental press’. Furthermore, one

of the 10 questions to be asked when planning a course or

curriculum is ‘what educational environment or climate should

be fostered?’ (Harden 1986). The course or curriculum planner

Practice points

. Measuring the educational environment should become

a part of educational institutions’ good practices.

. Each educational context or setting demands different

aspects of environment to be measured.

. Various instruments to measure educational environ-

ment in different health professions education settings

are available.

. Some instruments have proven their psychometric

values.

Correspondence: Diantha Soemantri Department of Medical Education, Faculty of Medicine, Universitas Indonesia, Jakarta Pusat 10430, Indonesia.

Tel/Fax: 62 21 3901814; email: [email protected]; [email protected]

ISSN 0142–159X print/ISSN 1466–187X online/10/120947–6 � 2010 Informa UK Ltd. 947DOI: 10.3109/01421591003686229

Med

Tea

ch D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Maa

stri

cht o

n 06

/18/

14Fo

r pe

rson

al u

se o

nly.

should evaluate those areas that are fostered and encouraged

in the institution.

The term environment and climate are often used inter-

changeably in educational literature. Genn (2001b) defines the

term climate as the educational environment of an institution

as perceived by students. Roff and McAleer (2001) use the term

educational climate as the synonym of educational environ-

ment. In the study conducted by Rothman & Ayoade (1970),

the learning environment is defined as what the students

perceive and thus no differentiation is made between the term

environment and climate. Therefore, in this article the educa-

tional climate will be regarded as similar to educational

environment.

However, it is the students’ perceptions of the environment,

rather than the environment itself, that determines the behav-

iour (Hutchins 1961; Rothman & Ayoade 1970; Genn 2001a;

Konings et al. 2005). Students’ perceptions of the classroom

environment have been used to predict their outcomes (Fraser

et al. 1986). Students, as the ‘inhabitants’ of the (Genn 2001a),

are the ones who experience the environment and it is their

perceptions of the environment, which then becomes one of

the determinants of their behaviour.

A quantitative measurement of the educational environ-

ment requires the use of an inventory or instrument. The

selection of such an instrument should be based on the

qualities of the measurement process using a particular

instrument or the suitability of that instrument to measure

educational environment. The qualities, or often referred to as

psychometric features, usually come under two main head-

ings, which are validity and reliability. A valid and reliable

learning environment tool allows a meaningful measure of the

learning environment of an institution, and thus appropriate

measures to improve the environment can be taken. The

differences among various educational settings may require a

different educational environment inventory, which suits the

specific situation of a particular institution.

In any measurement process, it is necessary to ascertain

whether the instrument or inventory measures what it

supposes to measure; and validity deals with this specific

question. Content validity refers to the extent that the

instrument or inventory measures the intended subject

matter content (Gronlund 1976a). Criterion-related validity

refers to the extent to which the measurement results can be

used to predict a future outcome and also the extent to which

they correlate with current results (performance) obtained by

other valid measures (Gronlund 1976a). Construct validity may

be defined as the extent to which the test can be used to

measure certain psychological constructs and the results

interpreted in terms of those constructs (Gronlund 1976a).

Reliability refers to the reproducibility of the measurement

or assessment results. A result may be reliable over different

periods of time, over different raters or over different samples

of questions (Gronlund 1976b). An unreliable or inconsistent

measurement result cannot possibly permit valid interpretation

of the result. A high correlation between the scores of

individual items, in other words a high internal consistency,

would indicate that the scores measure a single construct. The

degree of reliability adequate for a particular measurement

depends on: the purpose of the measurement, the use of the

results, the importance of the decision that will be made and

ultimately the consequences resulting from the measurement.

Reliability coefficient of 0.60 is considered acceptable for

questionnaires (Nunnally 1978).

Objectives

The objectives of this systematic review were to respond to the

following two research questions:

(1) Which instruments or inventories are used to measure

the educational environment in health professions

education?

(2) How is the suitability of each instrument in measuring

the educational environment in health professions

education?

Methods

A systematic review approach is applicable in medical

education research. The purpose is similar to that of the

systematic review in medicine, which is to reflect the best

evidence available in medical education (BEME Collaboration

2006). Steps in conducting a systematic review can be

summarized as follows: formulation of a specific review

question, comprehensive search and inclusion of studies,

quality assessment of included studies and data extraction,

synthesis of study results, interpretation of results and report

writing (Pai et al. 2004). These steps should result in a

summary of evidence related to a specific research question.

Search strategy

We performed an electronic search in the MEDLINE and

Timelit (Topics in Medical Education) databases through to

October 2008. The non-electronic search (hand searching) was

conducted through reviewing the references of the retrieved

studies and identifying the relevant ones. The MEDLINE search

strategy combined the available and appropriate headings

medical subject (MeSH) which are (exp) educational mea-

surement/, (exp) education/, (exp) learning/ with the free

keywords ‘education* environment’, ‘climate’, ‘learning envi-

ronment’ and ‘education* climate’. In the Timelit database in

which the search using subject headings is not possible, a

search using the relevant keywords (‘educational climate’,

‘educational environment’, ‘education environment’ and

‘learning environment’) was conducted.

Inclusion and exclusion criteria

We included studies that were in the area of health professions

education (undergraduate or postgraduate), which measures

the educational or learning environment with a quantitative

method (employs an instrument or inventory). A study was

excluded if it was a review article, a study in non-health

professions education, did not measure the overall educational

or learning environment of the institution and employed a

qualitative research method.

D. Soemantri et al.

948

Med

Tea

ch D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Maa

stri

cht o

n 06

/18/

14Fo

r pe

rson

al u

se o

nly.

Data extraction

One author read all the titles retrieved by the searches and

eliminated the studies that were clearly of another subject.

From the resultant list, two independent authors read the

abstracts in the records to identify publications that appeared

to be eligible for this review. From the chosen abstracts, two

independent authors read the full texts, rated and selected

studies for the review according to the pre-specified (inclusion

and exclusion) criteria. Disagreements were resolved by

discussion. The inter-rater agreement was measured with

Kappa coefficient.

Data synthesis

To respond to the first question of this review, the search

strategy allowed us to identify the inventories or instruments

that were used to measure the educational environment in

health professions education.

To answer the second research question, the data regarding

the validity and reliability of the educational environment

measurement results using each particular inventory was

extracted, analyzed and synthesized. Some instruments were

used in several studies, therefore the information was gathered

from all the included studies utilizing those particular instru-

ments. Based on the available information, the most suitable

instrument or inventory to measure educational environment

in different fields of health professions education was identi-

fied. The information concerning the educational environment

measurement instrument was extracted from the included

studies: the health professions education setting in which the

instrument was developed and/or used; the general informa-

tion about the instrument, including its subscales; the validity

of the results (content validity, criterion-related validity,

construct validity) and the reliability of the results.

Results

Following the search strategy, the electronic databases

(MEDLINE 1966 to October 2008 and Timelit) yielded 951

titles. Duplication of titles from the two databases were

identified and removed. As a result of the search strategy and

subsequent screening, 178 studies (1958–2008) were consid-

ered to be relevant for further review. Full papers of the studies

were obtained. Two reviewers worked independently to

review the studies and make the decision on each of them.

The reviewers disagreed on 19 studies. Therefore, the inter-

rater agreement was substantial, with 90% agreement and the

Kappa coefficient of 0.79. Kappa coefficient takes into account

the occurrence of chance agreement.

Thirty one studies were excluded because they were not

primary studies of educational environment. Those studies

were literature reviews, editorials or commentaries/letters

around the topic of educational environment. Another exclu-

sion criterion was the primary educational environment study,

which was conducted in non-health professions education

settings, such as secondary education or other subject disci-

plines in higher education level, 14 studies were excluded

following this criterion.

Eighteen studies employed qualitative research methods,

such as interviews or observation, to assess the educational

environment or certain aspects of it. Thus, those studies were

excluded. Another 36 studies were excluded because they

only measured a certain aspect of the educational environ-

ment, such as students’ stress, coping, support services,

student abuse and harassment, perceptions of teaching and

course experience. The reviewers considered those studies to

have too narrow coverage of educational environment, and

therefore could not be regarded as measuring the full

educational environment.

Following the flowchart, 79 studies from 1961 to 2008 were

included (Figure 1). These were primary studies, which

measured the educational environment in health professions

education using specific instruments. The area of health

professions education included undergraduate and postgrad-

uate medicine, nursing and dental education.

Thirty one instruments were identified from the 79 included

studies. Some instruments were developed specifically to be

used in particular settings, while others were adapted from

another instrument. Ten instruments were used in undergrad-

uate medical education, seven instruments for postgraduate

medical education, eight and four instruments were used in

nursing and dental education, respectively. One instrument

was used for both undergraduate and postgraduate medical

education and one instrument was used in all areas (under-

graduate and postgraduate medicine, nursing, dental, chiro-

practic education). Table 1 summarizes the identified

instruments along with the settings in which the instruments

were used.

Data regarding the use of the identified instruments in each

article were extracted and synthesized. They include the

psychometric properties presented in the articles as the results

of the use of the instruments in the educational environment

Studies on educational or learningenvironment (178)

Review studies (31)Studies in non health

professions education (14)

Original undergraduate andpostgraduate studies (133)

Studies with qualitativemethods (18)

Studies with quantitative method (79)(Kappa coefficient=0.79)

Instruments which measure educational environment in health professions education (31)

Studies which measure onlya particular aspect of the

environment (36)

Figure 1. Disposition of 178 articles on educational/learning

environment.

Note: Solid and dash lines indicate included and excluded

studies, respectively.

BEME Rapid Review

949

Med

Tea

ch D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Maa

stri

cht o

n 06

/18/

14Fo

r pe

rson

al u

se o

nly.

measurement processes. The synthesis of data were presented

in the form of table for each area of the health profession study

(Tables 2–5, available at www.medicaltacher.org). The row

lists the educational environment measurement instrument

whereas the column lists the psychometric properties of each

instrument, which consist of content validity/content evidence,

criterion validity/relation other variables, construct validity and

reliability/internal structure.

Discussion

Specific databases which contain publications in medical

education are of limited availability. It becomes more chal-

lenging to conduct a comprehensive search because of this

issue. Relevant papers are more likely to be missed and large

numbers of false hits occur. Moreover, finding the appropriate

subject headings may become a problem because the database

(MEDLINE) is not specific to medical education and thus, the

availability of subject headings relevant to medical education is

limited. It was then necessary to combine the search with

relevant keywords (‘education environment’, ‘education cli-

mate’, ‘learning environment’ and ‘climate’) to increase the

specificity. Notwithstanding, our search strategy was likely to

mislay relevant studies due to the lack of specific search tools

for the medical education area. In that order, the use of only

two databases and a low-sensibility search strategy become

limitations of this study. This is caused by the limited resources

on the researcher’s side. For a systematic and comprehensive

search to be successfully conducted, adequate resources

(human resources, educational material, time and cost) are

essential. In order to improve the comprehensiveness of the

search, the hand searching of the references cited in the

retrieved articles was conducted. The process yielded a

substantial number of additional articles, which were not

retrieved through the database search.

The decision to include or exclude each study was based on

the criteria visualized in the flowchart (Figure 1). Although the

flowchart was clear and the reviewers understood the criteria

beforehand, some disagreements occurred inadvertently

during the process. The disagreements mostly emerged when

making the decisions on studies that only measured a certain

aspect of the educational environment; they resulted from

differences in interpretation about those particular studies.

A high percentage of agreement (90%) between the two

independent reviewers and the Kappa coefficient of 0.79 show

a fairly reliable result of the inclusion/exclusion (review)

process. The systematic search yielded a substantial number of

instruments for measuring educational environment in health

professions education. The measurement of educational envi-

ronment has received important attention from educationalists,

highlighted by the availability of various instruments to be

utilized in different educational settings.

Analysis of the validity and reliability of educationalenvironment measurement instruments

Medicine (undergraduate). There were 12 educational envi-

ronment measurement instruments identified in this study.

Table 1. Undergraduate and postgraduate health profession educational environment measurement instruments identified inthe systematic review.

No. Instrument/Inventory Setting

1 Medical school learning environment Survey – modified (MSLES) Medicine

2 Learning environment questionnaire (LEQ) Medicine

3 Questionnaire from Parry et al. Medicine

4 Veteran affairs (VA) learners’ perceptions survey Medicine (postgraduate, PG)

5 Learning environment assessment (LEA) Medicine (PG)

6 Medical school environment questionnaire (MSEQ) Medicine

7 Questionnaire from Robins et al. Medicine

8 Medical school environment inventory (MSEI) Medicine

9 Questionnaire from Rotem, Godwin and Du Medicine (PG)

10 Course valuing inventory (CVI) Medicine

11 DREEM Medicine (undergraduate and PG), nursing,

dental and chiropractic

12 Operating room educational environment measure (OREEM) Medicine (PG)

13 Instrument from Pololi and Price Medicine

14 Surgical theatre educational environment measure (STEEM) Medicine (undergraduate and PG)

15 Anaesthetic theatre educational environment measure (ATEEM) Medicine (PG)

16 Practice-based educational environment measure Medicine (PG)

17 Postgraduate hospital educational environment measure (PHEEM) Medicine (PG)

18 Questionnaire from Patel and Dauphinee Medicine

19 Learning Climate Measure from Wangsaturaka Medicine

20 Questionnaire from Orton – Modified Nursing

21 Questionnaire from Hart and Rotem Nursing

22 Clinical learning environment scale (CLE) Nursing

23 Clinical learning environment inventory (CLEI) Nursing

24 College and university classroom environment inventory (CUCEI) Nursing

25 Clinical learning environment and supervision (CLES) Nursing

26 Clinical post-conference learning environment survey (CPCLES) Nursing

27 Postgraduate research experience questionnaire (PREQ) Nursing

28 Questionnaire from Gerzina, McLean and Fairley Dentistry

29 Clinical education instructional quality (ClinEd IQ) Dentistry

30 Learning environment survey (LES) Dentistry

31 Dental student learning environment survey (DSLES) Dentistry

D. Soemantri et al.

950

Med

Tea

ch D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Maa

stri

cht o

n 06

/18/

14Fo

r pe

rson

al u

se o

nly.

The summary of the psychometric qualities of the instrument-

sare presented in Table 2, available at www.medicalteacher.

org. Several instruments (questionnaire from Parry et al.

(2002), Pololi & Price (2000), Patel and Dauphinee (1985),

Robins et al. (1997), MSEI (Hutchins 1961) and MSEQ

(Wakeford 1981, 1984)) had considerable weaknesses in

their validity and reliability. The Learning Climate Measure

was an instrument developed specifically to measure the

educational environment of Thai medical education

(Wangsaturaka 2005). Therefore, it was a nation-specific

instrument and its use was limited to that specific context.

The STEEM was originally developed for postgraduate surgical

learning setting. When applied to the undergraduate setting, it

has demonstrated some weaknesses in terms of its validity

(Nagraj et al. 2006).

The LEQ (Rothman 1970; Rothman & Ayoade 1970; Levy

et al. 1973; Kaufman & Mann 1996; Schwartz & Loten 2003,

2004a, b) MSLES (Feletti & Clarke 1981a, b; Clarke et al. 1984;

Lancaster et al. 1997) and DREEM (Roff et al. 1997, 2001;

Al-Qahtani 1999; Al-Zidgali 1999; Baozhi 2003; Bassaw et al.

2003; Vieira et al. 2003; Zaini 2003; Al-Hazimi et al. 2004a, b;

Mayya & Roff 2004; Till 2004, 2005; Jiffry 2005; Varma et al.

2005; Dunne et al. 2006) demonstrated the robustness in terms

of their psychometric qualities. Their content validity was

established and they also had high internal consistency. The

construct validity of the MSLES and DREEM were indicated by

their ability to significantly differentiate the students’ percep-

tions of the learning environment between medical schools

with the traditional curriculum and ones with the more

innovative curriculum (Feletti & Clarke 1981b; Al-Qahtani

1999; Zaini 2003; Al-Hazimi et al. 2004a). The DREEM showed

an additional strength because it could be applied in medical

schools in different countries, cultures and contexts and still

demonstrated consistent reliability. In relation to criterion

validity, there was a relationship between the DREEM scores,

which reflected the students’ perceptions of the learning

environment and academic achievement (Baozhi 2003; Mayya

& Roff 2004).

According to the data presented in the related articles, the

DREEM is likely to be the most suitable instrument to be

applied in undergraduate medical education settings. The

validity had been established, and the instrument demon-

strated highly reliable results consistently throughout its

administration in different contexts. The results also indicated

the ability of the DREEM to differentiate between the learning

environment of the traditional and one of the more innovative

medical schools.

Medicine (postgraduate). Most of the identified educational

environment measurement instruments in postgraduate med-

ical education were developed for use in specific postgraduate

specialty areas. The psychometric qualities of the identified

instruments are summarized in Table 3, available at www.

medicalteacher.org. Postgraduate medical education pro-

grammes consisted of many different specialties and levels of

training. Each speciality had its own uniqueness in terms of the

educational environment. This situation often required a

specific instrument which could address the certain aspects

of the educational environment in the particular setting.

Four instruments: the STEEM (Cassar 2004), the OREEM

(Kanashiro et al. 2006), the ATEEM (Holt & Roff 2004) and the

practice-based educational environment measure (Mulrooney

2005) were developed for use in the surgical/operating theatre

learning setting, anaesthetic theatre learning setting and

practice-based component in general practice (GP) training,

respectively. The VA Learners’ Perceptions Survey was

designed for residents from different specialties within the

United States VA clinical training setting (Keitz et al. 2003). In

the process of deciding the most suitable instrument for use in

postgraduate medical education, it was appropriate to exclude

those instruments. The decision was not based on the quality

of those instruments, since most of them demonstrated good

validity and reliability, but rather because of the specific

content of the instrument, which limited their use in a more

general postgraduate education setting.

The content validity of the LEA was not established,

although it showed fairly high internal consistency (Roth

et al. 2006). The questionnaire from Rotem et al. (1995) and

the PHEEM established content validity through clear descrip-

tion of the development process of the instruments (Roff et al.

2005). The strengths of the DREEM were the high internal

consistency and concurrent validity. However, the content and

construct validities were not established since the factor

analysis demonstrated that some modifications were necessary

for the instrument to be applied in postgraduate setting

(Bassaw et al. 2003; de Oliveira Filho & Schonhorst 2005;

de Oliveira Filho et al. 2005a, b).

The PHEEM consisted of 40 items divided into three

subscales (perceptions of role autonomy, perceptions of

teaching and perceptions of social support). The reliability of

the PHEEM was better compared to that of the Rotem, Godwin

and Du questionnaire. It has also been shown that by using

PHEEM, reliable results can be obtained with feasible sample

size. In addition, the PHEEM had been administered to several

different sample groups, such as senior house officers and

specialist registrars from different specialties, and demon-

strated almost similar reliability coefficients in those groups

(Jayashree 2004; Roff et al. 2005; Aspegren et al. 2007; Boor

et al. 2007; Clapham et al. 2007). Some researches have

conducted factor analysis on PHEEM and they demonstrated

different results regarding the factors, which the instrument

measures (Aspegren et al. 2007; Boor et al. 2007; Clapham

et al. 2007). The construct validity of the Rotem, Godwin and

Du questionnaire was not clearly established. Therefore, the

PHEEM is likely to be the most suitable instrument for use in

postgraduate medical education because of its content validity,

high reliability and also its ability to be used in different

postgraduate settings.

Nursing. Most of the identified instruments were used to

assess the clinical learning environment in nursing education

setting. A summary of the psychometric qualities of the

instruments’ is presented in Table 4, available at www.

medicalteacher.org. All instruments, except CUCEI (Fisher &

Parkinson 1998), provided moderate to high reliability coef-

ficients. Several instruments, such as the CPCLES (Letizia &

Jennrich 1998), the questionnaire from Hart and Rotem (1995),

and the DREEM (Pimparyon et al. 2000; Al-Sketty 2003;

BEME Rapid Review

951

Med

Tea

ch D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Maa

stri

cht o

n 06

/18/

14Fo

r pe

rson

al u

se o

nly.

O’Brien et al. 2008) demonstrated some weaknesses in regard

to the construct validity of the instruments. Almost all

instruments have demonstrated a high degree of content

validity. The PREQ covered several aspects of an educational

environment (supervision, intellectual climate, clarity, infra-

structure, skills development and thesis examination process).

However, according to the validity analysis, there are some

subscales or statements in the instrument that were too specific

for a particular nursing education setting (Drennan 2008),

which then limit its use at different nursing education settings.

The content and construct validities of the CLES have been

established. It was administered to the groups of nursing

students from different countries (Finland and United

Kingdom) and it has demonstrated marginal to high reliability

coefficient (internal consistency) in those studies (Saarikoski &

Leino-Kilpi 1999, 2002; Saarikoski et al. 2002). The CLEI had

also been administered in several contexts of nursing educa-

tion. However, the internal consistency was fairly poor

compared to that of other instruments (Chan 2001a, b, 2002,

2003; Ip & Chan 2005; Henderson et al. 2006; Midgley 2006;

Chan & Ip 2007). The reliability coefficients of the CLE Scale

(Dunn & Burnett 1995; Dunn & Hansford 1997) were lower

than those of the CLES, although both the instruments’

construct validity has been confirmed. Therefore, based on

the available information and analysis, the CLES is likely to be

the most suitable instrument to assess nursing students’

perceptions of their clinical learning environment.

Dentistry. There were five instruments for dental educational

environment measurement identified in this study. The validity

and reliability of those instruments varied and are summarized

in Table 5, available at www.medicalteacher.org.

There was only one instrument that was originally devel-

oped for dental educational environment (Gerzina et al. 2005).

The others were modifications or identical version of instru-

ments, which were originally designed to be used in medical

school. The content validity of those instruments was ques-

tionable since there were no processes of validating them

before they were applied in dental education settings,

although the original versions may have proven to be valid.

In addition to that, two instruments (LES and DSLES) were

developed from the MSLES. It was an instrument first devel-

oped in 1970s and it might not encompass current trends or

changes in educational practice nowadays. The DSLES was

more identical to the MSLES than the LES, because the LES was

a shorter version of the MSLES whereas the changes in the

DSLES were only minor.

The LES (Stewart et al. 2006) demonstrated the highest

overall reliability coefficient (Cronbach’s alpha of 0.97). The

reliability of the ClinEd IQ (Henzi et al. 2006), the DREEM

(Zamzuri et al. 2004) and the questionnaire from Gerzina et al.

(2005) were not tested. The DSLES also demonstrated good

reliability (Cronbach’s alpha of 0.91), which was comparable

to those of the MSLES (Henzi et al. 2005).

Based on the information provided in the studies and the

analysis conducted, the DSLES is likely to be the most suitable

instrument for measuring educational environment in dental

education setting. It demonstrated good reliability and since

the DSLES was more similar to MSLES than the LES, it was

more likely to have better content validity.

Conclusion

The systematic search yielded 178 studies, which were

considered to be eligible for further review. The pre-specified

criteria were applied to those studies and finally 79 studies,

which were the original/primary studies measuring the overall

educational/learning environment in health professions edu-

cation using specific instruments, were included. A substantial

number of educational environment measurement instruments

(31 instruments) were extracted from the included studies. The

use of a wider range of databases with more elaborated search

strategies will increase the comprehensiveness of the system-

atic review.

The DREEM, PHEEM, CLES and DSLES are likely to be the

most suitable instruments for undergraduate medicine, post-

graduate medicine, nursing and dental education, respectively.

The content validity was established through elaborate

description of the development process of the instruments.

Some instruments have added values with the establishment of

their construct validity. These educational environment mea-

surement instruments also demonstrated consistency through-

out their applications in different contexts/settings.

Further analysis will be useful to explore the ability of each

instrument as a predictor variable for a particular educational

achievement. Furthermore, a study on students’ perceptions of

the educational environment should become a part of good

educational practice of an institution, as a suitable, valid and

reliable instrument for each educational context is available.

Acknowledgements

The authors thank Sue Roff and Sean McAleer for their

comments and support in completing the project and finalizing

the manuscript. This work is partially supported by grants of

the National Commission for Scientific and Technological

Research (CONICYT), FONDECYT no. 11004336 to A.R.

Declaration of interest: The authors report no conflicts of

interest. The authors alone are responsible for the content and

writing of the article.

Notes on contributors

DIANTHA SOEMANTRI, MD, MmedEd, is a lecturer in medical education in

the Department of Medical Education, Faculty of Medicine, Universitas

Indonesia.

CRISTIAN HERRERA is member of the Evidence Based Medicine Unit and

the Health Policy and Systems Research Unit from the School of Medicine,

Pontificia Universidad Catolica de Chile.

ARNOLDO RIQUELME, MD, MmedEd, is an undergraduate and postgrad-

uate clinical tutor in Internal Medicine and consultant in the Department of

Gastroenterology, Pontificia Universidad Catolica de Chile School of

Medicine, Chile.

References

The references for this article can be viewed at www.medicalteacher.org.

D. Soemantri et al.

952

Med

Tea

ch D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Maa

stri

cht o

n 06

/18/

14Fo

r pe

rson

al u

se o

nly.

measuring the educational environment in health professions studies: a systematic review

Documents