CJA
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Résumé de cet Article
Right arrow Full Text (PDF)
Right arrow Additional Material
Right arrow Submit a scholarly reply
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Related articles in CJA
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Jacobsohn, E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Jacobsohn, E.
Canadian Journal of Anesthesia 53:659-668 (2006)
© Canadian Anesthesiologists' Society, 2006

General Anesthesia

Poor inter-rater reliability on mock anesthesia oral examinations

[Le pauvre coefficient d’objectivité d’examens oraux en anesthésie simulés]

Eric Jacobsohn, MBCHB MHPE FRCPC*, P. Alan Klock, MD{dagger}, Michael Avidan, MBBCH* and the Oral Examinations Group{ddagger}

* From the Departments of Anesthesiology and Surgery, Washington University School of Medicine, St. Louis, MO; and the
{dagger} Anesthesia and Critical Care, University of Chicago, Chicago, Illinois, USA;
{ddagger} (see Appendix II).

Address correspondence to: Dr. Eric Jacobsohn, Department of Anesthesiology, Washington University School of Medicine, 660 S. Euclid, MC 8054, St. Louis, MO 63110, USA. Phone: 314-747-4155; Fax: 314-362-4551; E-mail: jacobsoe{at}msnotes.wustl.edu


    Abstract
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
Purpose: Inter-rater reliability (IRR) and communication skills are both important factors that have been shown to affect oral examination scores. This study was designed to test: 1) IRR of a group of American Board of Anesthesiology (ABA) and the Royal College of Physicians and Surgeons of Canada (RCPSC) graders who graded in true isolation; 2) the effect of teaching residents examination techniques.

Methods: This was a randomized, pretest-posttest trial. Twenty-five residents did an initial oral examination (E1) resembling the ABA examination. They were then randomized into two groups, a routine education group, and an intervention group that was taught oral examination skills. Six weeks later they did another oral examination (E2). The videotaped examinations were subsequently scored by six experienced RCPSC and ABA graders.

Results: There was very poor IRR on E1 (weighted Kappa = 0.166, intraclass correlation coefficient 0.243), which improved only slightly on E2 (weighted Kappa = 0.275, P = NS; intraclass correlation coefficient = 0.405, P < 0.01). Pass rate for graderpairs increased from E1 to E2 (15% vs 43%, P = 0.01). The improved pass rate on E2 occurred in both the routine education group and in the intervention group. There was no significant difference between RCPSC and ABA graders. Teaching examination skills per se did not improve performance, but this conclusion may be limited by the poor IRR. Practice orals do appear to improve performance on future examinations.

Conclusions: Inter-rater reliability may be poor when graders score an oral examination in true isolation. Teaching candidates an oral examination communication and presentation technique did not appear to improve performance. Oral examination practice may be of value in training for future examinations.


    Introduction
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
ORAL examinations remain an integral part of the certifying process in many countries, including Canada and the United States. The stated goals of both the Royal College of Physicians and Surgeons of Canada (RCPSC) and the American Board of Anesthesiology (ABA) oral examinations are similar, and these goals aim to test judgment, application of knowledge, adaptability, clarity of expression, and ability to communicate effectively. Many factors have been shown to affect the score in oral examinations, including the validity and inter-rater reliability (IRR) of the examination14 as well as several examinee factors. With regard to IRR, in two studies of mock anesthesia oral examinations, one resembling the ABA and one resembling the RCPSC examination, the IRR was reported as being acceptable.2,3 With regard to examinee factors, a study of surgery residents demonstrated that a candidate’s communication style may be a significant factor in his/her overall score.5 It is therefore possible that some candidates may fail oral examinations due to problems with IRR and/or because they are unable to communicate effectively.

There were two primary aims of this study: 1) to test the IRR of experienced oral examiners who had previously examined at the ABA or RCPSC; 2) to test the effect of teaching candidates certain oral examination skills, such as presentation and communication techniques. The secondary aim was to compare scores given by graders from the ABA and RCPSC.


    Methods
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
After approval from the Institutional Review Board, 29 anesthesiology residents in the University of Chicago Department of Anesthesia and Critical Care residency program were sent recruitment letters inviting them to participate in the study. The study was a randomized, two-treatment group pretest posttest trial (FigureGo). Twenty-seven residents in years one to three of clinical anesthesia training (CA–1 to CA–3) participated and were paid a $50 stipend. Due to scheduling difficulties, two withdrew.


Figure 1
View larger version (24K):
[in this window]
[in a new window]
 
FIGURE The study design.

 
All of the residents were sent the oral examination information package distributed by the ABA, which includes a section on pitfalls in taking oral examinations. The residents then did an oral examination (pretest, examination E1) similar to session B of the ABA exam. After being given one of two stem questions (SC1 and SC2) and ten minutes of preparation, the candidate was questioned for 20 min on preoperative and intraoperative scenarios. After the stem, there was a ten minute period of questioning on one of two additional short-cases (AC1 and AC2, Appendix 1). The scenarios that were used had previously been used at a "real" ABA exam. These scenarios were assessed by the principal investigator (E.J.), the examiner (P.A.K.), and two other senior anesthesiologists who were active ABA examiners and the scenarios were deemed to be fair and of comparable difficulty. During the questioning portion of the examination one person (P.A.K.), who was blinded to the resident’s group assignment, examined all residents and used a grid to guide his questioning. All the examinations were videotaped. The residents were all similarly dressed in a white coat, were given a numerical code for identification, and could not be identified by name on the videotapes.

After E1, the residents were randomized and stratified by year of training to one of two groups: Group T (new educational intervention, n = 13) and Group R (routine residency education, n = 14). The residents in Group T received a package relating to oral examination presentation and communication techniques. This package included the information that was to be presented at a communication skills workshop they were required to attend. This workshop was designed to improve the resident’s oral examination communication and presentation skills. It consisted of a presentation on the structure and pitfalls in the oral examinations, a review of the ABA scoring system, a review of the information that the ABA sends to oral examination candidates (which includes reviewing a typical examination scenario and the accompanying examiner’s grid), and a discussion on oral examination communication techniques such as clarity and speed of speaking, intonation, eye contact, body language, and dress. This workshop also taught the participants a comprehensive system to present and communicate cases on an oral examination. After the workshop, the participants were divided into small groups to practice the presentation and communication system. They were urged to start using the system in their daily practice when communicating or presenting cases to their attending physicians. To reinforce learning the system, individual one-hour follow-up sessions were held with Group T. Group T continued to attend all the usual residency educational seminars. Group R did not attend the educational intervention, but continued their usual residency educational and reading schedules. The residents agreed not to communicate about the examination content. They understood that there was no gain or loss from a good or bad performance and they acknowledged that after accepting a stipend it would be unethical if they did communicate with each other along with invalidating the study. Six weeks after the workshop, the residents in both groups did another oral examination (posttest, E2). The same questions used in E1 were administered, but in crossover design (FigureGo).

The videotaped performances were randomly copied to a series of videotapes. They were "scrambled" as to the sequence of candidates and to E1 or E2; this was done to minimize the confounding variable of grader calibration. The series of videotapes were then sent to the six oral examination graders, three with current experience examining for the ABA, and three with current or very recent experience examining for the RCPSC. These graders had been selected on the basis of peer examiners (at the ABA and RCPSC) recommendations as neither being too difficult nor too lenient. The graders were paid for their time and knew nothing of study design. The graders used a scoring sheet similar to that used by the ABA that has an ordinal scale of 70, 73, 77, and 80 (Supplement 1: Scoring sheet A, available as Additional Material at www.cja-jca.org). In addition to the scoring sheet, to further try and standardize scoring graders were given an educational module regarding scoring of oral examinations (which included the information that the ABA examiners are given regarding the ABA oral examination). They also received an examiners’ grid for each question that listed the topics that had to be addressed in the question. A candidate passed if the average score given by two graders was > 75. To collect more detailed data that could not be assessed by the ABA scoring system, the investigators designed another scoring sheet that included an overall percentage score (0–100) and several ten–point ordinal scores (Supplement 2: Scoring sheet B, available as Additional Material at www.cja-jca.org).

After E1 and E2, the residents completed several self-assessment questionnaires, including one for fairness, difficulty, examiner intimidation and self-assessment score. An anonymous questionnaire after E2 was used to detect if there was any communication between residents about the examination.


    Statistical analysis
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
Data were analyzed using the Statistical Analysis System (SAS 6.12, Cary, NC, USA) with assistance from the Biostatistics Department of the University of Manitoba. Unpaired t tests were used to compare normally distributed characteristics between the groups, and paired t tests were used to compare normally distributed characteristics within the groups. McNemar test was used to assess changes between proportions within the groups. Repeated measures ANOVA were used to compare the effect of the intervention on grader and self-assessment scores. For all these statistical tests, {alpha} was set at 0.05. Intraclass correlation coefficients (ICC) were used to calculate IRR between the graders for interval scores. Weighted Kappa tests were used for agreement between two methods, raters, or observers, when the observations are measured on an ordinal scale. The degree of agreement is indicated by the weighed Kappa statistic, which can be roughly interpreted as follows: 0 = trivial; 0.1 = small; moderate = 0.3; large = 0.5; very large = 0.7; nearly perfect = 0.9.


    Results
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
On the resident questionnaires (Table IGo), there was no reported communication between residents, no intimidation by the examiner, and the examinations were rated as being fair and meeting the stated goals. There was no significant difference between difficulty of E1 or E2, or between the two question formats (SC1 + AC1 and SC2 + AC2). On the grader’s questionnaire (Table IIGo), the graders rated the examinations as fair, of comparable difficulty to each other, and to a "real" certifying examination at their respective boards.


View this table:
[in this window]
[in a new window]
 
TABLE I Residents’ questionnaire
 

View this table:
[in this window]
[in a new window]
 
TABLE II Graders’ evaluation of the examination process
 
Inter-rater reliability using ICC for the component and overall percentage scores, as well as the weighted Kappa coefficient for ordinal scores, are shown in Table IIIGo. On examination E1, ICC was generally poor, ranging from 0.113 [95% confidence interval (CI) 0.0.–0.305] to 0.283 (95% CI 0.087–0.497) for the component scores, and from 0.175 (95% CI 0.0135–0.378) to 0.182 (95% CI 0–0.415) for the overall percentage score. There was a significant improvement in the ICC for the overall percentage and component scores from examination E1 to E2, but ICC remained generally poor. The weighted Kappa for the ordinal scores ranged from 0.01 (95% CI -0.27–0.28) to 0.32 (95% CI -0.02–0.66) on E1, and from 0.16 (95% CI -0.16–0.48) to 0.5 (95% CI 0.25–0.74) on E2. Unlike the percentage scores, there was not a significant increase from E1 to E2.


View this table:
[in this window]
[in a new window]
 
TABLE III Inter-rater reliability: Intraclass correlation coefficients (ICC, with 95% CI) for the component scores and overall percentage score, and weighted Kappa coefficients for the overall ordinal score
 
There was a significant improvement in the pass rate from E1 to E2 for all RCPSC and ABA graders (Table IVGo). The average single RCPSC grader’s pass rate of 28% on E1 was lower than the average single ABA grader rate of 44% (P = 0.05), but this difference was not significant if the most lenient ABA grader (US1) and most stringent RCPSC grader (Can3) were omitted (RCPSC 34% vs US 24%, P = 0.28). The average single RCPSC and ABA graders’ pass rate were identical on E2. The mean pass rate for all single examiners improved from 36% on E1 to 64% on E2 (P < 0.01). The average RCPSC and ABA pairs pass rate was identical on E2. The overall pairs pass rate for all graders increased from 15% on E1 to 43% on E2 (P < 0.01). Using the ordinal system, 44% of candidates would have failed irrespective of the grader, but only 8% of candidates passed irrespective of the grader. In the remaining 48% of candidates the chance of passing or failing was examiner dependent. In the most extreme case one resident received a score of 80 (clear pass) from four graders, and a score of 70 (clear fail) from two graders.


View this table:
[in this window]
[in a new window]
 
TABLE IV Pass rates (%) with one and two graders
 
In Table VGo, we show the effect of the teaching intervention and the effect of a repeat examination on scores. Although residents in both Groups R and T had a significant improvement from E1 to E2 on all the component and overall scores, but there were no significant differences in the improvement between the groups. The group-time effect shows that there is no significant difference in the degree of improvement in scores between the Groups R and T, and the group effect shows that they were not significantly different from each other at any time point.


View this table:
[in this window]
[in a new window]
 
TABLE V The effect of the teaching inter vention per se and the effect of oral examination practice (i.e., the repeat examination) on the component and overall scores
 

    Discussion
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
The results of this study show that: 1) IRR between experienced oral examiners, when grading in true isolation, was poor. The IRR improved only modestly on a repeat oral examination; 2) teaching an oral examination presentation and communication techniques did not improve scores, but that a repeat oral examination had a powerful effect at improving performance; 3) graders from the RCPSC and ABA gave similar scores.

The IRR of graders scoring in true isolation (i.e., videotaped performances) was very poor on both examinations. Although some improvement was noted on the second examination, the IRR never reached an acceptable level, that is, > 0.7. Most certifying boards strive for IRR levels of 0.8 to 0.9. In a four-year prospective study of anesthesiology residents, 441 practice oral examinations resembling the ABA exam were administered to 190 residents.2 Using the ABA ordinal score and pairs of examiners, the IRR was found to be 0.68. However, in this study, the examiners were faculty members in the residency program and therefore knew the residents.2 This fact could have lead to examiner bias due to the "halo effect" where a candidate is scored high or low based upon previous encounters or knowledge of the resident. In a study of practice oral examinations using the RCPSC oral examination format, the mean IRR ranged from 0.477 to 0.791 (measured at two different study sites and at two different times).3 However, the examiners were not truly grading in isolation, and not all examiners were experienced RCPSC examiners.

There are several possible explanations for the poor IRR in our study. Examiners were evaluating residents-in- training who were probably not as well prepared as true candidates. Given that IRR is weaker for poor performances than good performances, we may have expected poorer IRR for the practice orals than for real certifying examinations. This may be supported by the observation that the scores improved significantly from E1 to E2. This was accompanied by an improvement in the IRR for the component scores, the overall continuous score, and the ordinal score. It may be that if the residents had been in their CA3 years that the IRR would have been better. We did not attempt a subgroup analysis for the CA3 residents because the group was too small. Another possible explanation for the poor IRR may be that the examinations were inconsistent for content and quality. However, this is unlikely, as each question had previously been used at an ABA certifying exam, and the graders felt that they were fair and similar to a certifying ABA examination. Also, the examiner (P.A.K.) had been on faculty for five years, was trained in administering oral examinations, and had developed the grid himself.

A plausible explanation for the poor IRR may be that conscious or subconscious consensus building between graders was not possible due to the study design. In 1966, it was first suggested that "subtle communication" occurred between examiners.6 The graders in our study scored the examinations with-out being aware of the study aims, and there was no possibility for consensus building. Most exam boards report good IRR, and in the two practice oral examination studies2,3 the IRR was reported as being fair to good. However, graders were invariably not scoring in true isolation, and non-verbal communication could have occurred.

Another factor that may have affected the IRR is the type of grader; it is well known that there are strict and lenient graders. It appears that one of the US graders (US1) was lenient, and one of the Canadian examiners (Can3) was strict. This exposes one of the difficulties with oral examinations, namely that they are difficult to grade reliably even when they have been standardized. Standardizing the oral and having examiner grids and scoring sheets helps, but it does not eliminate this problem. In our study, when using the ordinal score, 44% of candidates failed irrespective of grader, and only 8% passed irrespective of grader. In the remaining 48%, the chance of passing or failing was examiner dependent. In order to minimize the IRR problems, many examining boards provide training for examiners and have ongoing monitoring of their examiners. The graders in our study were all trained and experienced in giving oral examinations. Another possible way to try and improve the process would be for the same examiners to ask the same question to all candidates, that is, have the candidates move from one room to another during their exam.3 This would mean that the examiners in the same room would ask only one question to multiple candidates. This would have the effect of standardizing the exam as well as assuring some degree of standardization in the scoring. This makes the examination logistically more challenging, especially for large numbers of candidates.

We thought that comparing RCPSC and ABA graders would be interesting as both countries have high standards in anesthesiology training, similar certification processes, and the competencies expected of a certified anesthesiologist are very similar.7 However, the analysis was complicated by the variability in grader stringency. One ABA grader was lenient (pass rate 80%), and one RCPSC grader was strict (pass rate 16%). When these two were removed from the analysis, there was no significant difference in the pass rates between the ABA and RCPSC graders. When the variability of the grader pass rates is considered, we would have needed 25 residents per group and 12 graders per country to detect a 10% difference in pass rates with a power of 80%.

The improved scores from the first to the second examination may have several possible explanations. Both groups reported significantly less anxiety at the time of the second examination. This is an important finding as it has been shown that less confident and more anxious examinees do worse.8 Anesthesiology residents who were exposed to repeat mock oral examinations became more confident on subsequent examinations, even though their anxiety remained unchanged.9 We speculate that the residents in our study probably felt less anxious on the second examination as they knew that the examiner (P.A.K.) was non-intimidating and fair (based on their experience on E1). The combination of less anxiety and improved confidence could partly explain the improvement in the graders’ scores. The intervening time was only six weeks, making it unlikely that the improvement could be attributed to maturation in knowledge. The fact that both groups improved suggests that a repeat oral examination may have a powerful effect on examination performance. Practice orals in anesthesiology are well established and have been proposed to enhance exam-taking skills.

One of the primary goals of this study was to establish whether a presentation and communication method could improve performance. We could not show this. It is unlikely that inadequate dosage (strength of intervention) occurred; the seminar and follow-up sessions were comprehensive. The intervention may have been ineffective if the residents were not motivated to use it, but the questionnaire suggests that residents liked the system and used it. Another possible explanation may be that the poor IRR may have made it difficult for the effect of the intervention to be seen, especially when there was such a powerful effect of a repeat oral examination. A reassuring explanation may be that experienced graders were not swayed by elegant communication and presentation skills in the absence of a substantive knowledge base. While it was once believed that communication skills were a discrete component in assessing clinical competence, it is now accepted that communication skills are dependent on the context in which they are performed.10 Using standardized patients, Colliver et al. showed that clinical competence and communication skills were related only in the clinical context in which they were presented.11 In their work, if students did badly in the clinical context, communication would be rated poorly. Since most of the residents in our study did not receive overall passing grades, the association between poor overall competences may have been another factor in addition to the poor IRR. The sample population consisted of residents in their CA1-CA3 years, and it may be that if we studied only residents with a more developed knowledge (CA3) that we would have shown an effect. Another possible explanation for not showing a difference between groups may be that the method we taught was simply not good enough.

The study has several limitations. The residents were a "convenience" sample and included residents from CA1-CA3 years. Junior residents were unlikely to do well, and it can be assumed that their performance would be below that of candidates presenting themselves for a certifying examination. The IRR of poor performances is less reliable than that of good performances.11 The IRR results of the study may therefore have been biased by the participation of the junior residents. However, potentially incompetent candidates do sometimes present themselves for certifying examinations, and it would be concerning if some of them passed. Another limitation of the study was in the examiner’s grid. Although the questions were standardized, each examination could take a slightly different course depending on how the resident responded. This could have made the scoring on the grid difficult to follow at times. However, this again is something that does occur in certifying examinations. Another weakness is that the self-assessment scores may have significant bias. The investigator had the ability to identify the residents, and hence they may have been less likely to criticize the intervention.

In conclusion, we have shown that on a mock anesthesiology oral examination resembling session B of the ABA certifying examination, that: 1) IRR between experienced examiners was poor when they graded in isolation, and many candidates would have passed or failed depending on the examiner; 2) practice oral examinations have a powerful effect on improving performance by reducing candidate anxiety and improving confidence; 3) oral examination presentation and communication skills per se did not improve scores; and 4) graders from the ABA and RCPSC appeared to give similar scores.


    APPENDIX I: Examination cases and examiner’s grids
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
Stem case 1 (SC1): resident worksheet
A 68-yr-old male with steroid-dependent rheumatoid arthritis is scheduled for a revision total hip replacement. He has diabetes, a hiatal hernia, and coronary artery disease. He had a myocardial infarction in the past which was complicated by congestive heart failure. Medications include: digoxin, furosemide, captopril, atenolol, prednisone, ranitidine and glyburide. Vital signs: P 59, BP 130/70, R 14, T 36.9.

SC1: examiner’s grid
A 68-yr-old male with steroid-dependent rheumatoid arthritis is scheduled for a revision total hip replacement. He has diabetes, a hiatal hernia and coronary artery disease. He had a myocardial infarction in the past, which was complicated by congestive heart failure. Medications include: digoxin, furosemide, captopril, atenolol, prednisone, ranitidine and glyburide. Vital signs: P 59, BP 130/70, R 14, T36.9

What are your concerns about this patient?
PREOPERATIVE MANAGEMENT
Coronary disease

- History: duration, symptoms, complications, myocardial infarction, heart failure
- Examination: full cardiovascular examination
- Tests: Would you order any tests? Why? ECG shows sinus rhythm, LBBB, relevance? Do you want any further tests? If so, which ones and why?

Medications:

Would you continue digoxin, captopril, atenolol, furosemide? Why, why not?

Rheumatoid arthritis (RA):

- How does history of RA affect your evaluation and management plan?
- History: duration, multisystem disease, severity, medications, steroid use.
- Tests: c-spine films
- Upon examination, patient can only open mouth 2 cm. What if he opens 4 cm?
- Perioperative steroid management
- Positioning

Hiatal hernia:

- Preoperative precautions, medications

Diabetes:

- History (onset adult vs juvenile, nephropathy, retinopathy, neuropathy)
- Physical
- Labs: glucose is 360, postpone case? Glycosylated Hb
- Management of diabetic ketoacidosis or hyperosmolar state
- Perioperative management

Drug interactions: polypharmacy

ANESTHETIC PLAN AND INTRAOPERATIVE MANAGEMENT
Monitors: art line, CVP vs PAC, TEE

- Do your monitoring choices change if GA vs epidural?

Induction plan

Maintenance plan

Blood loss

Heat conservation

Positioning

PA catheter is placed

Critical incident: Bradycardia

Critical incident: Hypoxemia

Emergence

Postoperative disposition: ICU vs PACU

Decision to extubate.

POSTOPERATIVE MANAGEMENT
Pain control. Epidural management

Hypothermia (34.5°C)

Stem case 2 (SC2): resident worksheet
A 47-yr-old obese female is scheduled for a pelvic exenteration (radical hysterectomy, oophorectomy, possible partial cystectomy and/or partial colectomy). She has a history of diabetes, hypertension, and asthma. Her medications include diltiazem, captopril, insulin, hydrochlorothiazide, cimetidine, and an albuterol inhaler. Her examination reveals P 68, BP 145/90, R 14, T 37.0

SC2: examiners grid
A 47-yr-old obese female is scheduled for a pelvic exenteration (radical hysterectomy, oophorectomy, possible partial cystectomy and/or partial colectomy). She has a history of diabetes, hypertension, asthma, she has recently started treatment for reflux esophagitis. Her medications include diltiazem, captopril, insulin, hydrochlorothiazide, cimetidine and an albuterol inhaler. Her examination reveals P 68, BP 145/90, R 14, T 37.0

What are your concerns about this patient?
PREOPERATIVE MANAGEMENT
Hypertension:

- History: duration, severity, medications, complications
- Examination: full cardiovascular examination

Test:

Would you order any tests? Why? ECG shows LVH with "strain," relevance? Do you want any further tests? If so, which ones and why?
Would you continue diltiazem, captopril and hydrochlorothiazide; why, why not?

Asthma:

- History: duration, severity, medications, steroid use, hospitalization, recent wheezing
- Patient reports intermittent "chest tightness."
- Examination: end-expiratory wheezes are heard at the bases of both lungs.
- Cancel case?, Why? Why not?
- How to optimize patient preoperatively.

Obesity:

- How does obesity change your preoperative evaluation and anesthetic plan?
- What comorbid conditions are associated with obesity?
- Positioning

Reflux esophagitis:

- Preoperative precautions; medications

Diabetes:

- History: onset adult vs juvenile, nephropathy, retinopathy, neuropathy)
- Physical
- Tests: glycosylated Hb, glucose
- Ketoacidosis
- Perioperative management

Drug interactions: polypharmacy

ANESTHETIC PLAN AND INTRAOPERATIVE MANAGEMENT
Monitors: art line, CVP

Induction plan

Maintenance plan

Blood loss

Heat conservation

Positioning

Central line is placed

Critical incident: increased peak airway pressure.

Critical incident: hypotension

Emergence

Postoperative disposition ICU vs PACU

Decision to extubate

POSTOPERATIVE
Pain control

Hypothermia (34.5°C)

Additional case 1 (AC1)
A 60-yr-old man has a spinal anesthetic for transurethral prostatectomy (TURP). Six minutes after the administration of the spinal, his BP decreases from 120/60 to 80/50, and HR decreases from 85 to 45. He has difficulty breathing and is becoming somnolent. What has happened? Pathophysiology? Management?

Additional case 2 (AC2)
A 26-yr-old IV drug abuser requires an exploratory laparotomy. He has a history of hepatitis. What are your concerns? What anesthetic technique is the best? What agents to use? What are the risks?


    APPENDIX II
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
The Oral Board Examiners Group were: Laurence Brownell MD, Professor, Department of Anesthesia, University of Manitoba, Winnipeg, Manitoba; Robert Friesen MD, Professor, Department of Anesthesia, University of Manitoba, Winnipeg; Joel B. Gunter MD, Professor, Department of Anesthesiology, University of Cincinnati, Cincinnati, Ohio; David Porembka DO, Professor, Department of Anesthesiology, University of Cincinnati, Cincinnati, Ohio; Heinz Reimer MD, Associate Professor, Department of Anesthesia, University of Manitoba, Winnipeg, Manitoba; David Rothenberg MD, Professor, Department of Anesthesiology, Rush Medical College, Chicago, Illinois. The Oral Board Examiners participated in the study by reviewing and scoring all the videotaped performances, and by reviewing the manuscript.


    Acknowledgments
 
We sincerely thank Ms. Mary Cheang, Biostatistician, University of Manitoba, for her expert advice and assistance in doing the analyses.


    Footnotes
 
Funding was provided by the Royal College of Physicians and Surgeons of Canada (Medical Education Fellowship to E.J.), the Section of Critical Care Research Fund, University of Manitoba (E.J.), and the Department of Anesthesia and Critical Care Research Trust, University of Chicago (E.J., P.A.K.).

Accepted for publication January 19, 2006. Revision accepted January 30. 2006.

Competing interests: None declared.

This article is accompanied by an editorial. Please see Can J Anesth 2006; 53: 639–42..


    References
 TOP
 Abstract
 Introduction
 Methods
 Statistical analysis
 Results
 Discussion
 APPENDIX I: Examination cases...
 APPENDIX II
 References
 
1 Eagle CJ, Martineau R, Hamilton K. The oral examination in anaesthetic resident evaluation. Can J Anaesth 1993; 40: 947–53.[Abstract/Free Full Text]

2 Schubert A, Tetzlaff JE, Tan M, Ryckman JV, Mascha E. Consistency, inter-rater reliability, and validity of 441 consecutive mock oral examinations in anesthesiology: implications for use as a tool for assessment of residents. Anesthesiology 1999; 91: 288–98.[Medline]

3 Kearney RA, Puchalski SA, Yang HY, Skakun EN. The inter-rater and intra-rater reliability of a new Canadian oral examination format in anesthesia is fair to good. Can J Anesth 2002; 49: 232–6.[Abstract/Free Full Text]

4 Muzzin LJ, Hart L. Oral examinations. In: Neufeld VR, Norman GR (Eds). Assessing Clinical Competence. New York: Springer Publishing Co.; 1985: 71–93.

5 Burchard KW, Rowland-Morin PA, Coe NP, Garb JL. A surgery oral examination: interrater agreement and the influence of rater characteristics. Acad Med 1995; 70: 1044–6.[Medline]

6 McGuire CH. The oral examination as a measure of professional competence. J Med Educ 1966; 41: 267– 74.[Medline]

7 Eagle C. Anaesthesia and education. Can J Anaesth 1992; 39: 158–65.[Abstract/Free Full Text]

8 Linn BS, Zeppa R. Stress in junior medical students: relationship to personality and performance. J Med Educ 1984; 59: 7–12.[Medline]

9 Schubert A, Tetzlaff JE, Licina M, Mascha E, Smith MP. Organization of a comprehensive anesthesiology oral practice examination program: planning, structure, startup, administration, growth, and evaluation. J Clin Anesth 1999; 11: 504–18.[Medline]

10 Newble DI, Swanson DB. Psychometric characteristics of the objective structured clinical examination. Med Educ 1988; 22: 325–34.[Medline]

11 Colliver JA, Swartz MH, Robbs RS, Cohen DS. Relationship between clinical competence and interpersonal and communication skills in standardized-patient assessment. Acad Med 1999; 74: 271–4.[Medline]


Related articles in CJA:

The oral examination process – gold standard or fool’s gold/Le processus d’examen oral - un vrai ou un faux étalon-or ?
Patricia Houston, Ramona A. Kearney, and Georges Savoldelli
CJA 2006 53: 639-642. [Full Text]  



This article has been cited by other articles:


Home page
Canadian J. AnesthesiaHome page
S. C. Hall
Poor inter-rater reliability on mock anesthesia oral examinations
Can J Anesth, December 1, 2006; 53(12): 1268 - 1269.
[Full Text] [PDF]


Home page
Canadian J. AnesthesiaHome page
P. Houston, R. A. Kearney, and G. Savoldelli
The oral examination process - gold standard or fool's gold/Le processus d'examen oral - un vrai ou un faux etalon-or ?
Can J Anesth, July 1, 2006; 53(7): 639 - 642.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Résumé de cet Article
Right arrow Full Text (PDF)
Right arrow Additional Material
Right arrow Submit a scholarly reply
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Related articles in CJA
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Jacobsohn, E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Jacobsohn, E.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS