User login
Community Nursing Home Program Oversight: Can the VA Meet Increased Demand for Community-Based Care?
The US Department of Veterans Affairs (VA) Community Nursing Home (CNH) program provides 24-hour skilled nursing care for eligible veterans in public or private community-based facilities that have established a contract to care for veterans. Veteran eligibility is based on service-connected status and level of disability, covering the cost of care for veterans who need long-term care because of their service-connected disability or for veterans with disabilities rated at ≥ 70%.1 Between 2014 and 2018, the average daily census of veterans in CNHs increased by 26% and the percentage of funds obligated to this program increased by 49%.2 The VA projects that the number of veterans receiving care in a CNH program will increase by 80% between 2017 and 2037, corresponding to a 149% increase in CNH expenditures.2
CNH program oversight teams are mandated at each VA medical center (VAMC) to monitor care coordination within the CNH program. These teams include nurses and social workers (SWs) who perform regular on-site assessments to monitor the clinical, functional, and psychosocial needs of veterans. These assessments include a review of the electronic health record (EHR) and face-to-face contact with veterans and CNH staff, regardless of the purchasing authority (hospice, long-term care, short-term rehabilitation, respite care).3 These teams represent key stakeholders impacted by CNH program expansion.
While the CNH program has focused primarily on the provision of long-term care, the VA is now expanding to include short-term rehabilitation through Veteran Care Agreements.4 These agreements are authorized under the MISSION Act, designed to improve care for veterans.5 Veteran Care Agreements are expected to be less burdensome to execute than traditional contracts and will permit the VA to partner with more CNHs, as noted in a Congressional Research Service report regarding long-term care services for veterans.6 However, increasing the number of CNHs increases demands on oversight teams, particularly if the coordinators are compelled to perform monthly on-site visits to facilities required under current guidelines.3
The objective of this study was to describe the experiences of VA and CNH staff involved in care coordination and the oversight of veterans receiving CNH care amid Veteran Care Agreement implementation and in anticipation of CNH program expansion. The results are intended to inform expansion efforts within the CNH program.
METHODS
This study was a component of a larger research project examining VA-purchased CNH care; recruitment methods are available in previous publications describing this work.7 Participants provided written or verbal consent before video and phone interviews, respectively. This study was approved by the Colorado Multiple Institutional Review Board (Protocol #18-1186).
Video and phone interviews were conducted by 3 team members from October 2018 to March 2020 with CNH staff and VA CNH program oversight team members. Participant recruitment was paused from May to October 2020 as a result of the COVID-19 pandemic and ambiguity about VA NH care purchasing policies following the passage of the VA MISSION Act.5 We used semistructured interview guides (eAppendix 1 for VA staff and eAppendix 2 for NH staff, available online at doi:10.12788/fp.0421). Recorded and transcribed interviews ranged from 15 to 90 minutes.
Two members of the research team analyzed transcripts using both deductive and inductive content analysis.8 The interview guide informed an a priori codebook, and in vivo codes were included as they emerged. We jointly coded 6 transcripts to reach a consensus on coding approaches and analyzed the remaining transcripts independently with frequent meetings to develop themes with a qualitative methodologist. All qualitative data were analyzed using ATLAS.ti software.
This was a retrospective observational study of veterans who received VA-paid care in CNHs during the 2019 fiscal year (10/1/2018-9/30/2019) using data from the enrollment, inpatient and outpatient encounters, and other care paid for by the VA in the VA Corporate Data Warehouse. We linked Centers for Medicare and Medicaid monthly Nursing Home Compare reports and the Brown University Long Term Care: Facts on Care in the US (LTC FoCUS) annual files to identify facility addresses.9
Descriptive analyses of quantitative data were conducted in parallel with the qualitative findings.8 Distance from the contracting VAMC to CNH was calculated using the greater-circle formula to find the linear distance between geographic coordinates. Quantitative and qualitative data were collected concurrently, analyzed independently, and integrated into the interpretation of results.10
RESULTS
We conducted 36 interviews with VA and NH staff who were affiliated with 6 VAMCs and 17 CNHs. Four themes emerged concerning CNH oversight: (1) benefits of VA CNH team engagement/visits; (2) burden of VA CNH oversight; (3) burden of oversight limited the ability to contract with additional NHs; and (4) factors that ease the burden and facilitate successful oversight.
Benefits of Engagement/Visits
VA SWs and nurses visit each veteran every 30 to 45 days to review their health records, meet with them, and check in with NH staff. In addition, VA SWs and nurses coordinate each veteran’s care by working as liaisons between the VA and the NH to help NH staff problem solve veteran-related issues through care conferences. VA SWs and nurses act as extra advocates for veterans to make sure their needs are met. “This program definitely helps ensure that veterans are receiving higher quality care because if we see that they aren’t, then we do something about it,” a VA NH coordinator reported in an interview.
NH staff noted benefits to monthly VA staff visits, including having an additional person coordinating care and built-in VA liaisons. “It’s nice to have that extra set of eyes, people that you can care plan with,” an NH administrator shared. “It’s definitely a true partnership, and we have open and honest conversations so we can really provide a good service for our veterans.”
Distance & High Veteran Census Burdens
VA participants described oversight components as burdensome. Specifically, several VA participants mentioned that the charting they completed in the facility during each visit proved time consuming and onerous, particularly for distant NHs. To accommodate veterans’ preferences to receive care in a facility close to their homes and families, VAMCs contract with NHs that are geographically spread out. “We’re just all spread out… staff have issues driving 2 and a half hours just to review charts all day,” a VA CNH coordinator explained. In 2019, the mean distance between VAMC and NH was 48 miles, with half located > 32 miles from the VAMC. One-quarter of NHs were > 70 miles and 44% were located > 50 miles from the VAMC (Figure 1).
Participants highlighted how regular oversight visits were particularly time consuming at CNHs with a large contracted population. VA nurses and SWs spend multiple days and up to a week conducting oversight visits at facilities with large numbers of veterans. Another VA nurse highlighted how charting requirements resulted in several days of documentation outside of the NH visit for facilities with many contracted veteran residents. Multiple VA participants noted that having many veterans at an NH exacerbated the oversight burdens. In 2019, 252 (28%) of VA CNHs had > 10 contracted veterans and 1 facility had 34 veterans (Figure 2). VA participants perceived having too many veterans concentrated at 1 facility as potentially challenging for CNHs due to the complex care needs of veterans and the added need for care coordination with the VA. One VA NH coordinator noted that while some facilities were “adept at being able to handle higher numbers” of veterans, others were “overwhelmed.” Too many veterans at an NH, an SW explained, might lead the “facility to fail because we are such a cumbersome system.”
Oversight & Staffing Burden
While several participants described wanting to contract with more NHs to avoid overwhelming existing CNHs and to increase choice for veterans, they expressed concerns about their ability to provide oversight at more facilities due to limited staffing and oversight requirements. Across VAMCs, the median number of VA CNHs varied substantially (Figure 3). One VA participant with about 35 CNHs explained that while adding more NHs could create “more opportunities and options” for veterans, it needs to be balanced with the required oversight responsibilities. One VA nurse insisted that more staff were needed to meet current and future oversight needs. “We’re all getting stretched pretty thin, and just so we don’t drop the ball on things… I would like to see a little more staff if we’re gonna have a lot more nursing homes.”
Participants had concerns related to the VA MISSION Act and the possibility of more VA-paid NHs for rehabilitation or short-term care. Participants underscored the necessity for additional staff to account for the increased oversight burden or a reduction in oversight requirements. One SW felt that increasing the number of CNHs would increase the required oversight and the need for collaboration with NH staff, which would limit her ability to establish close and trusting working relationships with NH staff. Participants also described the challenges of meeting their current oversight requirements, which limited extra visits for acute issues and care conferences. This was attributed to a lack of adequate staffing in the VA CNH program, given the time-intensive nature of VA oversight requirements.
Easing Burden & Facilitating Oversight
Participants noted how obtaining remote access to veterans’ EHRs allowed them to conduct chart reviews before oversight visits. This permitted more time for interaction with veterans and CNH staff as well as coordinating care. While providing access to the VA EHR would not change the chart review component of VA oversight, some participants felt it might improve care coordination between VA and NH staff during monthly visits.
Participants felt they were able to build strong working relationships with facilities with more veterans due to frequent communication and collaboration. VA participants also noted that CNHs with larger veteran censuses were more likely to respond to VA concerns about care to maintain the business relationship and contract. To optimize strong working relationships and decrease the challenges of having too many veterans at a facility, some VA participants suggested that CNH programs create a local policy to recommend the number of veterans placed in a CNH.
Discussion
Participants interviewed for this study echoed findings from previous work that identified the importance of developing trusted working relationships with CNHs to care for veterans.11,12 However, interorganizational care coordination, a shortage of health care professionals, and resource demands associated with caring for veterans reported in other community care settings were also noted in our findings.12,13
Building upon prior recommendations related to community care of veterans, our analysis identified key areas that could improve CNH program oversight efficiency, including: (1) improving the interoperability of EHRs to facilitate coordination of care and oversight; (2) addressing inefficiencies associated with traveling to geographically dispersed CNHs; and (3) “right-sizing” the number of veterans residing in each CNH.
The interoperability of EHRs has been cited by multiple studies of VA community care programs as critical to reducing inefficiencies and allowing more in-person time with veterans and staff in care coordination, especially at rural locations.11-15 The Veterans Health Information Exchange Program is designed to optimize data sharing as veterans are increasingly referred to non-VA care through the MISSION Act. This program is organized around patient engagement, clinician adoption, partner engagement, and technological capabilities.16
Unfortunately, significant barriers exist for the VA CNH program within each of these information exchange domains. For example, patient engagement requires veteran consent for consumer-initiated exchange of medical information, which is not practical due to the high prevalence of cognitive impairment in NHs. Similarly, VA consent requirements prohibit EHR download and sharing with non-VA facilities like CNHs, limiting use. eHealth Exchange partnerships allow organizations caring for veterans to connect with the VA via networks that provide a common trust agreement and technical compliance testing. Unfortunately, in 2017, only 257 NHs in which veterans received care nationally were eHealth Exchange partners, which indicates that while NHs could partner in this information exchange, very few did.16
Finally, while the exchange is possible, it is not practical; most CNHs lack the staff that would be required to support data transfer on their technology platform into the appropriate translational gateways. Although remote access to EHRs in CNHs improved during the pandemic, the Veterans Health Information Exchange Program is not designed to facilitate interoperability of VA and CNH records and remains a significant barrier for this and many other VA community care programs.
The dispersal of veterans across CNHs that are > 50 miles from the nearest VAMC represents an additional area to improve program efficiency and meet growing demands for rural care. While recent field guidance to CNH oversight teams reduces the frequency of visits by VA CNH teams, the burden of driving to each facility is not likely to decrease as CNHs increasingly offer rehabilitation as a part of Veteran Care Agreements.17 Visits performed by telehealth or by trained local VA staff may represent alternatives.15
Finally, interview participants described the ideal range of the number of veterans in each CNH necessary to optimize efficiencies. Veterans who rely more heavily upon VA care tend to have more medical and mental health comorbidities than average Medicare beneficiaries.18,19 This was reflected in participants’ recommendation to have enough veterans to benefit from economies of scale but to also identify a limit when efficiencies are lost. Given that most CNHs cared for 8 to 15 veterans, facilities seem to have identified how best to match the resources available with veterans’ care needs. Based on these observations, new care networks that will be established because of the MISSION Act may benefit from establishing evidence-based policies that support flexibility in oversight frequency and either allow for remote oversight or consolidate the number of CNHs to improve efficiencies in care provision and oversight.20
Limitations
Limitations include the unique relationship between VA and CNH staff overseeing the quality of care provided to veterans in CNHs, which is not replicated in other models of care. Data collection was interrupted following the passage of the MISSION Act in 2018 until guidance on changes to practice resulting from the law were clarified in 2020. Interviews were also interrupted at the onset of the COVID-19 pandemic.
Conclusions
The current quality of the CNH care oversight structure will require adaptation as demand for CNH care increases. While the VA CNH program is one of the longest-standing programs collaborating with non-VA community care partners, it is now only one of many following the MISSION Act. The success of this and other programs will depend on matching available CNH resources to the complex medical and psychological needs of veterans. At a time when strategies to ease the burden on NHs and VA CNH coordinators are desperately needed, Veterans Health Information Exchange capabilities need to improve. Evidence is needed to guide the scaling of the program to meet the needs of the rapidly expanding veteran population who are eligible for CNH care.
Acknowledgments
The authors acknowledge Amy Mochel of the Providence Veterans Affairs Medical Center for project management support of this project.
1. Miller EA, Gadbois E, Gidmark S, Intrator O. Purchasing nursing home care within the Veterans Health Administration: lessons for nursing home recruitment, contracting, and oversight. J Health Admin Educ. 2015;32(2):165-197.
2. GAO. VA health care. Veterans’ use of long-term care is increasing, and VA faces challenges in meeting the demand. February 19, 2020. Accessed September 19, 2023. https://www.gao.gov/assets/gao-20-284.pdf
3. VHA Handbook 1143.2, VHA community nursing home oversight procedures. US Department of Veterans Affairs, Veterans Health Administration. June 2004. https://www.vendorportal.ecms.va.gov/FBODocumentServer/DocumentServer.aspx?DocumentId=3740930&FileName=VA259-17-Q-0501-007.pdf
4. Community care: veteran care agreements. US Department of Veterans Affairs. 2022. Updated August 8, 2023. Accessed September 7, 2023. https://www.va.gov/COMMUNITYCARE/providers/Veterans_Care_Agreements.asp
5. Massarweh NN, Itani KMF, Morris MS. The VA MISSION Act and the future of veterans’ access to quality health care. JAMA. 2020;324(4):343-344. doi:10.1001/jama.2020.4505
6. Colello KJ, Panangala SV; Congressional Research Service. Long-term care services for veterans. February 14, 2017. Accessed September 7, 2023. https://crsreports.congress.gov/product/pdf/R/R44697
7. Magid KH, Galenbeck E, Haverhals LM, et al. Purchasing high-quality community nursing home care: a will to work with VHA diminished by contracting burdens. J Am Med Dir Assoc. 2022;23(11):1757-1764. doi:10.1016/j.jamda.2022.03.007
8. Vaismoradi M, Turunen H, Bondas T. Content analysis and thematic analysis: Implications for conducting a qualitative descriptive study. Nurs Health Sci. 2013;15(3):398-405. doi:10.1111/nhs.12048
9. Brown University. LTC Focus. Accessed September 18, 2023. https://ltcfocus.org/about
10. Zhang W, Creswell J. The use of “mixing” procedure of mixed methods in health services research. Med Care. 2013;51(8):e51-e57. doi:10.1097/MLR.0b013e31824642fd
11. Haverhals LM, Magid KH, Blanchard KN, Levy CR. Veterans Health Administration staff perceptions of overseeing care in community nursing homes during COVID-19. Gerontol Geriatr Med. 2022;8:23337214221080307. Published 2022 Feb 15. doi:10.1177/23337214221080307
12. Garvin LA, Pugatch M, Gurewich D, Pendergast JN, Miller CJ. Interorganizational care coordination of rural veterans by Veterans Affairs and community care programs: a systematic review. Med Care. 2021;59(suppl 3):S259-S269. doi:10.1097/MLR.0000000000001542
13. Schlosser J, Kollisch D, Johnson D, Perkins T, Olson A. VA-community dual care: veteran and clinician perspectives. J Community Health. 2020;45(4):795-802. doi:10.1007/s10900-020-00795-y
14. Nevedal AL, Wong EP, Urech TH, Peppiatt JL, Sorie MR, Vashi AA. Veterans’ experiences with accessing community emergency care. Mil Med. 2023;188(1-2):e58-e64. doi:10.1093/milmed/usab196
15. Levenson SA. Smart case review: a model for successful remote medical direction and enhanced nursing home quality improvement. J Am Med Dir Assoc. 2021;22(10):2212-2215.e6. doi:10.1016/j.jamda.2021.05.043
16. Donahue M, Bouhaddou O, Hsing N, et al. Veterans Health Information Exchange: successes and challenges of nationwide interoperability. AMIA Annu Symp Proc. 2018;2018:385-394. Published 2018 Dec 5.
17. US Department of Veterans Affairs. VHA Notice 2023-07. Community Nursing Home Program. September 5, 2023:1-4.
18. Helmer DA, Dwibedi N, Rowneki M, et al. Mental health conditions and hospitalizations for ambulatory care sensitive conditions among veterans with diabetes. Am Health Drug Benefits. 2020;13(2):61-71.
19. Rosen AK, Wagner TH, Pettey WBP, et al. Differences in risk scores of veterans receiving community care purchased by the Veterans Health Administration. Health Serv Res. 2018;53(suppl 3):5438-5454. doi:10.1111/1475-6773.13051
20. Mattocks KM, Kroll-Desrosiers A, Kinney R, Elwy AR, Cunningham KJ, Mengeling MA. Understanding VA’s use of and relationships with community care providers under the MISSION Act. Med Care. 2021;59(suppl 3):S252-S258. doi:10.1097/MLR.0000000000001545
The US Department of Veterans Affairs (VA) Community Nursing Home (CNH) program provides 24-hour skilled nursing care for eligible veterans in public or private community-based facilities that have established a contract to care for veterans. Veteran eligibility is based on service-connected status and level of disability, covering the cost of care for veterans who need long-term care because of their service-connected disability or for veterans with disabilities rated at ≥ 70%.1 Between 2014 and 2018, the average daily census of veterans in CNHs increased by 26% and the percentage of funds obligated to this program increased by 49%.2 The VA projects that the number of veterans receiving care in a CNH program will increase by 80% between 2017 and 2037, corresponding to a 149% increase in CNH expenditures.2
CNH program oversight teams are mandated at each VA medical center (VAMC) to monitor care coordination within the CNH program. These teams include nurses and social workers (SWs) who perform regular on-site assessments to monitor the clinical, functional, and psychosocial needs of veterans. These assessments include a review of the electronic health record (EHR) and face-to-face contact with veterans and CNH staff, regardless of the purchasing authority (hospice, long-term care, short-term rehabilitation, respite care).3 These teams represent key stakeholders impacted by CNH program expansion.
While the CNH program has focused primarily on the provision of long-term care, the VA is now expanding to include short-term rehabilitation through Veteran Care Agreements.4 These agreements are authorized under the MISSION Act, designed to improve care for veterans.5 Veteran Care Agreements are expected to be less burdensome to execute than traditional contracts and will permit the VA to partner with more CNHs, as noted in a Congressional Research Service report regarding long-term care services for veterans.6 However, increasing the number of CNHs increases demands on oversight teams, particularly if the coordinators are compelled to perform monthly on-site visits to facilities required under current guidelines.3
The objective of this study was to describe the experiences of VA and CNH staff involved in care coordination and the oversight of veterans receiving CNH care amid Veteran Care Agreement implementation and in anticipation of CNH program expansion. The results are intended to inform expansion efforts within the CNH program.
METHODS
This study was a component of a larger research project examining VA-purchased CNH care; recruitment methods are available in previous publications describing this work.7 Participants provided written or verbal consent before video and phone interviews, respectively. This study was approved by the Colorado Multiple Institutional Review Board (Protocol #18-1186).
Video and phone interviews were conducted by 3 team members from October 2018 to March 2020 with CNH staff and VA CNH program oversight team members. Participant recruitment was paused from May to October 2020 as a result of the COVID-19 pandemic and ambiguity about VA NH care purchasing policies following the passage of the VA MISSION Act.5 We used semistructured interview guides (eAppendix 1 for VA staff and eAppendix 2 for NH staff, available online at doi:10.12788/fp.0421). Recorded and transcribed interviews ranged from 15 to 90 minutes.
Two members of the research team analyzed transcripts using both deductive and inductive content analysis.8 The interview guide informed an a priori codebook, and in vivo codes were included as they emerged. We jointly coded 6 transcripts to reach a consensus on coding approaches and analyzed the remaining transcripts independently with frequent meetings to develop themes with a qualitative methodologist. All qualitative data were analyzed using ATLAS.ti software.
This was a retrospective observational study of veterans who received VA-paid care in CNHs during the 2019 fiscal year (10/1/2018-9/30/2019) using data from the enrollment, inpatient and outpatient encounters, and other care paid for by the VA in the VA Corporate Data Warehouse. We linked Centers for Medicare and Medicaid monthly Nursing Home Compare reports and the Brown University Long Term Care: Facts on Care in the US (LTC FoCUS) annual files to identify facility addresses.9
Descriptive analyses of quantitative data were conducted in parallel with the qualitative findings.8 Distance from the contracting VAMC to CNH was calculated using the greater-circle formula to find the linear distance between geographic coordinates. Quantitative and qualitative data were collected concurrently, analyzed independently, and integrated into the interpretation of results.10
RESULTS
We conducted 36 interviews with VA and NH staff who were affiliated with 6 VAMCs and 17 CNHs. Four themes emerged concerning CNH oversight: (1) benefits of VA CNH team engagement/visits; (2) burden of VA CNH oversight; (3) burden of oversight limited the ability to contract with additional NHs; and (4) factors that ease the burden and facilitate successful oversight.
Benefits of Engagement/Visits
VA SWs and nurses visit each veteran every 30 to 45 days to review their health records, meet with them, and check in with NH staff. In addition, VA SWs and nurses coordinate each veteran’s care by working as liaisons between the VA and the NH to help NH staff problem solve veteran-related issues through care conferences. VA SWs and nurses act as extra advocates for veterans to make sure their needs are met. “This program definitely helps ensure that veterans are receiving higher quality care because if we see that they aren’t, then we do something about it,” a VA NH coordinator reported in an interview.
NH staff noted benefits to monthly VA staff visits, including having an additional person coordinating care and built-in VA liaisons. “It’s nice to have that extra set of eyes, people that you can care plan with,” an NH administrator shared. “It’s definitely a true partnership, and we have open and honest conversations so we can really provide a good service for our veterans.”
Distance & High Veteran Census Burdens
VA participants described oversight components as burdensome. Specifically, several VA participants mentioned that the charting they completed in the facility during each visit proved time consuming and onerous, particularly for distant NHs. To accommodate veterans’ preferences to receive care in a facility close to their homes and families, VAMCs contract with NHs that are geographically spread out. “We’re just all spread out… staff have issues driving 2 and a half hours just to review charts all day,” a VA CNH coordinator explained. In 2019, the mean distance between VAMC and NH was 48 miles, with half located > 32 miles from the VAMC. One-quarter of NHs were > 70 miles and 44% were located > 50 miles from the VAMC (Figure 1).
Participants highlighted how regular oversight visits were particularly time consuming at CNHs with a large contracted population. VA nurses and SWs spend multiple days and up to a week conducting oversight visits at facilities with large numbers of veterans. Another VA nurse highlighted how charting requirements resulted in several days of documentation outside of the NH visit for facilities with many contracted veteran residents. Multiple VA participants noted that having many veterans at an NH exacerbated the oversight burdens. In 2019, 252 (28%) of VA CNHs had > 10 contracted veterans and 1 facility had 34 veterans (Figure 2). VA participants perceived having too many veterans concentrated at 1 facility as potentially challenging for CNHs due to the complex care needs of veterans and the added need for care coordination with the VA. One VA NH coordinator noted that while some facilities were “adept at being able to handle higher numbers” of veterans, others were “overwhelmed.” Too many veterans at an NH, an SW explained, might lead the “facility to fail because we are such a cumbersome system.”
Oversight & Staffing Burden
While several participants described wanting to contract with more NHs to avoid overwhelming existing CNHs and to increase choice for veterans, they expressed concerns about their ability to provide oversight at more facilities due to limited staffing and oversight requirements. Across VAMCs, the median number of VA CNHs varied substantially (Figure 3). One VA participant with about 35 CNHs explained that while adding more NHs could create “more opportunities and options” for veterans, it needs to be balanced with the required oversight responsibilities. One VA nurse insisted that more staff were needed to meet current and future oversight needs. “We’re all getting stretched pretty thin, and just so we don’t drop the ball on things… I would like to see a little more staff if we’re gonna have a lot more nursing homes.”
Participants had concerns related to the VA MISSION Act and the possibility of more VA-paid NHs for rehabilitation or short-term care. Participants underscored the necessity for additional staff to account for the increased oversight burden or a reduction in oversight requirements. One SW felt that increasing the number of CNHs would increase the required oversight and the need for collaboration with NH staff, which would limit her ability to establish close and trusting working relationships with NH staff. Participants also described the challenges of meeting their current oversight requirements, which limited extra visits for acute issues and care conferences. This was attributed to a lack of adequate staffing in the VA CNH program, given the time-intensive nature of VA oversight requirements.
Easing Burden & Facilitating Oversight
Participants noted how obtaining remote access to veterans’ EHRs allowed them to conduct chart reviews before oversight visits. This permitted more time for interaction with veterans and CNH staff as well as coordinating care. While providing access to the VA EHR would not change the chart review component of VA oversight, some participants felt it might improve care coordination between VA and NH staff during monthly visits.
Participants felt they were able to build strong working relationships with facilities with more veterans due to frequent communication and collaboration. VA participants also noted that CNHs with larger veteran censuses were more likely to respond to VA concerns about care to maintain the business relationship and contract. To optimize strong working relationships and decrease the challenges of having too many veterans at a facility, some VA participants suggested that CNH programs create a local policy to recommend the number of veterans placed in a CNH.
Discussion
Participants interviewed for this study echoed findings from previous work that identified the importance of developing trusted working relationships with CNHs to care for veterans.11,12 However, interorganizational care coordination, a shortage of health care professionals, and resource demands associated with caring for veterans reported in other community care settings were also noted in our findings.12,13
Building upon prior recommendations related to community care of veterans, our analysis identified key areas that could improve CNH program oversight efficiency, including: (1) improving the interoperability of EHRs to facilitate coordination of care and oversight; (2) addressing inefficiencies associated with traveling to geographically dispersed CNHs; and (3) “right-sizing” the number of veterans residing in each CNH.
The interoperability of EHRs has been cited by multiple studies of VA community care programs as critical to reducing inefficiencies and allowing more in-person time with veterans and staff in care coordination, especially at rural locations.11-15 The Veterans Health Information Exchange Program is designed to optimize data sharing as veterans are increasingly referred to non-VA care through the MISSION Act. This program is organized around patient engagement, clinician adoption, partner engagement, and technological capabilities.16
Unfortunately, significant barriers exist for the VA CNH program within each of these information exchange domains. For example, patient engagement requires veteran consent for consumer-initiated exchange of medical information, which is not practical due to the high prevalence of cognitive impairment in NHs. Similarly, VA consent requirements prohibit EHR download and sharing with non-VA facilities like CNHs, limiting use. eHealth Exchange partnerships allow organizations caring for veterans to connect with the VA via networks that provide a common trust agreement and technical compliance testing. Unfortunately, in 2017, only 257 NHs in which veterans received care nationally were eHealth Exchange partners, which indicates that while NHs could partner in this information exchange, very few did.16
Finally, while the exchange is possible, it is not practical; most CNHs lack the staff that would be required to support data transfer on their technology platform into the appropriate translational gateways. Although remote access to EHRs in CNHs improved during the pandemic, the Veterans Health Information Exchange Program is not designed to facilitate interoperability of VA and CNH records and remains a significant barrier for this and many other VA community care programs.
The dispersal of veterans across CNHs that are > 50 miles from the nearest VAMC represents an additional area to improve program efficiency and meet growing demands for rural care. While recent field guidance to CNH oversight teams reduces the frequency of visits by VA CNH teams, the burden of driving to each facility is not likely to decrease as CNHs increasingly offer rehabilitation as a part of Veteran Care Agreements.17 Visits performed by telehealth or by trained local VA staff may represent alternatives.15
Finally, interview participants described the ideal range of the number of veterans in each CNH necessary to optimize efficiencies. Veterans who rely more heavily upon VA care tend to have more medical and mental health comorbidities than average Medicare beneficiaries.18,19 This was reflected in participants’ recommendation to have enough veterans to benefit from economies of scale but to also identify a limit when efficiencies are lost. Given that most CNHs cared for 8 to 15 veterans, facilities seem to have identified how best to match the resources available with veterans’ care needs. Based on these observations, new care networks that will be established because of the MISSION Act may benefit from establishing evidence-based policies that support flexibility in oversight frequency and either allow for remote oversight or consolidate the number of CNHs to improve efficiencies in care provision and oversight.20
Limitations
Limitations include the unique relationship between VA and CNH staff overseeing the quality of care provided to veterans in CNHs, which is not replicated in other models of care. Data collection was interrupted following the passage of the MISSION Act in 2018 until guidance on changes to practice resulting from the law were clarified in 2020. Interviews were also interrupted at the onset of the COVID-19 pandemic.
Conclusions
The current quality of the CNH care oversight structure will require adaptation as demand for CNH care increases. While the VA CNH program is one of the longest-standing programs collaborating with non-VA community care partners, it is now only one of many following the MISSION Act. The success of this and other programs will depend on matching available CNH resources to the complex medical and psychological needs of veterans. At a time when strategies to ease the burden on NHs and VA CNH coordinators are desperately needed, Veterans Health Information Exchange capabilities need to improve. Evidence is needed to guide the scaling of the program to meet the needs of the rapidly expanding veteran population who are eligible for CNH care.
Acknowledgments
The authors acknowledge Amy Mochel of the Providence Veterans Affairs Medical Center for project management support of this project.
The US Department of Veterans Affairs (VA) Community Nursing Home (CNH) program provides 24-hour skilled nursing care for eligible veterans in public or private community-based facilities that have established a contract to care for veterans. Veteran eligibility is based on service-connected status and level of disability, covering the cost of care for veterans who need long-term care because of their service-connected disability or for veterans with disabilities rated at ≥ 70%.1 Between 2014 and 2018, the average daily census of veterans in CNHs increased by 26% and the percentage of funds obligated to this program increased by 49%.2 The VA projects that the number of veterans receiving care in a CNH program will increase by 80% between 2017 and 2037, corresponding to a 149% increase in CNH expenditures.2
CNH program oversight teams are mandated at each VA medical center (VAMC) to monitor care coordination within the CNH program. These teams include nurses and social workers (SWs) who perform regular on-site assessments to monitor the clinical, functional, and psychosocial needs of veterans. These assessments include a review of the electronic health record (EHR) and face-to-face contact with veterans and CNH staff, regardless of the purchasing authority (hospice, long-term care, short-term rehabilitation, respite care).3 These teams represent key stakeholders impacted by CNH program expansion.
While the CNH program has focused primarily on the provision of long-term care, the VA is now expanding to include short-term rehabilitation through Veteran Care Agreements.4 These agreements are authorized under the MISSION Act, designed to improve care for veterans.5 Veteran Care Agreements are expected to be less burdensome to execute than traditional contracts and will permit the VA to partner with more CNHs, as noted in a Congressional Research Service report regarding long-term care services for veterans.6 However, increasing the number of CNHs increases demands on oversight teams, particularly if the coordinators are compelled to perform monthly on-site visits to facilities required under current guidelines.3
The objective of this study was to describe the experiences of VA and CNH staff involved in care coordination and the oversight of veterans receiving CNH care amid Veteran Care Agreement implementation and in anticipation of CNH program expansion. The results are intended to inform expansion efforts within the CNH program.
METHODS
This study was a component of a larger research project examining VA-purchased CNH care; recruitment methods are available in previous publications describing this work.7 Participants provided written or verbal consent before video and phone interviews, respectively. This study was approved by the Colorado Multiple Institutional Review Board (Protocol #18-1186).
Video and phone interviews were conducted by 3 team members from October 2018 to March 2020 with CNH staff and VA CNH program oversight team members. Participant recruitment was paused from May to October 2020 as a result of the COVID-19 pandemic and ambiguity about VA NH care purchasing policies following the passage of the VA MISSION Act.5 We used semistructured interview guides (eAppendix 1 for VA staff and eAppendix 2 for NH staff, available online at doi:10.12788/fp.0421). Recorded and transcribed interviews ranged from 15 to 90 minutes.
Two members of the research team analyzed transcripts using both deductive and inductive content analysis.8 The interview guide informed an a priori codebook, and in vivo codes were included as they emerged. We jointly coded 6 transcripts to reach a consensus on coding approaches and analyzed the remaining transcripts independently with frequent meetings to develop themes with a qualitative methodologist. All qualitative data were analyzed using ATLAS.ti software.
This was a retrospective observational study of veterans who received VA-paid care in CNHs during the 2019 fiscal year (10/1/2018-9/30/2019) using data from the enrollment, inpatient and outpatient encounters, and other care paid for by the VA in the VA Corporate Data Warehouse. We linked Centers for Medicare and Medicaid monthly Nursing Home Compare reports and the Brown University Long Term Care: Facts on Care in the US (LTC FoCUS) annual files to identify facility addresses.9
Descriptive analyses of quantitative data were conducted in parallel with the qualitative findings.8 Distance from the contracting VAMC to CNH was calculated using the greater-circle formula to find the linear distance between geographic coordinates. Quantitative and qualitative data were collected concurrently, analyzed independently, and integrated into the interpretation of results.10
RESULTS
We conducted 36 interviews with VA and NH staff who were affiliated with 6 VAMCs and 17 CNHs. Four themes emerged concerning CNH oversight: (1) benefits of VA CNH team engagement/visits; (2) burden of VA CNH oversight; (3) burden of oversight limited the ability to contract with additional NHs; and (4) factors that ease the burden and facilitate successful oversight.
Benefits of Engagement/Visits
VA SWs and nurses visit each veteran every 30 to 45 days to review their health records, meet with them, and check in with NH staff. In addition, VA SWs and nurses coordinate each veteran’s care by working as liaisons between the VA and the NH to help NH staff problem solve veteran-related issues through care conferences. VA SWs and nurses act as extra advocates for veterans to make sure their needs are met. “This program definitely helps ensure that veterans are receiving higher quality care because if we see that they aren’t, then we do something about it,” a VA NH coordinator reported in an interview.
NH staff noted benefits to monthly VA staff visits, including having an additional person coordinating care and built-in VA liaisons. “It’s nice to have that extra set of eyes, people that you can care plan with,” an NH administrator shared. “It’s definitely a true partnership, and we have open and honest conversations so we can really provide a good service for our veterans.”
Distance & High Veteran Census Burdens
VA participants described oversight components as burdensome. Specifically, several VA participants mentioned that the charting they completed in the facility during each visit proved time consuming and onerous, particularly for distant NHs. To accommodate veterans’ preferences to receive care in a facility close to their homes and families, VAMCs contract with NHs that are geographically spread out. “We’re just all spread out… staff have issues driving 2 and a half hours just to review charts all day,” a VA CNH coordinator explained. In 2019, the mean distance between VAMC and NH was 48 miles, with half located > 32 miles from the VAMC. One-quarter of NHs were > 70 miles and 44% were located > 50 miles from the VAMC (Figure 1).
Participants highlighted how regular oversight visits were particularly time consuming at CNHs with a large contracted population. VA nurses and SWs spend multiple days and up to a week conducting oversight visits at facilities with large numbers of veterans. Another VA nurse highlighted how charting requirements resulted in several days of documentation outside of the NH visit for facilities with many contracted veteran residents. Multiple VA participants noted that having many veterans at an NH exacerbated the oversight burdens. In 2019, 252 (28%) of VA CNHs had > 10 contracted veterans and 1 facility had 34 veterans (Figure 2). VA participants perceived having too many veterans concentrated at 1 facility as potentially challenging for CNHs due to the complex care needs of veterans and the added need for care coordination with the VA. One VA NH coordinator noted that while some facilities were “adept at being able to handle higher numbers” of veterans, others were “overwhelmed.” Too many veterans at an NH, an SW explained, might lead the “facility to fail because we are such a cumbersome system.”
Oversight & Staffing Burden
While several participants described wanting to contract with more NHs to avoid overwhelming existing CNHs and to increase choice for veterans, they expressed concerns about their ability to provide oversight at more facilities due to limited staffing and oversight requirements. Across VAMCs, the median number of VA CNHs varied substantially (Figure 3). One VA participant with about 35 CNHs explained that while adding more NHs could create “more opportunities and options” for veterans, it needs to be balanced with the required oversight responsibilities. One VA nurse insisted that more staff were needed to meet current and future oversight needs. “We’re all getting stretched pretty thin, and just so we don’t drop the ball on things… I would like to see a little more staff if we’re gonna have a lot more nursing homes.”
Participants had concerns related to the VA MISSION Act and the possibility of more VA-paid NHs for rehabilitation or short-term care. Participants underscored the necessity for additional staff to account for the increased oversight burden or a reduction in oversight requirements. One SW felt that increasing the number of CNHs would increase the required oversight and the need for collaboration with NH staff, which would limit her ability to establish close and trusting working relationships with NH staff. Participants also described the challenges of meeting their current oversight requirements, which limited extra visits for acute issues and care conferences. This was attributed to a lack of adequate staffing in the VA CNH program, given the time-intensive nature of VA oversight requirements.
Easing Burden & Facilitating Oversight
Participants noted how obtaining remote access to veterans’ EHRs allowed them to conduct chart reviews before oversight visits. This permitted more time for interaction with veterans and CNH staff as well as coordinating care. While providing access to the VA EHR would not change the chart review component of VA oversight, some participants felt it might improve care coordination between VA and NH staff during monthly visits.
Participants felt they were able to build strong working relationships with facilities with more veterans due to frequent communication and collaboration. VA participants also noted that CNHs with larger veteran censuses were more likely to respond to VA concerns about care to maintain the business relationship and contract. To optimize strong working relationships and decrease the challenges of having too many veterans at a facility, some VA participants suggested that CNH programs create a local policy to recommend the number of veterans placed in a CNH.
Discussion
Participants interviewed for this study echoed findings from previous work that identified the importance of developing trusted working relationships with CNHs to care for veterans.11,12 However, interorganizational care coordination, a shortage of health care professionals, and resource demands associated with caring for veterans reported in other community care settings were also noted in our findings.12,13
Building upon prior recommendations related to community care of veterans, our analysis identified key areas that could improve CNH program oversight efficiency, including: (1) improving the interoperability of EHRs to facilitate coordination of care and oversight; (2) addressing inefficiencies associated with traveling to geographically dispersed CNHs; and (3) “right-sizing” the number of veterans residing in each CNH.
The interoperability of EHRs has been cited by multiple studies of VA community care programs as critical to reducing inefficiencies and allowing more in-person time with veterans and staff in care coordination, especially at rural locations.11-15 The Veterans Health Information Exchange Program is designed to optimize data sharing as veterans are increasingly referred to non-VA care through the MISSION Act. This program is organized around patient engagement, clinician adoption, partner engagement, and technological capabilities.16
Unfortunately, significant barriers exist for the VA CNH program within each of these information exchange domains. For example, patient engagement requires veteran consent for consumer-initiated exchange of medical information, which is not practical due to the high prevalence of cognitive impairment in NHs. Similarly, VA consent requirements prohibit EHR download and sharing with non-VA facilities like CNHs, limiting use. eHealth Exchange partnerships allow organizations caring for veterans to connect with the VA via networks that provide a common trust agreement and technical compliance testing. Unfortunately, in 2017, only 257 NHs in which veterans received care nationally were eHealth Exchange partners, which indicates that while NHs could partner in this information exchange, very few did.16
Finally, while the exchange is possible, it is not practical; most CNHs lack the staff that would be required to support data transfer on their technology platform into the appropriate translational gateways. Although remote access to EHRs in CNHs improved during the pandemic, the Veterans Health Information Exchange Program is not designed to facilitate interoperability of VA and CNH records and remains a significant barrier for this and many other VA community care programs.
The dispersal of veterans across CNHs that are > 50 miles from the nearest VAMC represents an additional area to improve program efficiency and meet growing demands for rural care. While recent field guidance to CNH oversight teams reduces the frequency of visits by VA CNH teams, the burden of driving to each facility is not likely to decrease as CNHs increasingly offer rehabilitation as a part of Veteran Care Agreements.17 Visits performed by telehealth or by trained local VA staff may represent alternatives.15
Finally, interview participants described the ideal range of the number of veterans in each CNH necessary to optimize efficiencies. Veterans who rely more heavily upon VA care tend to have more medical and mental health comorbidities than average Medicare beneficiaries.18,19 This was reflected in participants’ recommendation to have enough veterans to benefit from economies of scale but to also identify a limit when efficiencies are lost. Given that most CNHs cared for 8 to 15 veterans, facilities seem to have identified how best to match the resources available with veterans’ care needs. Based on these observations, new care networks that will be established because of the MISSION Act may benefit from establishing evidence-based policies that support flexibility in oversight frequency and either allow for remote oversight or consolidate the number of CNHs to improve efficiencies in care provision and oversight.20
Limitations
Limitations include the unique relationship between VA and CNH staff overseeing the quality of care provided to veterans in CNHs, which is not replicated in other models of care. Data collection was interrupted following the passage of the MISSION Act in 2018 until guidance on changes to practice resulting from the law were clarified in 2020. Interviews were also interrupted at the onset of the COVID-19 pandemic.
Conclusions
The current quality of the CNH care oversight structure will require adaptation as demand for CNH care increases. While the VA CNH program is one of the longest-standing programs collaborating with non-VA community care partners, it is now only one of many following the MISSION Act. The success of this and other programs will depend on matching available CNH resources to the complex medical and psychological needs of veterans. At a time when strategies to ease the burden on NHs and VA CNH coordinators are desperately needed, Veterans Health Information Exchange capabilities need to improve. Evidence is needed to guide the scaling of the program to meet the needs of the rapidly expanding veteran population who are eligible for CNH care.
Acknowledgments
The authors acknowledge Amy Mochel of the Providence Veterans Affairs Medical Center for project management support of this project.
1. Miller EA, Gadbois E, Gidmark S, Intrator O. Purchasing nursing home care within the Veterans Health Administration: lessons for nursing home recruitment, contracting, and oversight. J Health Admin Educ. 2015;32(2):165-197.
2. GAO. VA health care. Veterans’ use of long-term care is increasing, and VA faces challenges in meeting the demand. February 19, 2020. Accessed September 19, 2023. https://www.gao.gov/assets/gao-20-284.pdf
3. VHA Handbook 1143.2, VHA community nursing home oversight procedures. US Department of Veterans Affairs, Veterans Health Administration. June 2004. https://www.vendorportal.ecms.va.gov/FBODocumentServer/DocumentServer.aspx?DocumentId=3740930&FileName=VA259-17-Q-0501-007.pdf
4. Community care: veteran care agreements. US Department of Veterans Affairs. 2022. Updated August 8, 2023. Accessed September 7, 2023. https://www.va.gov/COMMUNITYCARE/providers/Veterans_Care_Agreements.asp
5. Massarweh NN, Itani KMF, Morris MS. The VA MISSION Act and the future of veterans’ access to quality health care. JAMA. 2020;324(4):343-344. doi:10.1001/jama.2020.4505
6. Colello KJ, Panangala SV; Congressional Research Service. Long-term care services for veterans. February 14, 2017. Accessed September 7, 2023. https://crsreports.congress.gov/product/pdf/R/R44697
7. Magid KH, Galenbeck E, Haverhals LM, et al. Purchasing high-quality community nursing home care: a will to work with VHA diminished by contracting burdens. J Am Med Dir Assoc. 2022;23(11):1757-1764. doi:10.1016/j.jamda.2022.03.007
8. Vaismoradi M, Turunen H, Bondas T. Content analysis and thematic analysis: Implications for conducting a qualitative descriptive study. Nurs Health Sci. 2013;15(3):398-405. doi:10.1111/nhs.12048
9. Brown University. LTC Focus. Accessed September 18, 2023. https://ltcfocus.org/about
10. Zhang W, Creswell J. The use of “mixing” procedure of mixed methods in health services research. Med Care. 2013;51(8):e51-e57. doi:10.1097/MLR.0b013e31824642fd
11. Haverhals LM, Magid KH, Blanchard KN, Levy CR. Veterans Health Administration staff perceptions of overseeing care in community nursing homes during COVID-19. Gerontol Geriatr Med. 2022;8:23337214221080307. Published 2022 Feb 15. doi:10.1177/23337214221080307
12. Garvin LA, Pugatch M, Gurewich D, Pendergast JN, Miller CJ. Interorganizational care coordination of rural veterans by Veterans Affairs and community care programs: a systematic review. Med Care. 2021;59(suppl 3):S259-S269. doi:10.1097/MLR.0000000000001542
13. Schlosser J, Kollisch D, Johnson D, Perkins T, Olson A. VA-community dual care: veteran and clinician perspectives. J Community Health. 2020;45(4):795-802. doi:10.1007/s10900-020-00795-y
14. Nevedal AL, Wong EP, Urech TH, Peppiatt JL, Sorie MR, Vashi AA. Veterans’ experiences with accessing community emergency care. Mil Med. 2023;188(1-2):e58-e64. doi:10.1093/milmed/usab196
15. Levenson SA. Smart case review: a model for successful remote medical direction and enhanced nursing home quality improvement. J Am Med Dir Assoc. 2021;22(10):2212-2215.e6. doi:10.1016/j.jamda.2021.05.043
16. Donahue M, Bouhaddou O, Hsing N, et al. Veterans Health Information Exchange: successes and challenges of nationwide interoperability. AMIA Annu Symp Proc. 2018;2018:385-394. Published 2018 Dec 5.
17. US Department of Veterans Affairs. VHA Notice 2023-07. Community Nursing Home Program. September 5, 2023:1-4.
18. Helmer DA, Dwibedi N, Rowneki M, et al. Mental health conditions and hospitalizations for ambulatory care sensitive conditions among veterans with diabetes. Am Health Drug Benefits. 2020;13(2):61-71.
19. Rosen AK, Wagner TH, Pettey WBP, et al. Differences in risk scores of veterans receiving community care purchased by the Veterans Health Administration. Health Serv Res. 2018;53(suppl 3):5438-5454. doi:10.1111/1475-6773.13051
20. Mattocks KM, Kroll-Desrosiers A, Kinney R, Elwy AR, Cunningham KJ, Mengeling MA. Understanding VA’s use of and relationships with community care providers under the MISSION Act. Med Care. 2021;59(suppl 3):S252-S258. doi:10.1097/MLR.0000000000001545
1. Miller EA, Gadbois E, Gidmark S, Intrator O. Purchasing nursing home care within the Veterans Health Administration: lessons for nursing home recruitment, contracting, and oversight. J Health Admin Educ. 2015;32(2):165-197.
2. GAO. VA health care. Veterans’ use of long-term care is increasing, and VA faces challenges in meeting the demand. February 19, 2020. Accessed September 19, 2023. https://www.gao.gov/assets/gao-20-284.pdf
3. VHA Handbook 1143.2, VHA community nursing home oversight procedures. US Department of Veterans Affairs, Veterans Health Administration. June 2004. https://www.vendorportal.ecms.va.gov/FBODocumentServer/DocumentServer.aspx?DocumentId=3740930&FileName=VA259-17-Q-0501-007.pdf
4. Community care: veteran care agreements. US Department of Veterans Affairs. 2022. Updated August 8, 2023. Accessed September 7, 2023. https://www.va.gov/COMMUNITYCARE/providers/Veterans_Care_Agreements.asp
5. Massarweh NN, Itani KMF, Morris MS. The VA MISSION Act and the future of veterans’ access to quality health care. JAMA. 2020;324(4):343-344. doi:10.1001/jama.2020.4505
6. Colello KJ, Panangala SV; Congressional Research Service. Long-term care services for veterans. February 14, 2017. Accessed September 7, 2023. https://crsreports.congress.gov/product/pdf/R/R44697
7. Magid KH, Galenbeck E, Haverhals LM, et al. Purchasing high-quality community nursing home care: a will to work with VHA diminished by contracting burdens. J Am Med Dir Assoc. 2022;23(11):1757-1764. doi:10.1016/j.jamda.2022.03.007
8. Vaismoradi M, Turunen H, Bondas T. Content analysis and thematic analysis: Implications for conducting a qualitative descriptive study. Nurs Health Sci. 2013;15(3):398-405. doi:10.1111/nhs.12048
9. Brown University. LTC Focus. Accessed September 18, 2023. https://ltcfocus.org/about
10. Zhang W, Creswell J. The use of “mixing” procedure of mixed methods in health services research. Med Care. 2013;51(8):e51-e57. doi:10.1097/MLR.0b013e31824642fd
11. Haverhals LM, Magid KH, Blanchard KN, Levy CR. Veterans Health Administration staff perceptions of overseeing care in community nursing homes during COVID-19. Gerontol Geriatr Med. 2022;8:23337214221080307. Published 2022 Feb 15. doi:10.1177/23337214221080307
12. Garvin LA, Pugatch M, Gurewich D, Pendergast JN, Miller CJ. Interorganizational care coordination of rural veterans by Veterans Affairs and community care programs: a systematic review. Med Care. 2021;59(suppl 3):S259-S269. doi:10.1097/MLR.0000000000001542
13. Schlosser J, Kollisch D, Johnson D, Perkins T, Olson A. VA-community dual care: veteran and clinician perspectives. J Community Health. 2020;45(4):795-802. doi:10.1007/s10900-020-00795-y
14. Nevedal AL, Wong EP, Urech TH, Peppiatt JL, Sorie MR, Vashi AA. Veterans’ experiences with accessing community emergency care. Mil Med. 2023;188(1-2):e58-e64. doi:10.1093/milmed/usab196
15. Levenson SA. Smart case review: a model for successful remote medical direction and enhanced nursing home quality improvement. J Am Med Dir Assoc. 2021;22(10):2212-2215.e6. doi:10.1016/j.jamda.2021.05.043
16. Donahue M, Bouhaddou O, Hsing N, et al. Veterans Health Information Exchange: successes and challenges of nationwide interoperability. AMIA Annu Symp Proc. 2018;2018:385-394. Published 2018 Dec 5.
17. US Department of Veterans Affairs. VHA Notice 2023-07. Community Nursing Home Program. September 5, 2023:1-4.
18. Helmer DA, Dwibedi N, Rowneki M, et al. Mental health conditions and hospitalizations for ambulatory care sensitive conditions among veterans with diabetes. Am Health Drug Benefits. 2020;13(2):61-71.
19. Rosen AK, Wagner TH, Pettey WBP, et al. Differences in risk scores of veterans receiving community care purchased by the Veterans Health Administration. Health Serv Res. 2018;53(suppl 3):5438-5454. doi:10.1111/1475-6773.13051
20. Mattocks KM, Kroll-Desrosiers A, Kinney R, Elwy AR, Cunningham KJ, Mengeling MA. Understanding VA’s use of and relationships with community care providers under the MISSION Act. Med Care. 2021;59(suppl 3):S252-S258. doi:10.1097/MLR.0000000000001545
Impact of Liraglutide to Semaglutide Conversion on Glycemic Control and Cost Savings at a Veterans Affairs Medical Center
Semaglutide and liraglutide are glucagon-like peptide 1 receptor agonists (GLP-1 RAs) that are approved by the US Food and Drug Administration as subcutaneous injections for patients with type 2 diabetes mellitus (T2DM). Both are recommended by the American Diabetes Association (ADA) as first-line options for patients with concomitant atherosclerotic cardiovascular (CV) disease and exert therapeutic effect via incretin-like mechanisms.1 These agents lower blood glucose levels by stimulating insulin release, increasing the body’s sensitivity to insulin, and inhibiting inappropriate glucagon secretion.2,3 They also slow gastric emptying, resulting in decreased appetite and potential weight loss.4
The SUSTAIN (1-7) trials concluded that semaglutide presented an equivalent safety profile and greater efficacy compared with other GLP-1 RAs, including exenatide and dulaglutide.2 The SUSTAIN-10 open-label, head-to-head trial evaluating 1 mg semaglutide once weekly vs 1.2 mg liraglutide daily concluded that semaglutide was superior in hemoglobin A1c (HbA1c) and body weight reduction compared with liraglutide, with slightly increased gastrointestinal (GI) adverse effects (AEs).5 Similar to the LEADER trial assessing liraglutide, SUSTAIN-6 evaluated semaglutide in patients at increased CV risk and found that compared with placebo, semaglutide decreased rates of serious CV events, such as CV death, myocardial infarction, and stroke and were similar to the CV outcomes in the LEADER trial.2,6 Although initial results of the SUSTAIN-6 trial were thought to be nearly equivalent to the LEADER trial, analyses later published comparing both trials noted that semaglutide had more potent HbA1c lowering and weight loss benefit when compared with liraglutide.2,6 The cardioprotective outcomes of SUSTAIN-6 qualified semaglutide for inclusion in the current ADA Standards of Medical Care recommendations for CV risk reduction.6,7 However, despite the CV safety profile and efficacy associated with semaglutide, the SUSTAIN-6 trial noted an increased risk of diabetic retinopathy (DR) complications in 50 of 1648 patients (3%) treated with semaglutide compared with 29 of 1649 (1.8%) who received placebo (P = .02; hazard ratio, 1.76; 95% CI, 1.11-2.78).6 Of the 79 total patients who experienced retinopathy complications, 66 had retinopathy at baseline (42 of 50 [84%]) in the semaglutide group; 24 of 29 [83%] in the placebo group).6 Worsening of DR became one of the most notable AEs of semaglutide evaluated in clinical trials. This further deemed the effect as a warning in the semaglutide package insert to assist clinicians with treatment decisions.
As part of a US Department of Veterans Affairs (VA) National Lost Opportunity Cost Savings Initiative, which encompasses administrative efforts to promote more cost-effective yet safe and efficacious therapy options for veterans, the Michael E. DeBakey VA Medical Center (MEDVAMC) in Houston, Texas, converted a portion of patients with T2DM established on liraglutide to semaglutide. The 30-day supply cost of the 2-pack liraglutide 6 mg/mL (3 mL) injection pens for the MEDVAMC was $197.64. The 30-day supply cost for the singular multidose semaglutide 0.5 mg/0.375 mL (1.5 mL) injection pen was $115.15. Cost savings for the MEDVAMC facility were initially estimated to reach $642,522.
The subset of patients converted had to have undergone teleretinal imaging and not have a diagnosis of nonproliferative DR (NPDR), proliferative DR (PDR), or PDR with or without
In the fall of 2021, there was also a standing list of patients on liraglutide who were not converted due to a lack of teleretinal imaging. As a result, there was potential for a quality improvement (QI) intervention to target this patient population, which could result in further cost savings for MEDVAMC and improved glycemic control because of increased conversion from liraglutide to semaglutide. The purpose of this project was to perform a QI assessment on this subset of patients both initially converted from liraglutide to semaglutide, and those who were yet to be converted due to a lack of teleretinal imaging to determine the impact on glycemic control and cost savings.
Methods
This QI project was a single-center, prospective cohort study with a retrospective chart review of veterans with T2DM converted from liraglutide to semaglutide at the MEDVAMC. Patient data were collected from the Computerized Patient Record System (CPRS) between March 1, 2021, and November 30, 2021. An initial subset of patients was converted to semaglutide in March and April 2021. Patients initially excluded underwent a second chart review to determine whether they truly met exclusion criteria. Patients who did not have a definitive diagnosis of NPDR or PDR, those due for updated teleretinal imaging, as well as those with updated teleretinal imaging that excluded NPDR or PDR were targeted for clinician education interventions.
Following this intervention, a subset of patients with negative DR findings were converted from liraglutide to semaglutide. Primary care and endocrinology clinicians were notified that patients who met the criteria should be referred for teleretinal imaging if no updated results were present or that patients were eligible for semaglutide conversion based on negative findings. Both patients who were initially converted as well as those converted following education were included for data collection/analysis of glycemic control via HbA1c and blood glucose levels.
Cost savings were evaluated using outpatient pharmacy procurement pricing data. This project was approved by the MEDVAMC Quality Assurance and Regulatory Affairs Office.
Participants
Patients included in the study were adults aged ≥ 18 years with T2DM, converted from liraglutide 0.6 and 1.2 mg daily to semaglutide 0.25 mg weekly (titrated to 0.5 mg weekly after 4 weeks), and had an active prescription for semaglutide, with or without insulin or other oral antihyperglycemics. Patients with NPDR or PDR, type 1 DM, no HbA1c data, no filled semaglutide prescriptions, insulin pumps, and those without teleretinal imaging within the postintervention period or who died during the study period were excluded.
Patient baseline characteristics collected included demographic data, CV comorbidities, antihyperglycemic medications, and changes in insulin doses. Parameters analyzed at baseline and 3 to 12 months postconversion included body weight, HbA1c, and blood glucose levels.
Outcomes
The primary objectives of this QI project were to assess glycemic control (via changes in HbA1c levels) and cost savings following patient conversion from liraglutide to semaglutide. A second objective was to educate clinicians for referral of T2DM patients without teleretinal imaging in the past 2 years.
The purpose of the latter objective was to encourage conversion from liraglutide to semaglutide in the absence of DR. We predicted that 50% of patients with clinician education would be converted. Secondary objectives included assessing body weight differences, evaluating modifications in diabetes regimen, and documenting AEs. We predicted that glycemic control would either remain stable or improve with conversion to semaglutide.
Statistical Analysis
Patient demographic data were analyzed using descriptive statistics. Quantitative data (HbA1c, blood glucose, and body weight differences as continuous variables) were analyzed using a paired Student t test, and categorical variables were analyzed using the χ2 test.
Results
During the study period, 692 patients were identified with active liraglutide prescriptions (Figure). Of these, 49 patients who were initially excluded due to outdated teleretinal imaging or negative findings met the criteria for clinician education, and 14 of those 49 patients (28.6%) were converted from liraglutide to semaglutide. Thirty-three patients (67.3%) did not schedule teleretinal imaging or did not convert to semaglutide following negative teleretinal findings. Two patients (4.1%) either scheduled or proceeded with teleretinal imaging, without any further action from the clinician.
Including the 14 patients converted posteducational intervention, 425 patients were converted to semaglutide. Excluded from analysis were 121 patients: 57 for incomplete HbA1c data or no filled semaglutide prescription; 30 for HbA1c and weight data outside of the study timeframe; 25 died of causes unrelated to the project; 8 had insulin pumps; and 1 was diagnosed with late-onset type 1 DM. The final sample was 304 patients who underwent analysis.
Two hundred seventy-three patients (89.8%) were male, and 180 (59.2%) were White (Table 1). The mean (SD) age of patients was 65.9 (9.6) years, and 236 (77.6%) were established on insulin therapy (either basal, bolus, or a combination). The 3 most common antihyperglycemic agents (other than insulin) that patients used included 185 metformin (60.9%), 104 empagliflozin (34.2%), and 50 glipizide (16.4%) prescriptions.
Most patients had CV disease. Three hundred patients (98.7%) had comorbid hypertension, 298 (98.0%) had hyperlipidemia, and 114 (37.5%) had coronary artery disease (Table 2). Other diseases that patients were concomitantly diagnosed with included peripheral vascular disease, heart failure, history of stroke or transient ischemic attack, and history of myocardial infarction.
Documented AEs included 83 patients (27.3%) with hypoglycemia at any point within 3 to 12 months of conversion and 25 patients (8.2%) with mainly GI-related events, including nausea, vomiting, diarrhea, decreased appetite, and abdominal pain. Six patients (2.0%) had a new diagnosis of DR 3 to 12 months postconversion.
Glycemic Control and Weight Changes
At baseline, mean (SD) HbA1c was 8.1% (1.5), blood glucose was 187.4 (44.2) mg/dL, and body weight was 112.9 (23.0) kg (Table 3). In the timeframe evaluated (3 to 12 months postconversion), patients’ mean (SD) HbA1c was found to have significantly decreased to 7.6% (1.4) (P < .001; 95% CI, -0.7 to -0.3), blood glucose decreased to 172.6 (39.0) mg/dL (P < .001; 95% CI, -19.3 to -10.2), and body weight decreased to 105.2 (32.3) kg (P < .001; 95% CI, -10.6 to -4.8). All parameters evaluated were deemed statistically significant.
Further analyses evaluating specific changes in HbA1c observed postconversion are as follows: 199 patients (65.5%) experienced a decrease, 92 (30.3%) experienced an increase, and 13 (4.3%) experienced no change in their HbA1c.
As the timeframe was fairly broad to assess HbA1c changes, a prespecified subgroup analysis was conducted to determine specific changes in HbA1c within 3 to 6, 6 to 9, and 9 to 12 months postconversion (Table 4). At 3 to 6 months postconversion, patient mean (SD) HbA1c levels significantly decreased from 8.2% (1.5) at baseline to 7.6% (1.3) postconversion (P = .002; 95% CI, -1.0 to -0.2). At 6 to 9 months postconversion, the mean (SD) HbA1c significantly decreased from 8.1% (1.5) at baseline to 7.6% (1.4) postconversion (P = .002; 95% CI, -0.8 to -0.2).
Glucose-Lowering Agent Adjustments
One hundred thirteen patients (37.2%) required no changes to their antihyperglycemic regimen with the conversion, 85 (28.0%) required increased insulin doses, and 77 (25.3%) required decreased insulin doses (Table 5). Forty-five (14.8%) patients underwent discontinuation of either insulin or other antihyperglycemic agents; 44 (14.5%) had other antihyperglycemic agents dose increased, 39 (12.8%) required adding other glucose-lowering agents, 28 (9.2%) discontinued semaglutide, and 10 (3.3%) had other glucose-lowering medication doses decreased.
Cost Savings
Cost savings were evaluated using the MEDVAMC outpatient pharmacy procurement service. The total cost savings per patient per month was $82.49. For the 411 preclinician education patients converted to semaglutide, this resulted in a prospective annual cost savings of $406,840.68. An additional $13,858.32 was saved due to the intervention/clinician education for 14 patients converted to semaglutide. The total annual cost savings was $420,699.00.
Discussion
Overall, glycemic control significantly improved with veterans’ conversion from liraglutide to semaglutide. Not only were significant changes noted with HbA1c levels and weight, but consistencies were noted with mean HbA1c decrease and weight loss expected of GLP-1 RAs noted in clinical trials. The typical range for HbA1c changes expected is -1% to -2% and weight loss of 1 to 6 kg.4,7 Data from the LEAD-5 and SUSTAIN-4 trials, evaluating glycemic control in liraglutide and semaglutide, respectively, have noted comparable yet slightly more potent HbA1c decreases (-1.33% for liraglutide 1.8 mg daily vs -1.2% and -1.6% for semaglutide 0.5 mg and 1 mg weekly, respectively).8,9 However, more robust weight loss has been noted with semaglutide vs liraglutide (-4.62 kg for semaglutide 0.5 mg weekly and -6.33 kg for semaglutide 1 mg weekly vs -3.43 kg for liraglutide 1.8 mg daily).8,9 Results from the SUSTAIN-10 trial also noted mean changes in HbA1c of -1.7% for semaglutide 1 mg weekly vs -1.0% for liraglutide 1.2 mg daily; mean body weight differences were -5.8 kg for semaglutide and -1.9 kg for liraglutide at their respective doses.5 The mean weight loss noted with this QI project is consistent with prior trials of semaglutide.
Of note, 44 patients (14.5%) required the dosage increase of either one or multiple additional glucose-lowering agents at any time point within the 3- to 12-month period. Of those patients, 38 (86.4%) underwent further semaglutide dose titration to 1 mg weekly. Common reasons for a further dose increase to 1 mg weekly were an indication for more robust HbA1c lowering, a desire to decrease patients’ either basal or bolus insulin requirements, or a treatment goal of completely titrating patients off insulin.
It is uncertain why 30.3% of patients experienced an increase in HbA1c and 4.3% experienced no change. However, possibilities for the divergence in HbA1c outcomes in these subsets of patients may include suboptimal adherence to semaglutide or other antihyperglycemic agents as indicated by clinicians or nonadherence to dietary and lifestyle modifications.
Most patients (65.5%) experienced a decrease in HbA1c because of conversion to semaglutide, and
At the MEDVAMC, liraglutide is a nonformulary agent and semaglutide is now the formulary-preferred option. For patients with uncontrolled T2DM, if a GLP-1 RA is desired for therapy, clinicians are to place a prior authorization drug request (PADR) consultation for semaglutide for further evaluation and review of VA Criteria for Use (CFU) by clinical pharmacist practitioners. Liraglutide is the alternative option if patients do not meet the CFU for semaglutide (ie, have a diagnosis of DR among other exclusions). However, the semaglutide CFU was updated in April 2022 to exclude those specifically diagnosed with PDR, severe NPDR, and macular edema unless an ophthalmologist deems semaglutide acceptable. This indicates that patients with mild-to-moderate NPDR (who were originally excluded from this QI project) are now eligible to receive semaglutide. The incidence of new DR diagnoses (2%) observed in this study could indicate an unclear relationship between semaglutide and increased rates of DR; however, no definitive correlation can be established due to the retrospective nature of this project. The implications of the results of this QI project in relation to the updated CFU remain undetermined.
Due to the comparable improvements in HbA1c and more robust weight loss noted with semaglutide vs liraglutide, we deem it appropriate to select semaglutide as the more cost-efficient GLP-1 RA and formulary preferred option. The data of this QI project supports the overall safety and treatment utility of this option. Although significant cost savings were achieved (> $400,000), the long-term benefit of the liraglutide to semaglutide conversion remains unknown.
Strengths and Limitations
Strengths of this project include the large sample size, its setting in a large VA medical center, and the evaluation of multiple outcomes beyond HbA1c for assessment of glycemic control (ie, mean blood glucose, insulin titration, and dose adjustment of other glucose-lowering agents).
Limitations of this study include the retrospective chart review used for data collection, limited accuracy of objective data due to the COVID-19 pandemic, and inconsistencies with documentation in patients’ electronic health records. As a protective measure in the height of the pandemic between March 2021 and November 2021, the VA promoted using telephone and virtual-visit clinics to minimize exposure for patients with nonurgent follow-up needs. Patient hesitance to present to the clinic in person due to COVID-19 was also a significant factor in obtaining objective follow-up data. As a result, less accurate and timely baseline and postconversion weight and HbA1c data resulted, leading to our decision to extend the timeframe evaluated postconversion to 3 to 12 months. We also noted inconsistencies with documentation in CPRS. Unless veterans were closely followed by clinical pharmacist practitioners or endocrine consultation service clinicians, it was more difficult to follow and document trends of insulin titration to assess the impact of semaglutide conversion. The number of AEs, including hypoglycemia and GI intolerance, were also not consistently documented within the CPRS, and the frequency of AEs may be underestimated.
Another possible limitation regarding the interpretation of the results includes the portion of patients titrated up to semaglutide 1 mg weekly. As the focal point of this project was to review changes in glycemic control in the conversion to semaglutide 0.5 mg, this population of patients converted to 1 mg could potentially overestimate the HbA1c and weight changes described, as it is consistent with the SUSTAIN trials that show more robust decreases in those parameters described earlier.
Conclusions
A subset of patients with T2DM converted from liraglutide to semaglutide experienced significant changes in glycemic control and body weight. Significant differences were noted for a decreased HbA1c, decreased mean blood glucose, and weight loss. A fair portion of patients’ antihyperglycemic regimens required no changes on conversion to semaglutide. Although the semaglutide discontinuation rate neared 10%, AEs that may have contributed to this discontinuation rate included hypoglycemia and GI intolerance. Clinician education resulted in a substantial number of patients undergoing teleretinal imaging and further conversion to semaglutide; however, due to the low conversion response rate, a more effective method of educating clinicians is warranted. Although the semaglutide cost savings initiative at MEDVAMC resulted in significant savings, a full cost-effective analysis is needed to assess more comprehensive institution savings.
1. ElSayed NA, Aleppo G, Aroda VR, et al. 9. Pharmacologic Approaches to Glycemic Treatment: Standards of Care in Diabetes-2023. Diabetes Care. 2023;46(suppl 1):S140-S157. doi:10.2337/dc23-S009
2. Aroda VR, Ahmann A, Cariou B, et al. Comparative efficacy, safety, and cardiovascular outcome with once-weekly subcutaneous semaglutide in the treatment of type 2 diabetes: insights from the SUSTAIN 1-7 trials. Diabetes Metab. 2019;45(5):409-418. doi:10.1016/j.diabet.2018.12.001
3. Trujillo JM, Nuffer W, Smith BA. GLP-1 receptor agonists: an updated review of head-to-head clinical studies. Ther Adv Endocrinol Metab. 2021;12:2042018821997320. Published 2021 Mar 9. doi:10.1177/2042018821997320
4. Drucker DJ. Mechanisms of action and therapeutic application of glucagon-like peptide-1. Cell Metab. 2018;27(4):740-756. doi:10.1016/j.cmet.2018.03.001
5. Capehorn MS, Catarig AM, Furberg JK, et al. Efficacy and safety of once-weekly semaglutide 1.0mg vs once-daily liraglutide 1.2mg as add-on to 1-3 oral antidiabetic drugs in subjects with type 2 diabetes (SUSTAIN 10). Diabetes Metab. 2020;46(2):100-109. doi:10.1016/j.diabet.2019.101117
6. Marso SP, Bain SC, Consoli A, et al; SUSTAIN-6 Investigators. Semaglutide and cardiovascular outcomes in patients with type 2 diabetes. N Engl J Med. 2016;375(19):1834-1844. doi:10.1056/NEJMoa1607141
7. ElSayed NA, Aleppo G, Aroda VR, et al. 10. Cardiovascular Disease and Risk Management: Standards of Care in Diabetes-2023. Diabetes Care. 2023;46(suppl 1):S158-S190. doi:10.2337/dc23-S010
8. Russell-Jones D, Vaag A, Schmitz O, et al. Liraglutide vs insulin glargine and placebo in combination with metformin and sulfonylurea therapy in type 2 diabetes mellitus (LEAD-5 met+SU): a randomised controlled trial. Diabetologia. 2009;52(10):2046-2055. doi:10.1007/s00125-009-1472-y
9. Aroda VR, Bain SC, Cariou B, et al. Efficacy and safety of once-weekly semaglutide versus once-daily insulin glargine as add-on to metformin (with or without sulfonylureas) in insulin-naive patients with type 2 diabetes (SUSTAIN 4): a randomised, open-label, parallel-group, multicentre, multinational, phase 3a trial. Lancet Diabetes Endocrinol. 2017;5(5):355-366. doi:10.1016/S2213-8587(17)30085-2
Semaglutide and liraglutide are glucagon-like peptide 1 receptor agonists (GLP-1 RAs) that are approved by the US Food and Drug Administration as subcutaneous injections for patients with type 2 diabetes mellitus (T2DM). Both are recommended by the American Diabetes Association (ADA) as first-line options for patients with concomitant atherosclerotic cardiovascular (CV) disease and exert therapeutic effect via incretin-like mechanisms.1 These agents lower blood glucose levels by stimulating insulin release, increasing the body’s sensitivity to insulin, and inhibiting inappropriate glucagon secretion.2,3 They also slow gastric emptying, resulting in decreased appetite and potential weight loss.4
The SUSTAIN (1-7) trials concluded that semaglutide presented an equivalent safety profile and greater efficacy compared with other GLP-1 RAs, including exenatide and dulaglutide.2 The SUSTAIN-10 open-label, head-to-head trial evaluating 1 mg semaglutide once weekly vs 1.2 mg liraglutide daily concluded that semaglutide was superior in hemoglobin A1c (HbA1c) and body weight reduction compared with liraglutide, with slightly increased gastrointestinal (GI) adverse effects (AEs).5 Similar to the LEADER trial assessing liraglutide, SUSTAIN-6 evaluated semaglutide in patients at increased CV risk and found that compared with placebo, semaglutide decreased rates of serious CV events, such as CV death, myocardial infarction, and stroke and were similar to the CV outcomes in the LEADER trial.2,6 Although initial results of the SUSTAIN-6 trial were thought to be nearly equivalent to the LEADER trial, analyses later published comparing both trials noted that semaglutide had more potent HbA1c lowering and weight loss benefit when compared with liraglutide.2,6 The cardioprotective outcomes of SUSTAIN-6 qualified semaglutide for inclusion in the current ADA Standards of Medical Care recommendations for CV risk reduction.6,7 However, despite the CV safety profile and efficacy associated with semaglutide, the SUSTAIN-6 trial noted an increased risk of diabetic retinopathy (DR) complications in 50 of 1648 patients (3%) treated with semaglutide compared with 29 of 1649 (1.8%) who received placebo (P = .02; hazard ratio, 1.76; 95% CI, 1.11-2.78).6 Of the 79 total patients who experienced retinopathy complications, 66 had retinopathy at baseline (42 of 50 [84%]) in the semaglutide group; 24 of 29 [83%] in the placebo group).6 Worsening of DR became one of the most notable AEs of semaglutide evaluated in clinical trials. This further deemed the effect as a warning in the semaglutide package insert to assist clinicians with treatment decisions.
As part of a US Department of Veterans Affairs (VA) National Lost Opportunity Cost Savings Initiative, which encompasses administrative efforts to promote more cost-effective yet safe and efficacious therapy options for veterans, the Michael E. DeBakey VA Medical Center (MEDVAMC) in Houston, Texas, converted a portion of patients with T2DM established on liraglutide to semaglutide. The 30-day supply cost of the 2-pack liraglutide 6 mg/mL (3 mL) injection pens for the MEDVAMC was $197.64. The 30-day supply cost for the singular multidose semaglutide 0.5 mg/0.375 mL (1.5 mL) injection pen was $115.15. Cost savings for the MEDVAMC facility were initially estimated to reach $642,522.
The subset of patients converted had to have undergone teleretinal imaging and not have a diagnosis of nonproliferative DR (NPDR), proliferative DR (PDR), or PDR with or without
In the fall of 2021, there was also a standing list of patients on liraglutide who were not converted due to a lack of teleretinal imaging. As a result, there was potential for a quality improvement (QI) intervention to target this patient population, which could result in further cost savings for MEDVAMC and improved glycemic control because of increased conversion from liraglutide to semaglutide. The purpose of this project was to perform a QI assessment on this subset of patients both initially converted from liraglutide to semaglutide, and those who were yet to be converted due to a lack of teleretinal imaging to determine the impact on glycemic control and cost savings.
Methods
This QI project was a single-center, prospective cohort study with a retrospective chart review of veterans with T2DM converted from liraglutide to semaglutide at the MEDVAMC. Patient data were collected from the Computerized Patient Record System (CPRS) between March 1, 2021, and November 30, 2021. An initial subset of patients was converted to semaglutide in March and April 2021. Patients initially excluded underwent a second chart review to determine whether they truly met exclusion criteria. Patients who did not have a definitive diagnosis of NPDR or PDR, those due for updated teleretinal imaging, as well as those with updated teleretinal imaging that excluded NPDR or PDR were targeted for clinician education interventions.
Following this intervention, a subset of patients with negative DR findings were converted from liraglutide to semaglutide. Primary care and endocrinology clinicians were notified that patients who met the criteria should be referred for teleretinal imaging if no updated results were present or that patients were eligible for semaglutide conversion based on negative findings. Both patients who were initially converted as well as those converted following education were included for data collection/analysis of glycemic control via HbA1c and blood glucose levels.
Cost savings were evaluated using outpatient pharmacy procurement pricing data. This project was approved by the MEDVAMC Quality Assurance and Regulatory Affairs Office.
Participants
Patients included in the study were adults aged ≥ 18 years with T2DM, converted from liraglutide 0.6 and 1.2 mg daily to semaglutide 0.25 mg weekly (titrated to 0.5 mg weekly after 4 weeks), and had an active prescription for semaglutide, with or without insulin or other oral antihyperglycemics. Patients with NPDR or PDR, type 1 DM, no HbA1c data, no filled semaglutide prescriptions, insulin pumps, and those without teleretinal imaging within the postintervention period or who died during the study period were excluded.
Patient baseline characteristics collected included demographic data, CV comorbidities, antihyperglycemic medications, and changes in insulin doses. Parameters analyzed at baseline and 3 to 12 months postconversion included body weight, HbA1c, and blood glucose levels.
Outcomes
The primary objectives of this QI project were to assess glycemic control (via changes in HbA1c levels) and cost savings following patient conversion from liraglutide to semaglutide. A second objective was to educate clinicians for referral of T2DM patients without teleretinal imaging in the past 2 years.
The purpose of the latter objective was to encourage conversion from liraglutide to semaglutide in the absence of DR. We predicted that 50% of patients with clinician education would be converted. Secondary objectives included assessing body weight differences, evaluating modifications in diabetes regimen, and documenting AEs. We predicted that glycemic control would either remain stable or improve with conversion to semaglutide.
Statistical Analysis
Patient demographic data were analyzed using descriptive statistics. Quantitative data (HbA1c, blood glucose, and body weight differences as continuous variables) were analyzed using a paired Student t test, and categorical variables were analyzed using the χ2 test.
Results
During the study period, 692 patients were identified with active liraglutide prescriptions (Figure). Of these, 49 patients who were initially excluded due to outdated teleretinal imaging or negative findings met the criteria for clinician education, and 14 of those 49 patients (28.6%) were converted from liraglutide to semaglutide. Thirty-three patients (67.3%) did not schedule teleretinal imaging or did not convert to semaglutide following negative teleretinal findings. Two patients (4.1%) either scheduled or proceeded with teleretinal imaging, without any further action from the clinician.
Including the 14 patients converted posteducational intervention, 425 patients were converted to semaglutide. Excluded from analysis were 121 patients: 57 for incomplete HbA1c data or no filled semaglutide prescription; 30 for HbA1c and weight data outside of the study timeframe; 25 died of causes unrelated to the project; 8 had insulin pumps; and 1 was diagnosed with late-onset type 1 DM. The final sample was 304 patients who underwent analysis.
Two hundred seventy-three patients (89.8%) were male, and 180 (59.2%) were White (Table 1). The mean (SD) age of patients was 65.9 (9.6) years, and 236 (77.6%) were established on insulin therapy (either basal, bolus, or a combination). The 3 most common antihyperglycemic agents (other than insulin) that patients used included 185 metformin (60.9%), 104 empagliflozin (34.2%), and 50 glipizide (16.4%) prescriptions.
Most patients had CV disease. Three hundred patients (98.7%) had comorbid hypertension, 298 (98.0%) had hyperlipidemia, and 114 (37.5%) had coronary artery disease (Table 2). Other diseases that patients were concomitantly diagnosed with included peripheral vascular disease, heart failure, history of stroke or transient ischemic attack, and history of myocardial infarction.
Documented AEs included 83 patients (27.3%) with hypoglycemia at any point within 3 to 12 months of conversion and 25 patients (8.2%) with mainly GI-related events, including nausea, vomiting, diarrhea, decreased appetite, and abdominal pain. Six patients (2.0%) had a new diagnosis of DR 3 to 12 months postconversion.
Glycemic Control and Weight Changes
At baseline, mean (SD) HbA1c was 8.1% (1.5), blood glucose was 187.4 (44.2) mg/dL, and body weight was 112.9 (23.0) kg (Table 3). In the timeframe evaluated (3 to 12 months postconversion), patients’ mean (SD) HbA1c was found to have significantly decreased to 7.6% (1.4) (P < .001; 95% CI, -0.7 to -0.3), blood glucose decreased to 172.6 (39.0) mg/dL (P < .001; 95% CI, -19.3 to -10.2), and body weight decreased to 105.2 (32.3) kg (P < .001; 95% CI, -10.6 to -4.8). All parameters evaluated were deemed statistically significant.
Further analyses evaluating specific changes in HbA1c observed postconversion are as follows: 199 patients (65.5%) experienced a decrease, 92 (30.3%) experienced an increase, and 13 (4.3%) experienced no change in their HbA1c.
As the timeframe was fairly broad to assess HbA1c changes, a prespecified subgroup analysis was conducted to determine specific changes in HbA1c within 3 to 6, 6 to 9, and 9 to 12 months postconversion (Table 4). At 3 to 6 months postconversion, patient mean (SD) HbA1c levels significantly decreased from 8.2% (1.5) at baseline to 7.6% (1.3) postconversion (P = .002; 95% CI, -1.0 to -0.2). At 6 to 9 months postconversion, the mean (SD) HbA1c significantly decreased from 8.1% (1.5) at baseline to 7.6% (1.4) postconversion (P = .002; 95% CI, -0.8 to -0.2).
Glucose-Lowering Agent Adjustments
One hundred thirteen patients (37.2%) required no changes to their antihyperglycemic regimen with the conversion, 85 (28.0%) required increased insulin doses, and 77 (25.3%) required decreased insulin doses (Table 5). Forty-five (14.8%) patients underwent discontinuation of either insulin or other antihyperglycemic agents; 44 (14.5%) had other antihyperglycemic agents dose increased, 39 (12.8%) required adding other glucose-lowering agents, 28 (9.2%) discontinued semaglutide, and 10 (3.3%) had other glucose-lowering medication doses decreased.
Cost Savings
Cost savings were evaluated using the MEDVAMC outpatient pharmacy procurement service. The total cost savings per patient per month was $82.49. For the 411 preclinician education patients converted to semaglutide, this resulted in a prospective annual cost savings of $406,840.68. An additional $13,858.32 was saved due to the intervention/clinician education for 14 patients converted to semaglutide. The total annual cost savings was $420,699.00.
Discussion
Overall, glycemic control significantly improved with veterans’ conversion from liraglutide to semaglutide. Not only were significant changes noted with HbA1c levels and weight, but consistencies were noted with mean HbA1c decrease and weight loss expected of GLP-1 RAs noted in clinical trials. The typical range for HbA1c changes expected is -1% to -2% and weight loss of 1 to 6 kg.4,7 Data from the LEAD-5 and SUSTAIN-4 trials, evaluating glycemic control in liraglutide and semaglutide, respectively, have noted comparable yet slightly more potent HbA1c decreases (-1.33% for liraglutide 1.8 mg daily vs -1.2% and -1.6% for semaglutide 0.5 mg and 1 mg weekly, respectively).8,9 However, more robust weight loss has been noted with semaglutide vs liraglutide (-4.62 kg for semaglutide 0.5 mg weekly and -6.33 kg for semaglutide 1 mg weekly vs -3.43 kg for liraglutide 1.8 mg daily).8,9 Results from the SUSTAIN-10 trial also noted mean changes in HbA1c of -1.7% for semaglutide 1 mg weekly vs -1.0% for liraglutide 1.2 mg daily; mean body weight differences were -5.8 kg for semaglutide and -1.9 kg for liraglutide at their respective doses.5 The mean weight loss noted with this QI project is consistent with prior trials of semaglutide.
Of note, 44 patients (14.5%) required the dosage increase of either one or multiple additional glucose-lowering agents at any time point within the 3- to 12-month period. Of those patients, 38 (86.4%) underwent further semaglutide dose titration to 1 mg weekly. Common reasons for a further dose increase to 1 mg weekly were an indication for more robust HbA1c lowering, a desire to decrease patients’ either basal or bolus insulin requirements, or a treatment goal of completely titrating patients off insulin.
It is uncertain why 30.3% of patients experienced an increase in HbA1c and 4.3% experienced no change. However, possibilities for the divergence in HbA1c outcomes in these subsets of patients may include suboptimal adherence to semaglutide or other antihyperglycemic agents as indicated by clinicians or nonadherence to dietary and lifestyle modifications.
Most patients (65.5%) experienced a decrease in HbA1c because of conversion to semaglutide, and
At the MEDVAMC, liraglutide is a nonformulary agent and semaglutide is now the formulary-preferred option. For patients with uncontrolled T2DM, if a GLP-1 RA is desired for therapy, clinicians are to place a prior authorization drug request (PADR) consultation for semaglutide for further evaluation and review of VA Criteria for Use (CFU) by clinical pharmacist practitioners. Liraglutide is the alternative option if patients do not meet the CFU for semaglutide (ie, have a diagnosis of DR among other exclusions). However, the semaglutide CFU was updated in April 2022 to exclude those specifically diagnosed with PDR, severe NPDR, and macular edema unless an ophthalmologist deems semaglutide acceptable. This indicates that patients with mild-to-moderate NPDR (who were originally excluded from this QI project) are now eligible to receive semaglutide. The incidence of new DR diagnoses (2%) observed in this study could indicate an unclear relationship between semaglutide and increased rates of DR; however, no definitive correlation can be established due to the retrospective nature of this project. The implications of the results of this QI project in relation to the updated CFU remain undetermined.
Due to the comparable improvements in HbA1c and more robust weight loss noted with semaglutide vs liraglutide, we deem it appropriate to select semaglutide as the more cost-efficient GLP-1 RA and formulary preferred option. The data of this QI project supports the overall safety and treatment utility of this option. Although significant cost savings were achieved (> $400,000), the long-term benefit of the liraglutide to semaglutide conversion remains unknown.
Strengths and Limitations
Strengths of this project include the large sample size, its setting in a large VA medical center, and the evaluation of multiple outcomes beyond HbA1c for assessment of glycemic control (ie, mean blood glucose, insulin titration, and dose adjustment of other glucose-lowering agents).
Limitations of this study include the retrospective chart review used for data collection, limited accuracy of objective data due to the COVID-19 pandemic, and inconsistencies with documentation in patients’ electronic health records. As a protective measure in the height of the pandemic between March 2021 and November 2021, the VA promoted using telephone and virtual-visit clinics to minimize exposure for patients with nonurgent follow-up needs. Patient hesitance to present to the clinic in person due to COVID-19 was also a significant factor in obtaining objective follow-up data. As a result, less accurate and timely baseline and postconversion weight and HbA1c data resulted, leading to our decision to extend the timeframe evaluated postconversion to 3 to 12 months. We also noted inconsistencies with documentation in CPRS. Unless veterans were closely followed by clinical pharmacist practitioners or endocrine consultation service clinicians, it was more difficult to follow and document trends of insulin titration to assess the impact of semaglutide conversion. The number of AEs, including hypoglycemia and GI intolerance, were also not consistently documented within the CPRS, and the frequency of AEs may be underestimated.
Another possible limitation regarding the interpretation of the results includes the portion of patients titrated up to semaglutide 1 mg weekly. As the focal point of this project was to review changes in glycemic control in the conversion to semaglutide 0.5 mg, this population of patients converted to 1 mg could potentially overestimate the HbA1c and weight changes described, as it is consistent with the SUSTAIN trials that show more robust decreases in those parameters described earlier.
Conclusions
A subset of patients with T2DM converted from liraglutide to semaglutide experienced significant changes in glycemic control and body weight. Significant differences were noted for a decreased HbA1c, decreased mean blood glucose, and weight loss. A fair portion of patients’ antihyperglycemic regimens required no changes on conversion to semaglutide. Although the semaglutide discontinuation rate neared 10%, AEs that may have contributed to this discontinuation rate included hypoglycemia and GI intolerance. Clinician education resulted in a substantial number of patients undergoing teleretinal imaging and further conversion to semaglutide; however, due to the low conversion response rate, a more effective method of educating clinicians is warranted. Although the semaglutide cost savings initiative at MEDVAMC resulted in significant savings, a full cost-effective analysis is needed to assess more comprehensive institution savings.
Semaglutide and liraglutide are glucagon-like peptide 1 receptor agonists (GLP-1 RAs) that are approved by the US Food and Drug Administration as subcutaneous injections for patients with type 2 diabetes mellitus (T2DM). Both are recommended by the American Diabetes Association (ADA) as first-line options for patients with concomitant atherosclerotic cardiovascular (CV) disease and exert therapeutic effect via incretin-like mechanisms.1 These agents lower blood glucose levels by stimulating insulin release, increasing the body’s sensitivity to insulin, and inhibiting inappropriate glucagon secretion.2,3 They also slow gastric emptying, resulting in decreased appetite and potential weight loss.4
The SUSTAIN (1-7) trials concluded that semaglutide presented an equivalent safety profile and greater efficacy compared with other GLP-1 RAs, including exenatide and dulaglutide.2 The SUSTAIN-10 open-label, head-to-head trial evaluating 1 mg semaglutide once weekly vs 1.2 mg liraglutide daily concluded that semaglutide was superior in hemoglobin A1c (HbA1c) and body weight reduction compared with liraglutide, with slightly increased gastrointestinal (GI) adverse effects (AEs).5 Similar to the LEADER trial assessing liraglutide, SUSTAIN-6 evaluated semaglutide in patients at increased CV risk and found that compared with placebo, semaglutide decreased rates of serious CV events, such as CV death, myocardial infarction, and stroke and were similar to the CV outcomes in the LEADER trial.2,6 Although initial results of the SUSTAIN-6 trial were thought to be nearly equivalent to the LEADER trial, analyses later published comparing both trials noted that semaglutide had more potent HbA1c lowering and weight loss benefit when compared with liraglutide.2,6 The cardioprotective outcomes of SUSTAIN-6 qualified semaglutide for inclusion in the current ADA Standards of Medical Care recommendations for CV risk reduction.6,7 However, despite the CV safety profile and efficacy associated with semaglutide, the SUSTAIN-6 trial noted an increased risk of diabetic retinopathy (DR) complications in 50 of 1648 patients (3%) treated with semaglutide compared with 29 of 1649 (1.8%) who received placebo (P = .02; hazard ratio, 1.76; 95% CI, 1.11-2.78).6 Of the 79 total patients who experienced retinopathy complications, 66 had retinopathy at baseline (42 of 50 [84%]) in the semaglutide group; 24 of 29 [83%] in the placebo group).6 Worsening of DR became one of the most notable AEs of semaglutide evaluated in clinical trials. This further deemed the effect as a warning in the semaglutide package insert to assist clinicians with treatment decisions.
As part of a US Department of Veterans Affairs (VA) National Lost Opportunity Cost Savings Initiative, which encompasses administrative efforts to promote more cost-effective yet safe and efficacious therapy options for veterans, the Michael E. DeBakey VA Medical Center (MEDVAMC) in Houston, Texas, converted a portion of patients with T2DM established on liraglutide to semaglutide. The 30-day supply cost of the 2-pack liraglutide 6 mg/mL (3 mL) injection pens for the MEDVAMC was $197.64. The 30-day supply cost for the singular multidose semaglutide 0.5 mg/0.375 mL (1.5 mL) injection pen was $115.15. Cost savings for the MEDVAMC facility were initially estimated to reach $642,522.
The subset of patients converted had to have undergone teleretinal imaging and not have a diagnosis of nonproliferative DR (NPDR), proliferative DR (PDR), or PDR with or without
In the fall of 2021, there was also a standing list of patients on liraglutide who were not converted due to a lack of teleretinal imaging. As a result, there was potential for a quality improvement (QI) intervention to target this patient population, which could result in further cost savings for MEDVAMC and improved glycemic control because of increased conversion from liraglutide to semaglutide. The purpose of this project was to perform a QI assessment on this subset of patients both initially converted from liraglutide to semaglutide, and those who were yet to be converted due to a lack of teleretinal imaging to determine the impact on glycemic control and cost savings.
Methods
This QI project was a single-center, prospective cohort study with a retrospective chart review of veterans with T2DM converted from liraglutide to semaglutide at the MEDVAMC. Patient data were collected from the Computerized Patient Record System (CPRS) between March 1, 2021, and November 30, 2021. An initial subset of patients was converted to semaglutide in March and April 2021. Patients initially excluded underwent a second chart review to determine whether they truly met exclusion criteria. Patients who did not have a definitive diagnosis of NPDR or PDR, those due for updated teleretinal imaging, as well as those with updated teleretinal imaging that excluded NPDR or PDR were targeted for clinician education interventions.
Following this intervention, a subset of patients with negative DR findings were converted from liraglutide to semaglutide. Primary care and endocrinology clinicians were notified that patients who met the criteria should be referred for teleretinal imaging if no updated results were present or that patients were eligible for semaglutide conversion based on negative findings. Both patients who were initially converted as well as those converted following education were included for data collection/analysis of glycemic control via HbA1c and blood glucose levels.
Cost savings were evaluated using outpatient pharmacy procurement pricing data. This project was approved by the MEDVAMC Quality Assurance and Regulatory Affairs Office.
Participants
Patients included in the study were adults aged ≥ 18 years with T2DM, converted from liraglutide 0.6 and 1.2 mg daily to semaglutide 0.25 mg weekly (titrated to 0.5 mg weekly after 4 weeks), and had an active prescription for semaglutide, with or without insulin or other oral antihyperglycemics. Patients with NPDR or PDR, type 1 DM, no HbA1c data, no filled semaglutide prescriptions, insulin pumps, and those without teleretinal imaging within the postintervention period or who died during the study period were excluded.
Patient baseline characteristics collected included demographic data, CV comorbidities, antihyperglycemic medications, and changes in insulin doses. Parameters analyzed at baseline and 3 to 12 months postconversion included body weight, HbA1c, and blood glucose levels.
Outcomes
The primary objectives of this QI project were to assess glycemic control (via changes in HbA1c levels) and cost savings following patient conversion from liraglutide to semaglutide. A second objective was to educate clinicians for referral of T2DM patients without teleretinal imaging in the past 2 years.
The purpose of the latter objective was to encourage conversion from liraglutide to semaglutide in the absence of DR. We predicted that 50% of patients with clinician education would be converted. Secondary objectives included assessing body weight differences, evaluating modifications in diabetes regimen, and documenting AEs. We predicted that glycemic control would either remain stable or improve with conversion to semaglutide.
Statistical Analysis
Patient demographic data were analyzed using descriptive statistics. Quantitative data (HbA1c, blood glucose, and body weight differences as continuous variables) were analyzed using a paired Student t test, and categorical variables were analyzed using the χ2 test.
Results
During the study period, 692 patients were identified with active liraglutide prescriptions (Figure). Of these, 49 patients who were initially excluded due to outdated teleretinal imaging or negative findings met the criteria for clinician education, and 14 of those 49 patients (28.6%) were converted from liraglutide to semaglutide. Thirty-three patients (67.3%) did not schedule teleretinal imaging or did not convert to semaglutide following negative teleretinal findings. Two patients (4.1%) either scheduled or proceeded with teleretinal imaging, without any further action from the clinician.
Including the 14 patients converted posteducational intervention, 425 patients were converted to semaglutide. Excluded from analysis were 121 patients: 57 for incomplete HbA1c data or no filled semaglutide prescription; 30 for HbA1c and weight data outside of the study timeframe; 25 died of causes unrelated to the project; 8 had insulin pumps; and 1 was diagnosed with late-onset type 1 DM. The final sample was 304 patients who underwent analysis.
Two hundred seventy-three patients (89.8%) were male, and 180 (59.2%) were White (Table 1). The mean (SD) age of patients was 65.9 (9.6) years, and 236 (77.6%) were established on insulin therapy (either basal, bolus, or a combination). The 3 most common antihyperglycemic agents (other than insulin) that patients used included 185 metformin (60.9%), 104 empagliflozin (34.2%), and 50 glipizide (16.4%) prescriptions.
Most patients had CV disease. Three hundred patients (98.7%) had comorbid hypertension, 298 (98.0%) had hyperlipidemia, and 114 (37.5%) had coronary artery disease (Table 2). Other diseases that patients were concomitantly diagnosed with included peripheral vascular disease, heart failure, history of stroke or transient ischemic attack, and history of myocardial infarction.
Documented AEs included 83 patients (27.3%) with hypoglycemia at any point within 3 to 12 months of conversion and 25 patients (8.2%) with mainly GI-related events, including nausea, vomiting, diarrhea, decreased appetite, and abdominal pain. Six patients (2.0%) had a new diagnosis of DR 3 to 12 months postconversion.
Glycemic Control and Weight Changes
At baseline, mean (SD) HbA1c was 8.1% (1.5), blood glucose was 187.4 (44.2) mg/dL, and body weight was 112.9 (23.0) kg (Table 3). In the timeframe evaluated (3 to 12 months postconversion), patients’ mean (SD) HbA1c was found to have significantly decreased to 7.6% (1.4) (P < .001; 95% CI, -0.7 to -0.3), blood glucose decreased to 172.6 (39.0) mg/dL (P < .001; 95% CI, -19.3 to -10.2), and body weight decreased to 105.2 (32.3) kg (P < .001; 95% CI, -10.6 to -4.8). All parameters evaluated were deemed statistically significant.
Further analyses evaluating specific changes in HbA1c observed postconversion are as follows: 199 patients (65.5%) experienced a decrease, 92 (30.3%) experienced an increase, and 13 (4.3%) experienced no change in their HbA1c.
As the timeframe was fairly broad to assess HbA1c changes, a prespecified subgroup analysis was conducted to determine specific changes in HbA1c within 3 to 6, 6 to 9, and 9 to 12 months postconversion (Table 4). At 3 to 6 months postconversion, patient mean (SD) HbA1c levels significantly decreased from 8.2% (1.5) at baseline to 7.6% (1.3) postconversion (P = .002; 95% CI, -1.0 to -0.2). At 6 to 9 months postconversion, the mean (SD) HbA1c significantly decreased from 8.1% (1.5) at baseline to 7.6% (1.4) postconversion (P = .002; 95% CI, -0.8 to -0.2).
Glucose-Lowering Agent Adjustments
One hundred thirteen patients (37.2%) required no changes to their antihyperglycemic regimen with the conversion, 85 (28.0%) required increased insulin doses, and 77 (25.3%) required decreased insulin doses (Table 5). Forty-five (14.8%) patients underwent discontinuation of either insulin or other antihyperglycemic agents; 44 (14.5%) had other antihyperglycemic agents dose increased, 39 (12.8%) required adding other glucose-lowering agents, 28 (9.2%) discontinued semaglutide, and 10 (3.3%) had other glucose-lowering medication doses decreased.
Cost Savings
Cost savings were evaluated using the MEDVAMC outpatient pharmacy procurement service. The total cost savings per patient per month was $82.49. For the 411 preclinician education patients converted to semaglutide, this resulted in a prospective annual cost savings of $406,840.68. An additional $13,858.32 was saved due to the intervention/clinician education for 14 patients converted to semaglutide. The total annual cost savings was $420,699.00.
Discussion
Overall, glycemic control significantly improved with veterans’ conversion from liraglutide to semaglutide. Not only were significant changes noted with HbA1c levels and weight, but consistencies were noted with mean HbA1c decrease and weight loss expected of GLP-1 RAs noted in clinical trials. The typical range for HbA1c changes expected is -1% to -2% and weight loss of 1 to 6 kg.4,7 Data from the LEAD-5 and SUSTAIN-4 trials, evaluating glycemic control in liraglutide and semaglutide, respectively, have noted comparable yet slightly more potent HbA1c decreases (-1.33% for liraglutide 1.8 mg daily vs -1.2% and -1.6% for semaglutide 0.5 mg and 1 mg weekly, respectively).8,9 However, more robust weight loss has been noted with semaglutide vs liraglutide (-4.62 kg for semaglutide 0.5 mg weekly and -6.33 kg for semaglutide 1 mg weekly vs -3.43 kg for liraglutide 1.8 mg daily).8,9 Results from the SUSTAIN-10 trial also noted mean changes in HbA1c of -1.7% for semaglutide 1 mg weekly vs -1.0% for liraglutide 1.2 mg daily; mean body weight differences were -5.8 kg for semaglutide and -1.9 kg for liraglutide at their respective doses.5 The mean weight loss noted with this QI project is consistent with prior trials of semaglutide.
Of note, 44 patients (14.5%) required the dosage increase of either one or multiple additional glucose-lowering agents at any time point within the 3- to 12-month period. Of those patients, 38 (86.4%) underwent further semaglutide dose titration to 1 mg weekly. Common reasons for a further dose increase to 1 mg weekly were an indication for more robust HbA1c lowering, a desire to decrease patients’ either basal or bolus insulin requirements, or a treatment goal of completely titrating patients off insulin.
It is uncertain why 30.3% of patients experienced an increase in HbA1c and 4.3% experienced no change. However, possibilities for the divergence in HbA1c outcomes in these subsets of patients may include suboptimal adherence to semaglutide or other antihyperglycemic agents as indicated by clinicians or nonadherence to dietary and lifestyle modifications.
Most patients (65.5%) experienced a decrease in HbA1c because of conversion to semaglutide, and
At the MEDVAMC, liraglutide is a nonformulary agent and semaglutide is now the formulary-preferred option. For patients with uncontrolled T2DM, if a GLP-1 RA is desired for therapy, clinicians are to place a prior authorization drug request (PADR) consultation for semaglutide for further evaluation and review of VA Criteria for Use (CFU) by clinical pharmacist practitioners. Liraglutide is the alternative option if patients do not meet the CFU for semaglutide (ie, have a diagnosis of DR among other exclusions). However, the semaglutide CFU was updated in April 2022 to exclude those specifically diagnosed with PDR, severe NPDR, and macular edema unless an ophthalmologist deems semaglutide acceptable. This indicates that patients with mild-to-moderate NPDR (who were originally excluded from this QI project) are now eligible to receive semaglutide. The incidence of new DR diagnoses (2%) observed in this study could indicate an unclear relationship between semaglutide and increased rates of DR; however, no definitive correlation can be established due to the retrospective nature of this project. The implications of the results of this QI project in relation to the updated CFU remain undetermined.
Due to the comparable improvements in HbA1c and more robust weight loss noted with semaglutide vs liraglutide, we deem it appropriate to select semaglutide as the more cost-efficient GLP-1 RA and formulary preferred option. The data of this QI project supports the overall safety and treatment utility of this option. Although significant cost savings were achieved (> $400,000), the long-term benefit of the liraglutide to semaglutide conversion remains unknown.
Strengths and Limitations
Strengths of this project include the large sample size, its setting in a large VA medical center, and the evaluation of multiple outcomes beyond HbA1c for assessment of glycemic control (ie, mean blood glucose, insulin titration, and dose adjustment of other glucose-lowering agents).
Limitations of this study include the retrospective chart review used for data collection, limited accuracy of objective data due to the COVID-19 pandemic, and inconsistencies with documentation in patients’ electronic health records. As a protective measure in the height of the pandemic between March 2021 and November 2021, the VA promoted using telephone and virtual-visit clinics to minimize exposure for patients with nonurgent follow-up needs. Patient hesitance to present to the clinic in person due to COVID-19 was also a significant factor in obtaining objective follow-up data. As a result, less accurate and timely baseline and postconversion weight and HbA1c data resulted, leading to our decision to extend the timeframe evaluated postconversion to 3 to 12 months. We also noted inconsistencies with documentation in CPRS. Unless veterans were closely followed by clinical pharmacist practitioners or endocrine consultation service clinicians, it was more difficult to follow and document trends of insulin titration to assess the impact of semaglutide conversion. The number of AEs, including hypoglycemia and GI intolerance, were also not consistently documented within the CPRS, and the frequency of AEs may be underestimated.
Another possible limitation regarding the interpretation of the results includes the portion of patients titrated up to semaglutide 1 mg weekly. As the focal point of this project was to review changes in glycemic control in the conversion to semaglutide 0.5 mg, this population of patients converted to 1 mg could potentially overestimate the HbA1c and weight changes described, as it is consistent with the SUSTAIN trials that show more robust decreases in those parameters described earlier.
Conclusions
A subset of patients with T2DM converted from liraglutide to semaglutide experienced significant changes in glycemic control and body weight. Significant differences were noted for a decreased HbA1c, decreased mean blood glucose, and weight loss. A fair portion of patients’ antihyperglycemic regimens required no changes on conversion to semaglutide. Although the semaglutide discontinuation rate neared 10%, AEs that may have contributed to this discontinuation rate included hypoglycemia and GI intolerance. Clinician education resulted in a substantial number of patients undergoing teleretinal imaging and further conversion to semaglutide; however, due to the low conversion response rate, a more effective method of educating clinicians is warranted. Although the semaglutide cost savings initiative at MEDVAMC resulted in significant savings, a full cost-effective analysis is needed to assess more comprehensive institution savings.
1. ElSayed NA, Aleppo G, Aroda VR, et al. 9. Pharmacologic Approaches to Glycemic Treatment: Standards of Care in Diabetes-2023. Diabetes Care. 2023;46(suppl 1):S140-S157. doi:10.2337/dc23-S009
2. Aroda VR, Ahmann A, Cariou B, et al. Comparative efficacy, safety, and cardiovascular outcome with once-weekly subcutaneous semaglutide in the treatment of type 2 diabetes: insights from the SUSTAIN 1-7 trials. Diabetes Metab. 2019;45(5):409-418. doi:10.1016/j.diabet.2018.12.001
3. Trujillo JM, Nuffer W, Smith BA. GLP-1 receptor agonists: an updated review of head-to-head clinical studies. Ther Adv Endocrinol Metab. 2021;12:2042018821997320. Published 2021 Mar 9. doi:10.1177/2042018821997320
4. Drucker DJ. Mechanisms of action and therapeutic application of glucagon-like peptide-1. Cell Metab. 2018;27(4):740-756. doi:10.1016/j.cmet.2018.03.001
5. Capehorn MS, Catarig AM, Furberg JK, et al. Efficacy and safety of once-weekly semaglutide 1.0mg vs once-daily liraglutide 1.2mg as add-on to 1-3 oral antidiabetic drugs in subjects with type 2 diabetes (SUSTAIN 10). Diabetes Metab. 2020;46(2):100-109. doi:10.1016/j.diabet.2019.101117
6. Marso SP, Bain SC, Consoli A, et al; SUSTAIN-6 Investigators. Semaglutide and cardiovascular outcomes in patients with type 2 diabetes. N Engl J Med. 2016;375(19):1834-1844. doi:10.1056/NEJMoa1607141
7. ElSayed NA, Aleppo G, Aroda VR, et al. 10. Cardiovascular Disease and Risk Management: Standards of Care in Diabetes-2023. Diabetes Care. 2023;46(suppl 1):S158-S190. doi:10.2337/dc23-S010
8. Russell-Jones D, Vaag A, Schmitz O, et al. Liraglutide vs insulin glargine and placebo in combination with metformin and sulfonylurea therapy in type 2 diabetes mellitus (LEAD-5 met+SU): a randomised controlled trial. Diabetologia. 2009;52(10):2046-2055. doi:10.1007/s00125-009-1472-y
9. Aroda VR, Bain SC, Cariou B, et al. Efficacy and safety of once-weekly semaglutide versus once-daily insulin glargine as add-on to metformin (with or without sulfonylureas) in insulin-naive patients with type 2 diabetes (SUSTAIN 4): a randomised, open-label, parallel-group, multicentre, multinational, phase 3a trial. Lancet Diabetes Endocrinol. 2017;5(5):355-366. doi:10.1016/S2213-8587(17)30085-2
1. ElSayed NA, Aleppo G, Aroda VR, et al. 9. Pharmacologic Approaches to Glycemic Treatment: Standards of Care in Diabetes-2023. Diabetes Care. 2023;46(suppl 1):S140-S157. doi:10.2337/dc23-S009
2. Aroda VR, Ahmann A, Cariou B, et al. Comparative efficacy, safety, and cardiovascular outcome with once-weekly subcutaneous semaglutide in the treatment of type 2 diabetes: insights from the SUSTAIN 1-7 trials. Diabetes Metab. 2019;45(5):409-418. doi:10.1016/j.diabet.2018.12.001
3. Trujillo JM, Nuffer W, Smith BA. GLP-1 receptor agonists: an updated review of head-to-head clinical studies. Ther Adv Endocrinol Metab. 2021;12:2042018821997320. Published 2021 Mar 9. doi:10.1177/2042018821997320
4. Drucker DJ. Mechanisms of action and therapeutic application of glucagon-like peptide-1. Cell Metab. 2018;27(4):740-756. doi:10.1016/j.cmet.2018.03.001
5. Capehorn MS, Catarig AM, Furberg JK, et al. Efficacy and safety of once-weekly semaglutide 1.0mg vs once-daily liraglutide 1.2mg as add-on to 1-3 oral antidiabetic drugs in subjects with type 2 diabetes (SUSTAIN 10). Diabetes Metab. 2020;46(2):100-109. doi:10.1016/j.diabet.2019.101117
6. Marso SP, Bain SC, Consoli A, et al; SUSTAIN-6 Investigators. Semaglutide and cardiovascular outcomes in patients with type 2 diabetes. N Engl J Med. 2016;375(19):1834-1844. doi:10.1056/NEJMoa1607141
7. ElSayed NA, Aleppo G, Aroda VR, et al. 10. Cardiovascular Disease and Risk Management: Standards of Care in Diabetes-2023. Diabetes Care. 2023;46(suppl 1):S158-S190. doi:10.2337/dc23-S010
8. Russell-Jones D, Vaag A, Schmitz O, et al. Liraglutide vs insulin glargine and placebo in combination with metformin and sulfonylurea therapy in type 2 diabetes mellitus (LEAD-5 met+SU): a randomised controlled trial. Diabetologia. 2009;52(10):2046-2055. doi:10.1007/s00125-009-1472-y
9. Aroda VR, Bain SC, Cariou B, et al. Efficacy and safety of once-weekly semaglutide versus once-daily insulin glargine as add-on to metformin (with or without sulfonylureas) in insulin-naive patients with type 2 diabetes (SUSTAIN 4): a randomised, open-label, parallel-group, multicentre, multinational, phase 3a trial. Lancet Diabetes Endocrinol. 2017;5(5):355-366. doi:10.1016/S2213-8587(17)30085-2
Video-Based Coaching for Dermatology Resident Surgical Education
To the Editor:
Video-based coaching (VBC) involves a surgeon recording a surgery and then reviewing the video with a surgical coach; it is a form of education that is gaining popularity among surgical specialties.1 Video-based education is underutilized in dermatology residency training.2 We conducted a pilot study at our dermatology residency program to evaluate the efficacy and feasibility of VBC.
The University of Texas at Austin Dell Medical School institutional review board approved this study. All 4 first-year dermatology residents were recruited to participate in this study. Participants filled out a prestudy survey assessing their surgical experience, confidence in performing surgery, and attitudes on VBC. Participants used a head-mounted point-of-view camera to record themselves performing a wide local excision on the trunk or extremities of a live human patient. Participants then reviewed the recording on their own and scored themselves using the Objective Structured Assessment of Technical Skills (OSATS) scoring table (scored from 1 to 5, with 5 being the highest possible score for each element), which is a validated tool for assessing surgical skills (eTable 1).3 Given that there were no assistants participating in the surgery, this element of the OSATS scoring table was excluded, making a maximum possible score of 30 and a minimum possible score of 6. After scoring themselves, participants then had a 1-on-1 coaching session with a fellowship-trained dermatologic surgeon (M.F. or T.H.) via online teleconferencing.
During the coaching session, participants and coaches reviewed the video. The surgical coaches also scored the residents using the OSATS, then residents and coaches discussed how the resident could improve using the OSATS scores as a guide. The residents then completed a poststudy survey assessing their surgical experience, confidence in performing surgery, and attitudes on VBC. Descriptive statistics were reported.
On average, residents spent 31.3 minutes reviewing their own surgeries and scoring themselves. The average time for a coaching session, which included time spent scoring, was 13.8 minutes. Residents scored themselves lower than the surgical coaches did by an average of 5.25 points (eTable 2). Residents gave themselves an average total score of 10.5, while their respective surgical coaches gave the residents an average score of 15.75. There was a trend of residents with greater surgical experience having higher OSATS scores (Figure). After the coaching session, 3 of 4 residents reported that they felt more confident in their surgical skills. All residents felt more confident in assessing their surgical skills and felt that VBC was an effective teaching measure. All residents agreed that VBC should be continued as part of their residency training.
Video-based coaching has the potential to provide several benefits for dermatology trainees. Because receiving feedback intraoperatively often can be distracting and incomplete, video review can instead allow the surgeon to focus on performing the surgery and then later focus on learning while reviewing the video.1,4 Feedback also can be more comprehensive and delivered without concern for time constraints or disturbing clinic flow as well as without the additional concern of the patient overhearing comments and feedback.3 Although independent video review in the absence of coaching can lead to improvement in surgical skills, the addition of VBC provides even greater potential educational benefit.4 During the COVID-19 pandemic, VBC allowed coaches to provide feedback without additional exposures. We utilized dermatologic surgery faculty as coaches, but this format of training also would apply to general dermatology faculty.
Another goal of VBC is to enhance a trainee’s ability to perform self-directed learning, which requires accurate self-assessment.4 Accurately assessing one’s own strengths empowers a trainee to act with appropriate confidence, while understanding one’s own weaknesses allows a trainee to effectively balance confidence and caution in daily practice.5 Interestingly, in our study all residents scored themselves lower than surgical coaches, but with 1 coaching session, the residents subsequently reported greater surgical confidence.
Time constraints can be a potential barrier to surgical coaching.4 Our study demonstrates that VBC requires minimal time investment. Increasing the speed of video playback allowed for efficient evaluation of resident surgeries without compromising the coach’s ability to provide comprehensive feedback. Our feedback sessions were performed virtually, which allowed for ease of scheduling between trainees and coaches.
Our pilot study demonstrated that VBC is relatively easy to implement in a dermatology residency training setting, leveraging relatively low-cost technologies and allowing for a means of learning that residents felt was effective. Video-based coaching requires minimal time investment from both trainees and coaches and has the potential to enhance surgical confidence. Our current study is limited by its small sample size. Future studies should include follow-up recordings and assess the efficacy of VBC in enhancing surgical skills.
- Greenberg CC, Dombrowski J, Dimick JB. Video-based surgical coaching: an emerging approach to performance improvement. JAMA Surg. 2016;151:282-283.
- Dai J, Bordeaux JS, Miller CJ, et al. Assessing surgical training and deliberate practice methods in dermatology residency: a survey of dermatology program directors. Dermatol Surg. 2016;42:977-984.
- Chitgopeker P, Sidey K, Aronson A, et al. Surgical skills video-based assessment tool for dermatology residents: a prospective pilot study. J Am Acad Dermatol. 2020;83:614-616.
- Bull NB, Silverman CD, Bonrath EM. Targeted surgical coaching can improve operative self-assessment ability: a single-blinded nonrandomized trial. Surgery. 2020;167:308-313.
- Eva KW, Regehr G. Self-assessment in the health professions: a reformulation and research agenda. Acad Med. 2005;80(10 suppl):S46-S54.
To the Editor:
Video-based coaching (VBC) involves a surgeon recording a surgery and then reviewing the video with a surgical coach; it is a form of education that is gaining popularity among surgical specialties.1 Video-based education is underutilized in dermatology residency training.2 We conducted a pilot study at our dermatology residency program to evaluate the efficacy and feasibility of VBC.
The University of Texas at Austin Dell Medical School institutional review board approved this study. All 4 first-year dermatology residents were recruited to participate in this study. Participants filled out a prestudy survey assessing their surgical experience, confidence in performing surgery, and attitudes on VBC. Participants used a head-mounted point-of-view camera to record themselves performing a wide local excision on the trunk or extremities of a live human patient. Participants then reviewed the recording on their own and scored themselves using the Objective Structured Assessment of Technical Skills (OSATS) scoring table (scored from 1 to 5, with 5 being the highest possible score for each element), which is a validated tool for assessing surgical skills (eTable 1).3 Given that there were no assistants participating in the surgery, this element of the OSATS scoring table was excluded, making a maximum possible score of 30 and a minimum possible score of 6. After scoring themselves, participants then had a 1-on-1 coaching session with a fellowship-trained dermatologic surgeon (M.F. or T.H.) via online teleconferencing.
During the coaching session, participants and coaches reviewed the video. The surgical coaches also scored the residents using the OSATS, then residents and coaches discussed how the resident could improve using the OSATS scores as a guide. The residents then completed a poststudy survey assessing their surgical experience, confidence in performing surgery, and attitudes on VBC. Descriptive statistics were reported.
On average, residents spent 31.3 minutes reviewing their own surgeries and scoring themselves. The average time for a coaching session, which included time spent scoring, was 13.8 minutes. Residents scored themselves lower than the surgical coaches did by an average of 5.25 points (eTable 2). Residents gave themselves an average total score of 10.5, while their respective surgical coaches gave the residents an average score of 15.75. There was a trend of residents with greater surgical experience having higher OSATS scores (Figure). After the coaching session, 3 of 4 residents reported that they felt more confident in their surgical skills. All residents felt more confident in assessing their surgical skills and felt that VBC was an effective teaching measure. All residents agreed that VBC should be continued as part of their residency training.
Video-based coaching has the potential to provide several benefits for dermatology trainees. Because receiving feedback intraoperatively often can be distracting and incomplete, video review can instead allow the surgeon to focus on performing the surgery and then later focus on learning while reviewing the video.1,4 Feedback also can be more comprehensive and delivered without concern for time constraints or disturbing clinic flow as well as without the additional concern of the patient overhearing comments and feedback.3 Although independent video review in the absence of coaching can lead to improvement in surgical skills, the addition of VBC provides even greater potential educational benefit.4 During the COVID-19 pandemic, VBC allowed coaches to provide feedback without additional exposures. We utilized dermatologic surgery faculty as coaches, but this format of training also would apply to general dermatology faculty.
Another goal of VBC is to enhance a trainee’s ability to perform self-directed learning, which requires accurate self-assessment.4 Accurately assessing one’s own strengths empowers a trainee to act with appropriate confidence, while understanding one’s own weaknesses allows a trainee to effectively balance confidence and caution in daily practice.5 Interestingly, in our study all residents scored themselves lower than surgical coaches, but with 1 coaching session, the residents subsequently reported greater surgical confidence.
Time constraints can be a potential barrier to surgical coaching.4 Our study demonstrates that VBC requires minimal time investment. Increasing the speed of video playback allowed for efficient evaluation of resident surgeries without compromising the coach’s ability to provide comprehensive feedback. Our feedback sessions were performed virtually, which allowed for ease of scheduling between trainees and coaches.
Our pilot study demonstrated that VBC is relatively easy to implement in a dermatology residency training setting, leveraging relatively low-cost technologies and allowing for a means of learning that residents felt was effective. Video-based coaching requires minimal time investment from both trainees and coaches and has the potential to enhance surgical confidence. Our current study is limited by its small sample size. Future studies should include follow-up recordings and assess the efficacy of VBC in enhancing surgical skills.
To the Editor:
Video-based coaching (VBC) involves a surgeon recording a surgery and then reviewing the video with a surgical coach; it is a form of education that is gaining popularity among surgical specialties.1 Video-based education is underutilized in dermatology residency training.2 We conducted a pilot study at our dermatology residency program to evaluate the efficacy and feasibility of VBC.
The University of Texas at Austin Dell Medical School institutional review board approved this study. All 4 first-year dermatology residents were recruited to participate in this study. Participants filled out a prestudy survey assessing their surgical experience, confidence in performing surgery, and attitudes on VBC. Participants used a head-mounted point-of-view camera to record themselves performing a wide local excision on the trunk or extremities of a live human patient. Participants then reviewed the recording on their own and scored themselves using the Objective Structured Assessment of Technical Skills (OSATS) scoring table (scored from 1 to 5, with 5 being the highest possible score for each element), which is a validated tool for assessing surgical skills (eTable 1).3 Given that there were no assistants participating in the surgery, this element of the OSATS scoring table was excluded, making a maximum possible score of 30 and a minimum possible score of 6. After scoring themselves, participants then had a 1-on-1 coaching session with a fellowship-trained dermatologic surgeon (M.F. or T.H.) via online teleconferencing.
During the coaching session, participants and coaches reviewed the video. The surgical coaches also scored the residents using the OSATS, then residents and coaches discussed how the resident could improve using the OSATS scores as a guide. The residents then completed a poststudy survey assessing their surgical experience, confidence in performing surgery, and attitudes on VBC. Descriptive statistics were reported.
On average, residents spent 31.3 minutes reviewing their own surgeries and scoring themselves. The average time for a coaching session, which included time spent scoring, was 13.8 minutes. Residents scored themselves lower than the surgical coaches did by an average of 5.25 points (eTable 2). Residents gave themselves an average total score of 10.5, while their respective surgical coaches gave the residents an average score of 15.75. There was a trend of residents with greater surgical experience having higher OSATS scores (Figure). After the coaching session, 3 of 4 residents reported that they felt more confident in their surgical skills. All residents felt more confident in assessing their surgical skills and felt that VBC was an effective teaching measure. All residents agreed that VBC should be continued as part of their residency training.
Video-based coaching has the potential to provide several benefits for dermatology trainees. Because receiving feedback intraoperatively often can be distracting and incomplete, video review can instead allow the surgeon to focus on performing the surgery and then later focus on learning while reviewing the video.1,4 Feedback also can be more comprehensive and delivered without concern for time constraints or disturbing clinic flow as well as without the additional concern of the patient overhearing comments and feedback.3 Although independent video review in the absence of coaching can lead to improvement in surgical skills, the addition of VBC provides even greater potential educational benefit.4 During the COVID-19 pandemic, VBC allowed coaches to provide feedback without additional exposures. We utilized dermatologic surgery faculty as coaches, but this format of training also would apply to general dermatology faculty.
Another goal of VBC is to enhance a trainee’s ability to perform self-directed learning, which requires accurate self-assessment.4 Accurately assessing one’s own strengths empowers a trainee to act with appropriate confidence, while understanding one’s own weaknesses allows a trainee to effectively balance confidence and caution in daily practice.5 Interestingly, in our study all residents scored themselves lower than surgical coaches, but with 1 coaching session, the residents subsequently reported greater surgical confidence.
Time constraints can be a potential barrier to surgical coaching.4 Our study demonstrates that VBC requires minimal time investment. Increasing the speed of video playback allowed for efficient evaluation of resident surgeries without compromising the coach’s ability to provide comprehensive feedback. Our feedback sessions were performed virtually, which allowed for ease of scheduling between trainees and coaches.
Our pilot study demonstrated that VBC is relatively easy to implement in a dermatology residency training setting, leveraging relatively low-cost technologies and allowing for a means of learning that residents felt was effective. Video-based coaching requires minimal time investment from both trainees and coaches and has the potential to enhance surgical confidence. Our current study is limited by its small sample size. Future studies should include follow-up recordings and assess the efficacy of VBC in enhancing surgical skills.
- Greenberg CC, Dombrowski J, Dimick JB. Video-based surgical coaching: an emerging approach to performance improvement. JAMA Surg. 2016;151:282-283.
- Dai J, Bordeaux JS, Miller CJ, et al. Assessing surgical training and deliberate practice methods in dermatology residency: a survey of dermatology program directors. Dermatol Surg. 2016;42:977-984.
- Chitgopeker P, Sidey K, Aronson A, et al. Surgical skills video-based assessment tool for dermatology residents: a prospective pilot study. J Am Acad Dermatol. 2020;83:614-616.
- Bull NB, Silverman CD, Bonrath EM. Targeted surgical coaching can improve operative self-assessment ability: a single-blinded nonrandomized trial. Surgery. 2020;167:308-313.
- Eva KW, Regehr G. Self-assessment in the health professions: a reformulation and research agenda. Acad Med. 2005;80(10 suppl):S46-S54.
- Greenberg CC, Dombrowski J, Dimick JB. Video-based surgical coaching: an emerging approach to performance improvement. JAMA Surg. 2016;151:282-283.
- Dai J, Bordeaux JS, Miller CJ, et al. Assessing surgical training and deliberate practice methods in dermatology residency: a survey of dermatology program directors. Dermatol Surg. 2016;42:977-984.
- Chitgopeker P, Sidey K, Aronson A, et al. Surgical skills video-based assessment tool for dermatology residents: a prospective pilot study. J Am Acad Dermatol. 2020;83:614-616.
- Bull NB, Silverman CD, Bonrath EM. Targeted surgical coaching can improve operative self-assessment ability: a single-blinded nonrandomized trial. Surgery. 2020;167:308-313.
- Eva KW, Regehr G. Self-assessment in the health professions: a reformulation and research agenda. Acad Med. 2005;80(10 suppl):S46-S54.
PRACTICE POINTS
- Video-based coaching (VBC) for surgical procedures is an up-and-coming form of medical education that allows a “coach” to provide thoughtful and in-depth feedback while reviewing a recording with the surgeon in a private setting. This format has potential utility in teaching dermatology resident surgeons being coached by a dermatology faculty member.
- We performed a pilot study demonstrating that VBC can be performed easily with a minimal time investment for both the surgeon and the coach. Dermatology residents not only felt that VBC was an effective teaching method but also should become a formal part of their education.
Perceived Benefits of a Research Fellowship for Dermatology Residency Applicants: Outcomes of a Faculty-Reported Survey
Dermatology residency positions continue to be highly coveted among applicants in the match. In 2019, dermatology proved to be the most competitive specialty, with 36.3% of US medical school seniors and independent applicants going unmatched.1 Prior to the transition to a pass/fail system, the mean US Medical Licensing Examination (USMLE) Step 1 score for matched applicants increased from 247 in 2014 to 251 in 2019. The growing number of scholarly activities reported by applicants has contributed to the competitiveness of the specialty. In 2018, the mean number of abstracts, presentations, and publications reported by matched applicants was 14.71, which was higher than other competitive specialties, including orthopedic surgery and otolaryngology (11.5 and 10.4, respectively). Dermatology applicants who did not match in 2018 reported a mean of 8.6 abstracts, presentations, and publications, which was on par with successful applicants in many other specialties.1 In 2011, Stratman and Ness2 found that publishing manuscripts and listing research experience were factors strongly associated with matching into dermatology for reapplicants. These trends in reported research have added pressure for applicants to increase their publications.
Given that many students do not choose a career in dermatology until later in medical school, some students choose to take a gap year between their third and fourth years of medical school to pursue a research fellowship (RF) and produce publications, in theory to increase the chances of matching in dermatology. A survey of dermatology applicants conducted by Costello et al3 in 2021 found that, of the students who completed a gap year (n=90; 31.25%), 78.7% (n=71) of them completed an RF, and those who completed RFs were more likely to match at top dermatology residency programs (P<.01). The authors also reported that there was no significant difference in overall match rates between gap-year and non–gap-year applicants.3 Another survey of 328 medical students found that the most common reason students take years off for research during medical school is to increase competitiveness for residency application.4 Although it is clear that students completing an RF often find success in the match, there are limited published data on how those involved in selecting dermatology residents view this additional year. We surveyed faculty members participating in the resident selection process to assess their viewpoints on how RFs factored into an applicant’s odds of matching into dermatology residency and performance as a resident.
Materials and Methods
An institutional review board application was submitted through the Geisinger Health System (Danville, Pennsylvania), and an exemption to complete the survey was granted. The survey consisted of 16 questions via REDCap electronic data capture and was sent to a listserve of dermatology program directors who were asked to distribute the survey to program chairs and faculty members within their department. Survey questions evaluated the participants’ involvement in medical student advising and the residency selection process. Questions relating to the respondents’ opinions were based on a 5-point Likert scale on level of agreement (1=strongly agree; 5=strongly disagree) or importance (1=a great deal; 5=not at all). All responses were collected anonymously. Data points were compiled and analyzed using REDCap. Statistical analysis via χ2 tests were conducted when appropriate.
Results
The survey was sent to 142 individuals and distributed to faculty members within those departments between August 16, 2019, and September 24, 2019. The survey elicited a total of 110 respondents. Demographic information is shown in eTable 1. Of these respondents, 35.5% were program directors, 23.6% were program chairs, 3.6% were both program director and program chair, and 37.3% were core faculty members. Although respondents’ roles were varied, 96.4% indicated that they were involved in both advising medical students and in selecting residents.
None of the respondents indicated that they always recommend that students complete an RF, and only 4.5% indicated that they usually recommend it; 40% of respondents rarely or never recommend an RF, while 55.5% sometimes recommend it. Although there was a variety of responses to how frequently faculty members recommend an RF, almost all respondents (98.2%) agreed that the reason medical students pursued an RF prior to residency application was to increase the competitiveness of their residency application. However, 20% of respondents believed that students in this cohort were seeking to gain a deeper understanding of the specialty, and 27.3% thought that this cohort had genuine interest in research. Interestingly, despite the medical students’ intentions of choosing an RF, most respondents (67.3%) agreed or strongly agreed that the publications produced by fellows make an impact on the dermatologic scientific community.
Although some respondents indicated that completion of an RF positively impacts resident performance with regard to patient care, most indicated that the impact was a little (26.4%) or not at all (50%). Additionally, a minority of respondents (11.8%) believed that RFs positively impact resident performance on in-service and board examinations at least a moderate amount, with 62.7% indicating no positive impact at all. Only 12.7% of participants agreed or strongly agreed that completion of an RF led to increased applicant involvement in research throughout their career, and most (73.6%) believed there were downsides to completing an RF. Finally, only 20% agreed or strongly agreed that students who completed an RF were more dedicated to the field of dermatology (eTable 2).
Further evaluation of the data indicated that the perceived utility of RFs did not affect respondents’ recommendation on whether to pursue an RF or not. For example, of the 4.5% of respondents who indicated that they always or usually recommended RFs, only 1 respondent believed that students who completed an RF were more dedicated to the field of dermatology than those who did not. Although 55.5% of respondents answered that they sometimes recommended completion of an RF, less than a quarter of this group believed that students who completed an RF were more likely to be heavily involved in research throughout their career (P=.99).
Overall, 11.8% of respondents indicated that completion of a dermatology RF influenced the evaluation of an applicant a great deal or a lot, while 53.6% of respondents indicated a little or no influence at all. Most respondents (62.8%) agreed or strongly agreed that completion of an RF can compensate for flaws in a residency application. Furthermore, when asked if completion of an RF could set 2 otherwise equivocal applicants apart from one another, 46.4% of respondents agreed or strongly agreed with the statement, while only 17.3% disagreed or strongly disagreed (eTable 2).
Comment
This study characterized how completion of an RF is viewed by those involved in advising medical students and selecting dermatology residents. The growing pressure for applicants to increase the number of publications combined with the competitiveness of applying for a dermatology residency position has led to increased participation in RFs. However, studies have found that students who completed an RF often did so despite a lack of interest.4 Nonetheless, little is known about how this is perceived by those involved in choosing residents.
We found that few respondents always or usually advised applicants to complete an RF, but the majority sometimes recommended them, demonstrating the complexity of this issue. Completion of an RF impacted 11.8% of respondents’ overall opinion of an applicant a lot or a great deal, while most respondents (53.6%) were influenced a little or not at all. However, 46.4% of respondents indicated that completion of a dermatology RF would set apart 2 applicants of otherwise equal standing, and 62.8% agreed or strongly agreed that completion of an RF would compensate for flaws in an application. These responses align with the findings of a study conducted by Kaffenberger et al,5 who surveyed members of the Association of Professors of Dermatology and found that 74.5% (73/98) of mentors almost always or sometimes recommended a research gap year for reasons that included low grades, low USMLE Step scores, and little research. These data suggest that completion of an RF can give a competitive advantage to applicants despite most advisors acknowledging that these applicants are not likely to be involved in research throughout their careers, perform better on standardized examinations, or provide better patient care.
Given the complexity of this issue, respondents may not have been able to accurately answer the question about how much an RF influenced their overall opinion of an applicant because of subconscious bias. Furthermore, respondents likely tailored their recommendations to complete an RF based on individual applicant strengths and weaknesses, and the specific reasons why one may recommend an RF need to be further investigated.
Although there may be other perceived advantages to RFs that were not captured by our survey, completion of a dermatology RF is not without disadvantages. Fellowships often are unfunded and offered in cities with high costs of living. Additionally, students are forced to delay graduation from medical school by a year at minimum and continue to accrue interest on medical school loans during this time. The financial burdens of completing an RF may exclude students of lower socioeconomic status and contribute to a decrease in diversity within the field. Dermatology has been found to be the second least diverse specialty, behind orthopedics.6 Soliman et al7 found that racial minorities and low-income students were more likely to cite socioeconomic barriers as factors involved in their decision not to pursue a career in dermatology. This notion was supported by Rinderknecht et al,8 who found that Black and Latinx dermatology applicants were more likely to come from disadvantaged backgrounds, and Black applicants were more likely to indicate financial concerns as their primary reason for not pursuing an RF. The impact of accumulated student debt and decreased access should be carefully weighed against the potential benefits of an RF. However, as the USMLE transitions their Step 1 score reporting from numerical to a pass/fail system, it also is possible that dermatology programs will place more emphasis on research productivity when evaluating applications for residency. Overall, the decision to recommend an RF represents an extremely complex topic, as indicated by the results of this study.
Limitations—Our survey-based study is limited by response rate and response bias. Despite the large number of responses, the overall response rate cannot be determined because it is unknown how many total faculty members actually received the survey. Moreover, data collected from current dermatology residents who have completed RFs vs those who have not as they pertain to resident performance and preparedness for the rigors of a dermatology residency would be useful.
- National Resident Matching Program. Results and Data: 2019 Main Residency Match. National Resident Matching Program; 2019. Accessed September 13, 2023. https://www.nrmp.org/wp-content/uploads/2021/07/NRMP-Results-and-Data-2019_04112019_final.pdf
- Stratman EJ, Ness RM. Factors associated with successful matching to dermatology residency programs by reapplicants and other applicants who previously graduated from medical school. Arch Dermatol. 2011;147:196-202.
- Costello CM, Harvey JA, Besch-Stokes JG, et al. The role research gap-years play in a successful dermatology match. J Am Acad Dermatol. 2021;85:AB22.
- Pathipati AS, Taleghani N. Research in medical school: a survey evaluating why medical students take research years. Cureus. 2016;8:E741.
- Kaffenberger J, Lee B, Ahmed AM. How to advise medical students interested in dermatology: a survey of academic dermatology mentors. Cutis. 2023;111:124-127.
- Pandya AG, Alexis AF, Berger TG, et al. Increasing racial and ethnic diversity in dermatology: a call to action. J Am Acad Dermatol. 2016;74:584-587.
- Soliman YS, Rzepecki AK, Guzman AK, et al. Understanding perceived barriers of minority medical students pursuing a career in dermatology. JAMA Dermatol. 2019;155:252-254.
- Rinderknecht FA, Brumfiel CM, Jefferson IS, et al. Differences in underrepresented in medicine applicant backgrounds and outcomes in the 2020-2021 dermatology residency match. Cutis. 2022;110:76-79.
Dermatology residency positions continue to be highly coveted among applicants in the match. In 2019, dermatology proved to be the most competitive specialty, with 36.3% of US medical school seniors and independent applicants going unmatched.1 Prior to the transition to a pass/fail system, the mean US Medical Licensing Examination (USMLE) Step 1 score for matched applicants increased from 247 in 2014 to 251 in 2019. The growing number of scholarly activities reported by applicants has contributed to the competitiveness of the specialty. In 2018, the mean number of abstracts, presentations, and publications reported by matched applicants was 14.71, which was higher than other competitive specialties, including orthopedic surgery and otolaryngology (11.5 and 10.4, respectively). Dermatology applicants who did not match in 2018 reported a mean of 8.6 abstracts, presentations, and publications, which was on par with successful applicants in many other specialties.1 In 2011, Stratman and Ness2 found that publishing manuscripts and listing research experience were factors strongly associated with matching into dermatology for reapplicants. These trends in reported research have added pressure for applicants to increase their publications.
Given that many students do not choose a career in dermatology until later in medical school, some students choose to take a gap year between their third and fourth years of medical school to pursue a research fellowship (RF) and produce publications, in theory to increase the chances of matching in dermatology. A survey of dermatology applicants conducted by Costello et al3 in 2021 found that, of the students who completed a gap year (n=90; 31.25%), 78.7% (n=71) of them completed an RF, and those who completed RFs were more likely to match at top dermatology residency programs (P<.01). The authors also reported that there was no significant difference in overall match rates between gap-year and non–gap-year applicants.3 Another survey of 328 medical students found that the most common reason students take years off for research during medical school is to increase competitiveness for residency application.4 Although it is clear that students completing an RF often find success in the match, there are limited published data on how those involved in selecting dermatology residents view this additional year. We surveyed faculty members participating in the resident selection process to assess their viewpoints on how RFs factored into an applicant’s odds of matching into dermatology residency and performance as a resident.
Materials and Methods
An institutional review board application was submitted through the Geisinger Health System (Danville, Pennsylvania), and an exemption to complete the survey was granted. The survey consisted of 16 questions via REDCap electronic data capture and was sent to a listserve of dermatology program directors who were asked to distribute the survey to program chairs and faculty members within their department. Survey questions evaluated the participants’ involvement in medical student advising and the residency selection process. Questions relating to the respondents’ opinions were based on a 5-point Likert scale on level of agreement (1=strongly agree; 5=strongly disagree) or importance (1=a great deal; 5=not at all). All responses were collected anonymously. Data points were compiled and analyzed using REDCap. Statistical analysis via χ2 tests were conducted when appropriate.
Results
The survey was sent to 142 individuals and distributed to faculty members within those departments between August 16, 2019, and September 24, 2019. The survey elicited a total of 110 respondents. Demographic information is shown in eTable 1. Of these respondents, 35.5% were program directors, 23.6% were program chairs, 3.6% were both program director and program chair, and 37.3% were core faculty members. Although respondents’ roles were varied, 96.4% indicated that they were involved in both advising medical students and in selecting residents.
None of the respondents indicated that they always recommend that students complete an RF, and only 4.5% indicated that they usually recommend it; 40% of respondents rarely or never recommend an RF, while 55.5% sometimes recommend it. Although there was a variety of responses to how frequently faculty members recommend an RF, almost all respondents (98.2%) agreed that the reason medical students pursued an RF prior to residency application was to increase the competitiveness of their residency application. However, 20% of respondents believed that students in this cohort were seeking to gain a deeper understanding of the specialty, and 27.3% thought that this cohort had genuine interest in research. Interestingly, despite the medical students’ intentions of choosing an RF, most respondents (67.3%) agreed or strongly agreed that the publications produced by fellows make an impact on the dermatologic scientific community.
Although some respondents indicated that completion of an RF positively impacts resident performance with regard to patient care, most indicated that the impact was a little (26.4%) or not at all (50%). Additionally, a minority of respondents (11.8%) believed that RFs positively impact resident performance on in-service and board examinations at least a moderate amount, with 62.7% indicating no positive impact at all. Only 12.7% of participants agreed or strongly agreed that completion of an RF led to increased applicant involvement in research throughout their career, and most (73.6%) believed there were downsides to completing an RF. Finally, only 20% agreed or strongly agreed that students who completed an RF were more dedicated to the field of dermatology (eTable 2).
Further evaluation of the data indicated that the perceived utility of RFs did not affect respondents’ recommendation on whether to pursue an RF or not. For example, of the 4.5% of respondents who indicated that they always or usually recommended RFs, only 1 respondent believed that students who completed an RF were more dedicated to the field of dermatology than those who did not. Although 55.5% of respondents answered that they sometimes recommended completion of an RF, less than a quarter of this group believed that students who completed an RF were more likely to be heavily involved in research throughout their career (P=.99).
Overall, 11.8% of respondents indicated that completion of a dermatology RF influenced the evaluation of an applicant a great deal or a lot, while 53.6% of respondents indicated a little or no influence at all. Most respondents (62.8%) agreed or strongly agreed that completion of an RF can compensate for flaws in a residency application. Furthermore, when asked if completion of an RF could set 2 otherwise equivocal applicants apart from one another, 46.4% of respondents agreed or strongly agreed with the statement, while only 17.3% disagreed or strongly disagreed (eTable 2).
Comment
This study characterized how completion of an RF is viewed by those involved in advising medical students and selecting dermatology residents. The growing pressure for applicants to increase the number of publications combined with the competitiveness of applying for a dermatology residency position has led to increased participation in RFs. However, studies have found that students who completed an RF often did so despite a lack of interest.4 Nonetheless, little is known about how this is perceived by those involved in choosing residents.
We found that few respondents always or usually advised applicants to complete an RF, but the majority sometimes recommended them, demonstrating the complexity of this issue. Completion of an RF impacted 11.8% of respondents’ overall opinion of an applicant a lot or a great deal, while most respondents (53.6%) were influenced a little or not at all. However, 46.4% of respondents indicated that completion of a dermatology RF would set apart 2 applicants of otherwise equal standing, and 62.8% agreed or strongly agreed that completion of an RF would compensate for flaws in an application. These responses align with the findings of a study conducted by Kaffenberger et al,5 who surveyed members of the Association of Professors of Dermatology and found that 74.5% (73/98) of mentors almost always or sometimes recommended a research gap year for reasons that included low grades, low USMLE Step scores, and little research. These data suggest that completion of an RF can give a competitive advantage to applicants despite most advisors acknowledging that these applicants are not likely to be involved in research throughout their careers, perform better on standardized examinations, or provide better patient care.
Given the complexity of this issue, respondents may not have been able to accurately answer the question about how much an RF influenced their overall opinion of an applicant because of subconscious bias. Furthermore, respondents likely tailored their recommendations to complete an RF based on individual applicant strengths and weaknesses, and the specific reasons why one may recommend an RF need to be further investigated.
Although there may be other perceived advantages to RFs that were not captured by our survey, completion of a dermatology RF is not without disadvantages. Fellowships often are unfunded and offered in cities with high costs of living. Additionally, students are forced to delay graduation from medical school by a year at minimum and continue to accrue interest on medical school loans during this time. The financial burdens of completing an RF may exclude students of lower socioeconomic status and contribute to a decrease in diversity within the field. Dermatology has been found to be the second least diverse specialty, behind orthopedics.6 Soliman et al7 found that racial minorities and low-income students were more likely to cite socioeconomic barriers as factors involved in their decision not to pursue a career in dermatology. This notion was supported by Rinderknecht et al,8 who found that Black and Latinx dermatology applicants were more likely to come from disadvantaged backgrounds, and Black applicants were more likely to indicate financial concerns as their primary reason for not pursuing an RF. The impact of accumulated student debt and decreased access should be carefully weighed against the potential benefits of an RF. However, as the USMLE transitions their Step 1 score reporting from numerical to a pass/fail system, it also is possible that dermatology programs will place more emphasis on research productivity when evaluating applications for residency. Overall, the decision to recommend an RF represents an extremely complex topic, as indicated by the results of this study.
Limitations—Our survey-based study is limited by response rate and response bias. Despite the large number of responses, the overall response rate cannot be determined because it is unknown how many total faculty members actually received the survey. Moreover, data collected from current dermatology residents who have completed RFs vs those who have not as they pertain to resident performance and preparedness for the rigors of a dermatology residency would be useful.
Dermatology residency positions continue to be highly coveted among applicants in the match. In 2019, dermatology proved to be the most competitive specialty, with 36.3% of US medical school seniors and independent applicants going unmatched.1 Prior to the transition to a pass/fail system, the mean US Medical Licensing Examination (USMLE) Step 1 score for matched applicants increased from 247 in 2014 to 251 in 2019. The growing number of scholarly activities reported by applicants has contributed to the competitiveness of the specialty. In 2018, the mean number of abstracts, presentations, and publications reported by matched applicants was 14.71, which was higher than other competitive specialties, including orthopedic surgery and otolaryngology (11.5 and 10.4, respectively). Dermatology applicants who did not match in 2018 reported a mean of 8.6 abstracts, presentations, and publications, which was on par with successful applicants in many other specialties.1 In 2011, Stratman and Ness2 found that publishing manuscripts and listing research experience were factors strongly associated with matching into dermatology for reapplicants. These trends in reported research have added pressure for applicants to increase their publications.
Given that many students do not choose a career in dermatology until later in medical school, some students choose to take a gap year between their third and fourth years of medical school to pursue a research fellowship (RF) and produce publications, in theory to increase the chances of matching in dermatology. A survey of dermatology applicants conducted by Costello et al3 in 2021 found that, of the students who completed a gap year (n=90; 31.25%), 78.7% (n=71) of them completed an RF, and those who completed RFs were more likely to match at top dermatology residency programs (P<.01). The authors also reported that there was no significant difference in overall match rates between gap-year and non–gap-year applicants.3 Another survey of 328 medical students found that the most common reason students take years off for research during medical school is to increase competitiveness for residency application.4 Although it is clear that students completing an RF often find success in the match, there are limited published data on how those involved in selecting dermatology residents view this additional year. We surveyed faculty members participating in the resident selection process to assess their viewpoints on how RFs factored into an applicant’s odds of matching into dermatology residency and performance as a resident.
Materials and Methods
An institutional review board application was submitted through the Geisinger Health System (Danville, Pennsylvania), and an exemption to complete the survey was granted. The survey consisted of 16 questions via REDCap electronic data capture and was sent to a listserve of dermatology program directors who were asked to distribute the survey to program chairs and faculty members within their department. Survey questions evaluated the participants’ involvement in medical student advising and the residency selection process. Questions relating to the respondents’ opinions were based on a 5-point Likert scale on level of agreement (1=strongly agree; 5=strongly disagree) or importance (1=a great deal; 5=not at all). All responses were collected anonymously. Data points were compiled and analyzed using REDCap. Statistical analysis via χ2 tests were conducted when appropriate.
Results
The survey was sent to 142 individuals and distributed to faculty members within those departments between August 16, 2019, and September 24, 2019. The survey elicited a total of 110 respondents. Demographic information is shown in eTable 1. Of these respondents, 35.5% were program directors, 23.6% were program chairs, 3.6% were both program director and program chair, and 37.3% were core faculty members. Although respondents’ roles were varied, 96.4% indicated that they were involved in both advising medical students and in selecting residents.
None of the respondents indicated that they always recommend that students complete an RF, and only 4.5% indicated that they usually recommend it; 40% of respondents rarely or never recommend an RF, while 55.5% sometimes recommend it. Although there was a variety of responses to how frequently faculty members recommend an RF, almost all respondents (98.2%) agreed that the reason medical students pursued an RF prior to residency application was to increase the competitiveness of their residency application. However, 20% of respondents believed that students in this cohort were seeking to gain a deeper understanding of the specialty, and 27.3% thought that this cohort had genuine interest in research. Interestingly, despite the medical students’ intentions of choosing an RF, most respondents (67.3%) agreed or strongly agreed that the publications produced by fellows make an impact on the dermatologic scientific community.
Although some respondents indicated that completion of an RF positively impacts resident performance with regard to patient care, most indicated that the impact was a little (26.4%) or not at all (50%). Additionally, a minority of respondents (11.8%) believed that RFs positively impact resident performance on in-service and board examinations at least a moderate amount, with 62.7% indicating no positive impact at all. Only 12.7% of participants agreed or strongly agreed that completion of an RF led to increased applicant involvement in research throughout their career, and most (73.6%) believed there were downsides to completing an RF. Finally, only 20% agreed or strongly agreed that students who completed an RF were more dedicated to the field of dermatology (eTable 2).
Further evaluation of the data indicated that the perceived utility of RFs did not affect respondents’ recommendation on whether to pursue an RF or not. For example, of the 4.5% of respondents who indicated that they always or usually recommended RFs, only 1 respondent believed that students who completed an RF were more dedicated to the field of dermatology than those who did not. Although 55.5% of respondents answered that they sometimes recommended completion of an RF, less than a quarter of this group believed that students who completed an RF were more likely to be heavily involved in research throughout their career (P=.99).
Overall, 11.8% of respondents indicated that completion of a dermatology RF influenced the evaluation of an applicant a great deal or a lot, while 53.6% of respondents indicated a little or no influence at all. Most respondents (62.8%) agreed or strongly agreed that completion of an RF can compensate for flaws in a residency application. Furthermore, when asked if completion of an RF could set 2 otherwise equivocal applicants apart from one another, 46.4% of respondents agreed or strongly agreed with the statement, while only 17.3% disagreed or strongly disagreed (eTable 2).
Comment
This study characterized how completion of an RF is viewed by those involved in advising medical students and selecting dermatology residents. The growing pressure for applicants to increase the number of publications combined with the competitiveness of applying for a dermatology residency position has led to increased participation in RFs. However, studies have found that students who completed an RF often did so despite a lack of interest.4 Nonetheless, little is known about how this is perceived by those involved in choosing residents.
We found that few respondents always or usually advised applicants to complete an RF, but the majority sometimes recommended them, demonstrating the complexity of this issue. Completion of an RF impacted 11.8% of respondents’ overall opinion of an applicant a lot or a great deal, while most respondents (53.6%) were influenced a little or not at all. However, 46.4% of respondents indicated that completion of a dermatology RF would set apart 2 applicants of otherwise equal standing, and 62.8% agreed or strongly agreed that completion of an RF would compensate for flaws in an application. These responses align with the findings of a study conducted by Kaffenberger et al,5 who surveyed members of the Association of Professors of Dermatology and found that 74.5% (73/98) of mentors almost always or sometimes recommended a research gap year for reasons that included low grades, low USMLE Step scores, and little research. These data suggest that completion of an RF can give a competitive advantage to applicants despite most advisors acknowledging that these applicants are not likely to be involved in research throughout their careers, perform better on standardized examinations, or provide better patient care.
Given the complexity of this issue, respondents may not have been able to accurately answer the question about how much an RF influenced their overall opinion of an applicant because of subconscious bias. Furthermore, respondents likely tailored their recommendations to complete an RF based on individual applicant strengths and weaknesses, and the specific reasons why one may recommend an RF need to be further investigated.
Although there may be other perceived advantages to RFs that were not captured by our survey, completion of a dermatology RF is not without disadvantages. Fellowships often are unfunded and offered in cities with high costs of living. Additionally, students are forced to delay graduation from medical school by a year at minimum and continue to accrue interest on medical school loans during this time. The financial burdens of completing an RF may exclude students of lower socioeconomic status and contribute to a decrease in diversity within the field. Dermatology has been found to be the second least diverse specialty, behind orthopedics.6 Soliman et al7 found that racial minorities and low-income students were more likely to cite socioeconomic barriers as factors involved in their decision not to pursue a career in dermatology. This notion was supported by Rinderknecht et al,8 who found that Black and Latinx dermatology applicants were more likely to come from disadvantaged backgrounds, and Black applicants were more likely to indicate financial concerns as their primary reason for not pursuing an RF. The impact of accumulated student debt and decreased access should be carefully weighed against the potential benefits of an RF. However, as the USMLE transitions their Step 1 score reporting from numerical to a pass/fail system, it also is possible that dermatology programs will place more emphasis on research productivity when evaluating applications for residency. Overall, the decision to recommend an RF represents an extremely complex topic, as indicated by the results of this study.
Limitations—Our survey-based study is limited by response rate and response bias. Despite the large number of responses, the overall response rate cannot be determined because it is unknown how many total faculty members actually received the survey. Moreover, data collected from current dermatology residents who have completed RFs vs those who have not as they pertain to resident performance and preparedness for the rigors of a dermatology residency would be useful.
- National Resident Matching Program. Results and Data: 2019 Main Residency Match. National Resident Matching Program; 2019. Accessed September 13, 2023. https://www.nrmp.org/wp-content/uploads/2021/07/NRMP-Results-and-Data-2019_04112019_final.pdf
- Stratman EJ, Ness RM. Factors associated with successful matching to dermatology residency programs by reapplicants and other applicants who previously graduated from medical school. Arch Dermatol. 2011;147:196-202.
- Costello CM, Harvey JA, Besch-Stokes JG, et al. The role research gap-years play in a successful dermatology match. J Am Acad Dermatol. 2021;85:AB22.
- Pathipati AS, Taleghani N. Research in medical school: a survey evaluating why medical students take research years. Cureus. 2016;8:E741.
- Kaffenberger J, Lee B, Ahmed AM. How to advise medical students interested in dermatology: a survey of academic dermatology mentors. Cutis. 2023;111:124-127.
- Pandya AG, Alexis AF, Berger TG, et al. Increasing racial and ethnic diversity in dermatology: a call to action. J Am Acad Dermatol. 2016;74:584-587.
- Soliman YS, Rzepecki AK, Guzman AK, et al. Understanding perceived barriers of minority medical students pursuing a career in dermatology. JAMA Dermatol. 2019;155:252-254.
- Rinderknecht FA, Brumfiel CM, Jefferson IS, et al. Differences in underrepresented in medicine applicant backgrounds and outcomes in the 2020-2021 dermatology residency match. Cutis. 2022;110:76-79.
- National Resident Matching Program. Results and Data: 2019 Main Residency Match. National Resident Matching Program; 2019. Accessed September 13, 2023. https://www.nrmp.org/wp-content/uploads/2021/07/NRMP-Results-and-Data-2019_04112019_final.pdf
- Stratman EJ, Ness RM. Factors associated with successful matching to dermatology residency programs by reapplicants and other applicants who previously graduated from medical school. Arch Dermatol. 2011;147:196-202.
- Costello CM, Harvey JA, Besch-Stokes JG, et al. The role research gap-years play in a successful dermatology match. J Am Acad Dermatol. 2021;85:AB22.
- Pathipati AS, Taleghani N. Research in medical school: a survey evaluating why medical students take research years. Cureus. 2016;8:E741.
- Kaffenberger J, Lee B, Ahmed AM. How to advise medical students interested in dermatology: a survey of academic dermatology mentors. Cutis. 2023;111:124-127.
- Pandya AG, Alexis AF, Berger TG, et al. Increasing racial and ethnic diversity in dermatology: a call to action. J Am Acad Dermatol. 2016;74:584-587.
- Soliman YS, Rzepecki AK, Guzman AK, et al. Understanding perceived barriers of minority medical students pursuing a career in dermatology. JAMA Dermatol. 2019;155:252-254.
- Rinderknecht FA, Brumfiel CM, Jefferson IS, et al. Differences in underrepresented in medicine applicant backgrounds and outcomes in the 2020-2021 dermatology residency match. Cutis. 2022;110:76-79.
PRACTICE POINTS
- Many medical students seeking to match into a dermatology residency program complete a research fellowship (RF).
- Completion of an RF can give a competitive advantage to applicants even though most advisors acknowledge that these applicants are not likely to be involved in research throughout their career, perform better on standardized examinations, or provide better patient care.
- The decision to recommend an RF represents an extremely complex topic and should be tailored to each individual applicant.
Assessment of the Efficacy of Tranexamic Acid Solution 5% in the Treatment of Melasma in Patients of South Asian Descent
Melasma is a complex, long-lasting, acquired dermatologic pigmentation disorder resulting in grey-brown patches that last for more than 3 months. Sun-exposed areas including the nose, cheeks, forehead, and forearms are most likely to be affected.1 In Southeast Asia, 0.25% to 4% of the population affected by melasma is aged 30 to 40 years.2 In particular, melasma is a concern among pregnant women due to increased levels of melanocyte-stimulating hormones (MSHs) and is impacted by genetics, hormonal influence, and exposure to UV light.3,4 In Pakistan, approximately 46% of women are affected by melasma during pregnancy.2,5 Although few studies have focused on the clinical approaches to melasma in darker skin types, it continues to disproportionately affect the skin of color population.4
The areas of hyperpigmentation seen in melasma exhibit increased deposition of melanin in the epidermis and dermis, but melanocytes are not elevated. However, in areas of hyperpigmentation, the melanocytes are larger and more dendritic and demonstrate an increased level of melanogenesis.6 During pregnancy, especially in the third trimester, elevated levels of estrogen, progesterone, and MSH often are found in association with melasma.7 Tyrosinase (TYR) activity increases and cellular proliferation is reduced after treatment of melanocytes in culture with β-estradiol.8 Sex steroids increase transcription of genes encoding melanogenic enzymes in normal human melanocytes, especially TYR.9 These results are consistent with the notable increases in melanin synthesis and TYR activity reported for normal human melanocytes under similar conditions in culture.10 Because melanocytes contain both cytosolic and nuclear estrogen receptors, melanocytes in patients with melasma may be inherently more sensitive to the stimulatory effects of estrogens and possibly other steroid hormones.11
The current treatment options for melasma have varying levels of success and include topical depigmenting agents such as hydroquinone, tretinoin, azelaic acid, kojic acid, and corticosteroids; dermabrasion; and chemical peels.12-14 Chemical peels with glycolic acid, salicylic acid, lactic acid, trichloroacetic acid, and phenol, as well as laser therapy, are reliable management options.13,14 Traditionally, melasma has been treated with a combination of modalities along with photoprotection and trigger avoidance.12
The efficacy and safety of the available therapies for melasma are still controversial and require further exploration. In recent years, off-label tranexamic acid (TA) has emerged as a potential therapy for melasma. Although the mechanism of action remains unclear, TA may inhibit melanin synthesis by blocking the interaction between melanocytes and keratinocytes.15 Tranexamic acid also may reverse the abnormal dermal changes associated with melasma by inhibiting melanogenesis and angiogenesis.16
Although various therapeutic options exist for melasma, the search for a reliable option in patients with darker skin types continues.13 We sought to evaluate the efficacy of TA solution 5% in reducing the severity of melasma in South Asian patients, thereby improving patient outcomes and maximizing patient satisfaction. Topical TA is inexpensive and readily accessible and does not cause systemic side effects. These qualities make it a promising treatment compared to traditional therapies.
Methods
We conducted a randomized controlled trial at Rawalpindi Medical Institute (Punjab, Pakistan). The researchers obtained informed consent for all enrolled patients. Cases were sampled from the original patient population seen at the office using nonprobability consecutive sampling. The sample size was calculated with a 95% CI, margin of error of 9%, and expected percentage of efficacy of 86.1% by using TA solution 5%. South Asian male and female patients aged 20 to 45 years with melasma were included in the analysis. Patients were excluded if they were already taking TA, oral contraceptive pills, or photosensitizing drugs (eg, nonsteroidal anti-inflammatory drugs, tetracyclines, phenytoin, carbamazepine); were pregnant; had chronic kidney disease (creatinine >2.0 mg/dL); had cardiac abnormalities (abnormal electrocardiogram); had hematologic disorders (international normalized ratio >2); or had received another melasma treatment within the last 3 to 6 months.
All enrolled patients underwent a detailed history and physical examination. Patient demographics were subsequently noted, including age, sex, history of diabetes mellitus or hypertension, and duration of melasma. The melasma area and severity index (MASI) score of each patient was calculated at baseline, and a corresponding photograph was taken.
The topical solution was prepared with 5 g of TA dissolved in 10 cc of ethanol at 96 °F, 10 cc of 1,3-butanediol, and distilled water up to 100 cc. The TA solution was applied to the affected areas once daily by the patient for 12 weeks. Each application covered the affected areas completely. Patients were instructed to apply sunscreen with sun protection factor 60 to those same areas for UV protection after 15 minutes of TA application. Biweekly follow-ups were scheduled during the trial, and the MASI score was recorded at these visits. If the mean MASI score was reduced by half after 12 weeks of treatment, then the treatment was considered efficacious with a 95% CI.
The percentage reduction from baseline was calculated as follows: percentage reduction=(baseline score– follow-up score)/baseline score×100.
Statistical Analysis—Data were analyzed in SPSS Statistics 25 (IBM). The quantitative variables of age, duration of melasma, and body mass index were presented as mean (SD). Qualitative variables such as sex, history of diabetes mellitus or hypertension, site of melasma, and efficacy were presented as frequencies and percentages. Mean MASI scores at baseline and 12 weeks posttreatment were compared using a paired t test (P≤.05). Data were stratified for age, sex, history of diabetes mellitus or hypertension, site of melasma, and duration of melasma, and a χ2 test was applied to compare efficacy in stratified groups (P≤.05).
Results
Sixty patients were enrolled in the study. Of them, 17 (28.33%) were male, and 43 (71.67%) were female (2:5 ratio). They ranged in age from 20 to 45 years (mean [SD], 31.93 [6.26] years). Thirty-seven patients (61.67%) were aged 31 to 45 years of age (Table 1). The mean (SD) duration of disease was 10.18 (2.10) months. The response to TA was recorded based on patient distribution according to the site of melasma as well as history of diabetes mellitus and hypertension.
Topical TA was found to be efficacious for melasma in 50 (83.33%) patients. The mean (SD) baseline and week 12 MASI scores were 23.15 (5.02) and 12.71 (4.10)(P<.0001), respectively (Table 2). The stratification of efficacy with respect to age, sex, duration of melasma, site of melasma, and history of diabetes mellitus or hypertension is shown in the eTable. The site of melasma was significant with respect to stratification of efficacy. On the forehead, TA was found to be efficacious in 11 patients and nonefficacious in 0 patients (P=.036). In the malar region, it was efficacious in 16 patients and nonefficacious in 1 patient (P=.036). Finally, on the chin, it was efficacious in 23 patients and nonefficacious in 9 patients (P=.036).
Comment
Melasma Presentation and Development—Melasma is a chronic skin condition that more often affects patients with darker skin types. This condition is characterized by hyperpigmentation of skin that is directly exposed to the sun, such as the cheek, nose, forehead, and above the upper lip.17 Although the mechanism behind how melasma develops is unknown, one theory suggests that UV light can lead to increased plasmin in keratinocytes.18 This increased plasmin will thereby increase the arachidonic acid and α-MSH, leading to the observed uneven hyperpigmentation that is notable in melasma. Melasma is common in patients using oral contraceptives or expired cosmetic drugs; in those who are pregnant; and in those with liver dysfunction.18 Melasma has a negative impact on patients’ quality of life because of substantial psychological and social distress. Thus, finding an accessible treatment is imperative.19
Melasma Management—The most common treatments for melasma have been topical bleaching agents and photoprotection. Combination therapy options include chemical peels, dermabrasion, and laser treatments, though they present with limited efficacy.17,20 Because melasma focuses on pigmentation correction, topical treatments work to disturb melanocyte pigment production at the enzymatic level.21 Tyrosinase is rate limiting in melanin production, as it converts L-tyrosinase to L-3,4-dihydroxyphenylalanine, using copper to interact with L-3,4-dihydroxyphenylalanine as a cofactor in the active site.22 Therefore, tyrosine is a major target for many drugs that have been developed for melasma to decrease melaninization.21
Recently, research has focused on the effects of topical, intradermal, and oral TA for melasma.17 Tranexamic acid most commonly has been used in medicine as a fibrinolytic agent because of its antiplasmin properties. It has been hypothesized that TA can inhibit the release of paracrine melanogenic factors that normally act to stimulate melanocytes.17 Although studies have supported the safety and efficacy of TA, there remains a lack of clinical studies that are sufficiently powered. No definitive consensus on the use of TA for melasma currently exists, which indicates the need for large-scale, randomized, controlled trials.23
One trial (N=25) found that TA solution 5% achieved efficacy (>50% reduction in MASI score from baseline) in 86.1% of patients with melasma.24 In another study (N=18), topical TA 5% achieved efficacy (>50% reduction in MASI score) in 86% of patients with melasma.25
Melasma Comorbidities—To determine if certain comorbidities, such as diabetes mellitus or hypertension, influenced the progression of melasma, we stratified the efficacy results for patients with these 2 comorbidities, which showed no significant difference (P=.794 and P=.101, respectively). Thus, the relatively higher prevalence of diabetes mellitus (16 patients) and hypertension (11 patients) did not contribute to the efficacy of TA in lowering MASI scores over the 12-week period, which supports the findings of Doolan and Gupta,26 who investigated the endocrinologic conditions associated with melasma and found no such association with diabetes mellitus or hypertension.
TA Formulations for Melasma—The efficacy of topical TA has been explored in several studies. Six studies with sample sizes of 13 to 50 patients each showed statistically significant differences in MASI scores between baseline and following TA treatment (P<.001).27-32 Several formulations and regimens were utilized, including TA cream 3% for 12 weeks, TA gel 5% for 12 weeks, TA solution 3% for 12 weeks, TA liposome 5% for 12 weeks, and TA solution 2% for 12 weeks.18 Additionally, these studies found TA to be effective in limiting dyschromia and decreasing MASI scores. There were no statistically significant differences between formulations and method of application. Topical TA has been found to be just as effective as other treatments for melasma, including intradermal TA injections, topical hydroquinone, and a combination of topical hydroquinone and dexamethasone.18
Further study of the efficacy of intradermal TA is necessary because many human trials have lacked statistical significance or a control group. Lee et al32 conducted a trial of 100 female patients who received weekly intradermal TA microinjections for 12 weeks. After 8 and 12 weeks, MASI scores decreased significantly (P<.01).32 Similarly, Badran et al33 observed 60 female patients in 3 trial groups: group A received TA (4 mg/mL) intradermal injections every 2 weeks, group B received TA (10 mg/mL) intradermal injections every 2 weeks, and group C received TA cream 10% twice daily. Although all groups showed improvement in MASI, group B, which had the highest intradermal TA concentration, exhibited the most improvement. Thus, it was determined that intradermal application led to better results, but the cream was still effective.33
Saki et al34 conducted a randomized, split-face trial of 37 patients comparing the efficacy of intradermal TA and topical hydroquinone. Each group was treated with either monthly intradermal TA injections or nightly hydroquinone for 3 months. After 4 weeks of treatment, TA initially had a greater improvement. However, after 20 weeks, the overall changes were not significant between the 2 groups.34 Pazyar et al35 conducted a randomized, split-face trial of 49 patients comparing the efficacy of intradermal TA and hydroquinone cream. After 24 weeks of biweekly TA injections or twice-daily hydroquinone, there were no statistically significant differences in the decreased MASI scores between treatments.35 Additional large, double-blind, controlled trials are needed to thoroughly assess the role of intradermal TA in comparison to its treatment counterpart of hydroquinone.
Ebrahimi and Naeini29 conducted a 12-week, double-blind, split-phase trial of 50 Iranian melasma patients, which showed that 27.3% of patients rated the improvement in melasma as excellent, 42.4% as good, and 30.3% as fair after using TA solution 3%. Wu et al36 also showed a total melasma improvement rate of 80.9% in 256 patients with long-term oral use of TA. In a study by Kim et al31 (N=245), the mean MASI score considerably decreased after topical TA use, with a total response rate of 95.6%. In another study, Atefi et al37 presented significantly increased levels of satisfaction in patients treated with topical TA 5% vs hydroquinone (P=.015).
Melasma in Patients With Darker Skin Types—Special attention must be given to choosing the appropriate medication in melasma patients with darker skin types, as there is an increased risk for postinflammatory hyperpigmentation. Currently, few randomized controlled trials exist that fulfill the criteria of evaluating pharmacologic options for patients with melasma, and even fewer studies solely focus on patients with darker skin types.38 In addition to treatment advances, patients must be educated on the need to avoid sun exposure when possible or to use photoprotection, especially in the South Asian region, where these practices rarely are taught. Our study provided a unique analysis regarding the efficacy of TA solution 5% for the treatment of melasma in patients of South Asian descent. Clinicians can use these findings as a foundation for treating all patients with melasma but particularly those with darker skin types.
Study Limitations—Our study consisted of 60 patients; although our study had more patients than similar trials, larger studies are needed. Additionally, other variables were excluded from our analysis, such as comorbidities beyond diabetes mellitus and hypertension.
Conclusion
This study contributes to the growing field of melasma therapeutics by evaluating the efficacy of using TA solution 5% for the treatment of melasma in South Asian patients with darker skin types. Clinicians may use our study to broaden their treatment options for a common condition while also addressing the lack of clinical options for patients with darker skin types. Further studies investigating the effectiveness of TA in large clinical trials in humans are warranted to understand the efficacy and the risk for any complications.
- Espósito ACC, Brianezi G, De Souza NP, et al. Exploratory study of epidermis, basement membrane zone, upper dermis alterations and Wnt pathway activation in melasma compared to adjacent and retroauricular skin. Ann Dermatol. 2020;32:101-108.
- Janney MS, Subramaniyan R, Dabas R, et al. A randomized controlled study comparing the efficacy of topical 5% tranexamic acid solution versus 3% hydroquinone cream in melasma. J Cutan Aesthet Surg. 2019;12:63-67.
- Chalermchai T, Rummaneethorn P. Effects of a fractional picosecond 1,064 nm laser for the treatment of dermal and mixed type melasmaJ Cosmet Laser Ther. 2018;20:134-139.
- Grimes PE, Ijaz S, Nashawati R, et al. New oral and topical approaches for the treatment of melasma. Int J Womens Dermatol. 2019;5:30-36.
- Handel AC, Miot LDB, Miot HA. Melasma: a clinical and epidemiological review. An Bras Dermatol. 2014;89:771-782.
- Barankin B, Silver SG, Carruthers A. The skin in pregnancy. J Cutan Med Surg. 2002;6:236-240.
- Grimes PE, Yamada N, Bhawan J. Light microscopic, immunohistochemical, and ultrastructural alterations in patients with melasma. Am J Dermatopathol. 2005;27:96-101.
- Smith AG, Shuster S, Thody AJ, et al. Chloasma, oral contraceptives, and plasma immunoreactive beta-melanocyte-stimulating hormone. J Invest Dermatol. 1977;68:169-170.
- Ranson M, Posen S, Mason RS. Human melanocytes as a target tissue for hormones: in vitro studies with 1 alpha-25, dihydroxyvitamin D3, alpha-melanocyte stimulating hormone, and beta-estradiol. J Invest Dermatol. 1988;91:593-598.
- Kippenberger S, Loitsch S, Solano F, et al. Quantification of tyrosinase, TRP-1, and Trp-2 transcripts in human melanocytes by reverse transcriptase-competitive multiplex PCR—regulation by steroid hormones. J Invest Dermatol. 1998;110:364-367.
- McLeod SD, Ranson M, Mason RS. Effects of estrogens on human melanocytes in vitro. J Steroid Biochem Mol Biol. 1994;49:9-14.
- Chalermchai T, Rummaneethorn P. Effects of a fractional picosecond 1,064 nm laser for the treatment of dermal and mixed type melasma. J Cosmet Laser Ther. 2018;20:134-139.
- Sheu SL. Treatment of melasma using tranexamic acid: what’s known and what’s next. Cutis. 2018;101:E7-E8.
- Tian B. The Asian problem of frequent laser toning for melasma. J Clin Aesthet Dermatol. 2017;10:40-42.
- Zhang L, Tan WQ, Fang QQ, et al. Tranexamic acid for adults with melasma: a systematic review and meta-analysis. Biomed Res Int. 2018;2018:1683414.
- Zhu JW, Ni YJ, Tong XY, et al. Tranexamic acid inhibits angiogenesis and melanogenesis in vitro by targeting VEGF receptors. Int J Med Sci. 2020;17:903-911.
- Colferai MMT, Miquelin GM, Steiner D. Evaluation of oral tranexamic acid in the treatment of melasma. J Cosmet Dermatol. 2019;18:1495-1501.
- Taraz M, Niknam S, Ehsani AH. Tranexamic acid in treatment of melasma: a comprehensive review of clinical studies. Dermatol Ther. 2017;30:19-26.
- Yalamanchili R, Shastry V, Betkerur J. Clinico-epidemiological study and quality of life assessment in melasma. Indian J Dermatol. 2015;60:519.
- Kim HJ, Moon SH, Cho SH, et al. Efficacy and safety of tranexamic acid in melasma: a meta-analysis and systematic review. Acta Derm Venereol. 2017;97:776-781.
- Kim YJ, Kim MJ, Kweon DK, et al. Quantification of hypopigmentation activity in vitro. J Vis Exp. 2019;145:20-25.
- Cardoso R, Valente R, Souza da Costa CH, et al. Analysis of kojic acid derivatives as competitive inhibitors of tyrosinase: a molecular modeling approach. Molecules. 2021;26:2875.
- Bala HR, Lee S, Wong C, et al. Oral tranexamic acid for the treatment of melasma: a review. Dermatol Surg. 2018;44:814-825.
- Khuraiya S, Kachhawa D, Chouhan B, et al. A comparative study of topical 5% tranexamic acid and triple combination therapy for the treatment of melasma in Indian population. Pigment International. 2019;6:18-23.
- Steiner D, Feola C, Bialeski N, et al. Study evaluating the efficacy of topical and injected tranexamic acid in treatment of melasma. Surg Cosmet Dermatol. 2009;1:174-177.
- Doolan B, Gupta M. Melasma. Aust J Gen Pract. 2021;50:880-885.
- Banihashemi M, Zabolinejad N, Jaafari MR, et al. Comparison of therapeutic effects of liposomal tranexamic acid and conventional hydroquinone on melasma. J Cosmet Dermatol. 2015;14:174-177.
- Chung JY, Lee JH, Lee JH. Topical tranexamic acid as an adjuvant treatment in melasma: side-by-side comparison clinical study. J Dermatolog Treat. 2016;27:373-377.
- Ebrahimi B, Naeini FF. Topical tranexamic acid as a promising treatment for melasma. J Res Med Sci. 2014;19:753-757.
- Kanechorn Na Ayuthaya P, Niumphradit N, Manosroi A, et al. Topical 5% tranexamic acid for the treatment of melasma in Asians: a double-blind randomized controlled clinical trial. J Cosmet Laser Ther. 2012;14:150-154.
- Kim SJ, Park JY, Shibata T, et al. Efficacy and possible mechanisms of topical tranexamic acid in melasma. Clin Exp Dermatol. 2016;41:480-485.
- Lee JH, Park JG, Lim SH, et al. Localized intradermal microinjection of tranexamic acid for treatment of melasma in Asian patients: a preliminary clinical trial. Dermatol Surg. 2006;32:626-631.
- Badran AY, Ali AU, Gomaa AS. Efficacy of topical versus intradermal injection of tranexamic acid in Egyptian melasma patients: a randomised clinical trial. Australas J Dermatol. 2021;62:E373-E379.
- Saki N, Darayesh M, Heiran A. Comparing the efficacy of topical hydroquinone 2% versus intradermal tranexamic acid microinjections in treating melasma: a split-face controlled trial. J Dermatolog Treat. 2018;29:405-410.
- Pazyar N, Yaghoobi R, Zeynalie M, et al. Comparison of the efficacy of intradermal injected tranexamic acid vs hydroquinone cream in the treatment of melasma. Clin Cosmet Investig Dermatol. 2019;12:115-122.
- Wu S, Shi H, Wu H, et al. Treatment of melasma with oral administration of tranexamic acid. Aesthetic Plast Surg. 2012;36:964-970.
- Atefi N, Dalvand B, Ghassemi M, et al. Therapeutic effects of topical tranexamic acid in comparison with hydroquinone in treatment of women with melasma. Dermatol Ther (Heidelb). 2017;7:417-424.
- Cestari T, Arellano I, Hexsel D, et al. Melasma in Latin America: options for therapy and treatment algorithm. J Eur Acad Dermatol Venereol. 2009;23:760-772.
Melasma is a complex, long-lasting, acquired dermatologic pigmentation disorder resulting in grey-brown patches that last for more than 3 months. Sun-exposed areas including the nose, cheeks, forehead, and forearms are most likely to be affected.1 In Southeast Asia, 0.25% to 4% of the population affected by melasma is aged 30 to 40 years.2 In particular, melasma is a concern among pregnant women due to increased levels of melanocyte-stimulating hormones (MSHs) and is impacted by genetics, hormonal influence, and exposure to UV light.3,4 In Pakistan, approximately 46% of women are affected by melasma during pregnancy.2,5 Although few studies have focused on the clinical approaches to melasma in darker skin types, it continues to disproportionately affect the skin of color population.4
The areas of hyperpigmentation seen in melasma exhibit increased deposition of melanin in the epidermis and dermis, but melanocytes are not elevated. However, in areas of hyperpigmentation, the melanocytes are larger and more dendritic and demonstrate an increased level of melanogenesis.6 During pregnancy, especially in the third trimester, elevated levels of estrogen, progesterone, and MSH often are found in association with melasma.7 Tyrosinase (TYR) activity increases and cellular proliferation is reduced after treatment of melanocytes in culture with β-estradiol.8 Sex steroids increase transcription of genes encoding melanogenic enzymes in normal human melanocytes, especially TYR.9 These results are consistent with the notable increases in melanin synthesis and TYR activity reported for normal human melanocytes under similar conditions in culture.10 Because melanocytes contain both cytosolic and nuclear estrogen receptors, melanocytes in patients with melasma may be inherently more sensitive to the stimulatory effects of estrogens and possibly other steroid hormones.11
The current treatment options for melasma have varying levels of success and include topical depigmenting agents such as hydroquinone, tretinoin, azelaic acid, kojic acid, and corticosteroids; dermabrasion; and chemical peels.12-14 Chemical peels with glycolic acid, salicylic acid, lactic acid, trichloroacetic acid, and phenol, as well as laser therapy, are reliable management options.13,14 Traditionally, melasma has been treated with a combination of modalities along with photoprotection and trigger avoidance.12
The efficacy and safety of the available therapies for melasma are still controversial and require further exploration. In recent years, off-label tranexamic acid (TA) has emerged as a potential therapy for melasma. Although the mechanism of action remains unclear, TA may inhibit melanin synthesis by blocking the interaction between melanocytes and keratinocytes.15 Tranexamic acid also may reverse the abnormal dermal changes associated with melasma by inhibiting melanogenesis and angiogenesis.16
Although various therapeutic options exist for melasma, the search for a reliable option in patients with darker skin types continues.13 We sought to evaluate the efficacy of TA solution 5% in reducing the severity of melasma in South Asian patients, thereby improving patient outcomes and maximizing patient satisfaction. Topical TA is inexpensive and readily accessible and does not cause systemic side effects. These qualities make it a promising treatment compared to traditional therapies.
Methods
We conducted a randomized controlled trial at Rawalpindi Medical Institute (Punjab, Pakistan). The researchers obtained informed consent for all enrolled patients. Cases were sampled from the original patient population seen at the office using nonprobability consecutive sampling. The sample size was calculated with a 95% CI, margin of error of 9%, and expected percentage of efficacy of 86.1% by using TA solution 5%. South Asian male and female patients aged 20 to 45 years with melasma were included in the analysis. Patients were excluded if they were already taking TA, oral contraceptive pills, or photosensitizing drugs (eg, nonsteroidal anti-inflammatory drugs, tetracyclines, phenytoin, carbamazepine); were pregnant; had chronic kidney disease (creatinine >2.0 mg/dL); had cardiac abnormalities (abnormal electrocardiogram); had hematologic disorders (international normalized ratio >2); or had received another melasma treatment within the last 3 to 6 months.
All enrolled patients underwent a detailed history and physical examination. Patient demographics were subsequently noted, including age, sex, history of diabetes mellitus or hypertension, and duration of melasma. The melasma area and severity index (MASI) score of each patient was calculated at baseline, and a corresponding photograph was taken.
The topical solution was prepared with 5 g of TA dissolved in 10 cc of ethanol at 96 °F, 10 cc of 1,3-butanediol, and distilled water up to 100 cc. The TA solution was applied to the affected areas once daily by the patient for 12 weeks. Each application covered the affected areas completely. Patients were instructed to apply sunscreen with sun protection factor 60 to those same areas for UV protection after 15 minutes of TA application. Biweekly follow-ups were scheduled during the trial, and the MASI score was recorded at these visits. If the mean MASI score was reduced by half after 12 weeks of treatment, then the treatment was considered efficacious with a 95% CI.
The percentage reduction from baseline was calculated as follows: percentage reduction=(baseline score– follow-up score)/baseline score×100.
Statistical Analysis—Data were analyzed in SPSS Statistics 25 (IBM). The quantitative variables of age, duration of melasma, and body mass index were presented as mean (SD). Qualitative variables such as sex, history of diabetes mellitus or hypertension, site of melasma, and efficacy were presented as frequencies and percentages. Mean MASI scores at baseline and 12 weeks posttreatment were compared using a paired t test (P≤.05). Data were stratified for age, sex, history of diabetes mellitus or hypertension, site of melasma, and duration of melasma, and a χ2 test was applied to compare efficacy in stratified groups (P≤.05).
Results
Sixty patients were enrolled in the study. Of them, 17 (28.33%) were male, and 43 (71.67%) were female (2:5 ratio). They ranged in age from 20 to 45 years (mean [SD], 31.93 [6.26] years). Thirty-seven patients (61.67%) were aged 31 to 45 years of age (Table 1). The mean (SD) duration of disease was 10.18 (2.10) months. The response to TA was recorded based on patient distribution according to the site of melasma as well as history of diabetes mellitus and hypertension.
Topical TA was found to be efficacious for melasma in 50 (83.33%) patients. The mean (SD) baseline and week 12 MASI scores were 23.15 (5.02) and 12.71 (4.10)(P<.0001), respectively (Table 2). The stratification of efficacy with respect to age, sex, duration of melasma, site of melasma, and history of diabetes mellitus or hypertension is shown in the eTable. The site of melasma was significant with respect to stratification of efficacy. On the forehead, TA was found to be efficacious in 11 patients and nonefficacious in 0 patients (P=.036). In the malar region, it was efficacious in 16 patients and nonefficacious in 1 patient (P=.036). Finally, on the chin, it was efficacious in 23 patients and nonefficacious in 9 patients (P=.036).
Comment
Melasma Presentation and Development—Melasma is a chronic skin condition that more often affects patients with darker skin types. This condition is characterized by hyperpigmentation of skin that is directly exposed to the sun, such as the cheek, nose, forehead, and above the upper lip.17 Although the mechanism behind how melasma develops is unknown, one theory suggests that UV light can lead to increased plasmin in keratinocytes.18 This increased plasmin will thereby increase the arachidonic acid and α-MSH, leading to the observed uneven hyperpigmentation that is notable in melasma. Melasma is common in patients using oral contraceptives or expired cosmetic drugs; in those who are pregnant; and in those with liver dysfunction.18 Melasma has a negative impact on patients’ quality of life because of substantial psychological and social distress. Thus, finding an accessible treatment is imperative.19
Melasma Management—The most common treatments for melasma have been topical bleaching agents and photoprotection. Combination therapy options include chemical peels, dermabrasion, and laser treatments, though they present with limited efficacy.17,20 Because melasma focuses on pigmentation correction, topical treatments work to disturb melanocyte pigment production at the enzymatic level.21 Tyrosinase is rate limiting in melanin production, as it converts L-tyrosinase to L-3,4-dihydroxyphenylalanine, using copper to interact with L-3,4-dihydroxyphenylalanine as a cofactor in the active site.22 Therefore, tyrosine is a major target for many drugs that have been developed for melasma to decrease melaninization.21
Recently, research has focused on the effects of topical, intradermal, and oral TA for melasma.17 Tranexamic acid most commonly has been used in medicine as a fibrinolytic agent because of its antiplasmin properties. It has been hypothesized that TA can inhibit the release of paracrine melanogenic factors that normally act to stimulate melanocytes.17 Although studies have supported the safety and efficacy of TA, there remains a lack of clinical studies that are sufficiently powered. No definitive consensus on the use of TA for melasma currently exists, which indicates the need for large-scale, randomized, controlled trials.23
One trial (N=25) found that TA solution 5% achieved efficacy (>50% reduction in MASI score from baseline) in 86.1% of patients with melasma.24 In another study (N=18), topical TA 5% achieved efficacy (>50% reduction in MASI score) in 86% of patients with melasma.25
Melasma Comorbidities—To determine if certain comorbidities, such as diabetes mellitus or hypertension, influenced the progression of melasma, we stratified the efficacy results for patients with these 2 comorbidities, which showed no significant difference (P=.794 and P=.101, respectively). Thus, the relatively higher prevalence of diabetes mellitus (16 patients) and hypertension (11 patients) did not contribute to the efficacy of TA in lowering MASI scores over the 12-week period, which supports the findings of Doolan and Gupta,26 who investigated the endocrinologic conditions associated with melasma and found no such association with diabetes mellitus or hypertension.
TA Formulations for Melasma—The efficacy of topical TA has been explored in several studies. Six studies with sample sizes of 13 to 50 patients each showed statistically significant differences in MASI scores between baseline and following TA treatment (P<.001).27-32 Several formulations and regimens were utilized, including TA cream 3% for 12 weeks, TA gel 5% for 12 weeks, TA solution 3% for 12 weeks, TA liposome 5% for 12 weeks, and TA solution 2% for 12 weeks.18 Additionally, these studies found TA to be effective in limiting dyschromia and decreasing MASI scores. There were no statistically significant differences between formulations and method of application. Topical TA has been found to be just as effective as other treatments for melasma, including intradermal TA injections, topical hydroquinone, and a combination of topical hydroquinone and dexamethasone.18
Further study of the efficacy of intradermal TA is necessary because many human trials have lacked statistical significance or a control group. Lee et al32 conducted a trial of 100 female patients who received weekly intradermal TA microinjections for 12 weeks. After 8 and 12 weeks, MASI scores decreased significantly (P<.01).32 Similarly, Badran et al33 observed 60 female patients in 3 trial groups: group A received TA (4 mg/mL) intradermal injections every 2 weeks, group B received TA (10 mg/mL) intradermal injections every 2 weeks, and group C received TA cream 10% twice daily. Although all groups showed improvement in MASI, group B, which had the highest intradermal TA concentration, exhibited the most improvement. Thus, it was determined that intradermal application led to better results, but the cream was still effective.33
Saki et al34 conducted a randomized, split-face trial of 37 patients comparing the efficacy of intradermal TA and topical hydroquinone. Each group was treated with either monthly intradermal TA injections or nightly hydroquinone for 3 months. After 4 weeks of treatment, TA initially had a greater improvement. However, after 20 weeks, the overall changes were not significant between the 2 groups.34 Pazyar et al35 conducted a randomized, split-face trial of 49 patients comparing the efficacy of intradermal TA and hydroquinone cream. After 24 weeks of biweekly TA injections or twice-daily hydroquinone, there were no statistically significant differences in the decreased MASI scores between treatments.35 Additional large, double-blind, controlled trials are needed to thoroughly assess the role of intradermal TA in comparison to its treatment counterpart of hydroquinone.
Ebrahimi and Naeini29 conducted a 12-week, double-blind, split-phase trial of 50 Iranian melasma patients, which showed that 27.3% of patients rated the improvement in melasma as excellent, 42.4% as good, and 30.3% as fair after using TA solution 3%. Wu et al36 also showed a total melasma improvement rate of 80.9% in 256 patients with long-term oral use of TA. In a study by Kim et al31 (N=245), the mean MASI score considerably decreased after topical TA use, with a total response rate of 95.6%. In another study, Atefi et al37 presented significantly increased levels of satisfaction in patients treated with topical TA 5% vs hydroquinone (P=.015).
Melasma in Patients With Darker Skin Types—Special attention must be given to choosing the appropriate medication in melasma patients with darker skin types, as there is an increased risk for postinflammatory hyperpigmentation. Currently, few randomized controlled trials exist that fulfill the criteria of evaluating pharmacologic options for patients with melasma, and even fewer studies solely focus on patients with darker skin types.38 In addition to treatment advances, patients must be educated on the need to avoid sun exposure when possible or to use photoprotection, especially in the South Asian region, where these practices rarely are taught. Our study provided a unique analysis regarding the efficacy of TA solution 5% for the treatment of melasma in patients of South Asian descent. Clinicians can use these findings as a foundation for treating all patients with melasma but particularly those with darker skin types.
Study Limitations—Our study consisted of 60 patients; although our study had more patients than similar trials, larger studies are needed. Additionally, other variables were excluded from our analysis, such as comorbidities beyond diabetes mellitus and hypertension.
Conclusion
This study contributes to the growing field of melasma therapeutics by evaluating the efficacy of using TA solution 5% for the treatment of melasma in South Asian patients with darker skin types. Clinicians may use our study to broaden their treatment options for a common condition while also addressing the lack of clinical options for patients with darker skin types. Further studies investigating the effectiveness of TA in large clinical trials in humans are warranted to understand the efficacy and the risk for any complications.
Melasma is a complex, long-lasting, acquired dermatologic pigmentation disorder resulting in grey-brown patches that last for more than 3 months. Sun-exposed areas including the nose, cheeks, forehead, and forearms are most likely to be affected.1 In Southeast Asia, 0.25% to 4% of the population affected by melasma is aged 30 to 40 years.2 In particular, melasma is a concern among pregnant women due to increased levels of melanocyte-stimulating hormones (MSHs) and is impacted by genetics, hormonal influence, and exposure to UV light.3,4 In Pakistan, approximately 46% of women are affected by melasma during pregnancy.2,5 Although few studies have focused on the clinical approaches to melasma in darker skin types, it continues to disproportionately affect the skin of color population.4
The areas of hyperpigmentation seen in melasma exhibit increased deposition of melanin in the epidermis and dermis, but melanocytes are not elevated. However, in areas of hyperpigmentation, the melanocytes are larger and more dendritic and demonstrate an increased level of melanogenesis.6 During pregnancy, especially in the third trimester, elevated levels of estrogen, progesterone, and MSH often are found in association with melasma.7 Tyrosinase (TYR) activity increases and cellular proliferation is reduced after treatment of melanocytes in culture with β-estradiol.8 Sex steroids increase transcription of genes encoding melanogenic enzymes in normal human melanocytes, especially TYR.9 These results are consistent with the notable increases in melanin synthesis and TYR activity reported for normal human melanocytes under similar conditions in culture.10 Because melanocytes contain both cytosolic and nuclear estrogen receptors, melanocytes in patients with melasma may be inherently more sensitive to the stimulatory effects of estrogens and possibly other steroid hormones.11
The current treatment options for melasma have varying levels of success and include topical depigmenting agents such as hydroquinone, tretinoin, azelaic acid, kojic acid, and corticosteroids; dermabrasion; and chemical peels.12-14 Chemical peels with glycolic acid, salicylic acid, lactic acid, trichloroacetic acid, and phenol, as well as laser therapy, are reliable management options.13,14 Traditionally, melasma has been treated with a combination of modalities along with photoprotection and trigger avoidance.12
The efficacy and safety of the available therapies for melasma are still controversial and require further exploration. In recent years, off-label tranexamic acid (TA) has emerged as a potential therapy for melasma. Although the mechanism of action remains unclear, TA may inhibit melanin synthesis by blocking the interaction between melanocytes and keratinocytes.15 Tranexamic acid also may reverse the abnormal dermal changes associated with melasma by inhibiting melanogenesis and angiogenesis.16
Although various therapeutic options exist for melasma, the search for a reliable option in patients with darker skin types continues.13 We sought to evaluate the efficacy of TA solution 5% in reducing the severity of melasma in South Asian patients, thereby improving patient outcomes and maximizing patient satisfaction. Topical TA is inexpensive and readily accessible and does not cause systemic side effects. These qualities make it a promising treatment compared to traditional therapies.
Methods
We conducted a randomized controlled trial at Rawalpindi Medical Institute (Punjab, Pakistan). The researchers obtained informed consent for all enrolled patients. Cases were sampled from the original patient population seen at the office using nonprobability consecutive sampling. The sample size was calculated with a 95% CI, margin of error of 9%, and expected percentage of efficacy of 86.1% by using TA solution 5%. South Asian male and female patients aged 20 to 45 years with melasma were included in the analysis. Patients were excluded if they were already taking TA, oral contraceptive pills, or photosensitizing drugs (eg, nonsteroidal anti-inflammatory drugs, tetracyclines, phenytoin, carbamazepine); were pregnant; had chronic kidney disease (creatinine >2.0 mg/dL); had cardiac abnormalities (abnormal electrocardiogram); had hematologic disorders (international normalized ratio >2); or had received another melasma treatment within the last 3 to 6 months.
All enrolled patients underwent a detailed history and physical examination. Patient demographics were subsequently noted, including age, sex, history of diabetes mellitus or hypertension, and duration of melasma. The melasma area and severity index (MASI) score of each patient was calculated at baseline, and a corresponding photograph was taken.
The topical solution was prepared with 5 g of TA dissolved in 10 cc of ethanol at 96 °F, 10 cc of 1,3-butanediol, and distilled water up to 100 cc. The TA solution was applied to the affected areas once daily by the patient for 12 weeks. Each application covered the affected areas completely. Patients were instructed to apply sunscreen with sun protection factor 60 to those same areas for UV protection after 15 minutes of TA application. Biweekly follow-ups were scheduled during the trial, and the MASI score was recorded at these visits. If the mean MASI score was reduced by half after 12 weeks of treatment, then the treatment was considered efficacious with a 95% CI.
The percentage reduction from baseline was calculated as follows: percentage reduction=(baseline score– follow-up score)/baseline score×100.
Statistical Analysis—Data were analyzed in SPSS Statistics 25 (IBM). The quantitative variables of age, duration of melasma, and body mass index were presented as mean (SD). Qualitative variables such as sex, history of diabetes mellitus or hypertension, site of melasma, and efficacy were presented as frequencies and percentages. Mean MASI scores at baseline and 12 weeks posttreatment were compared using a paired t test (P≤.05). Data were stratified for age, sex, history of diabetes mellitus or hypertension, site of melasma, and duration of melasma, and a χ2 test was applied to compare efficacy in stratified groups (P≤.05).
Results
Sixty patients were enrolled in the study. Of them, 17 (28.33%) were male, and 43 (71.67%) were female (2:5 ratio). They ranged in age from 20 to 45 years (mean [SD], 31.93 [6.26] years). Thirty-seven patients (61.67%) were aged 31 to 45 years of age (Table 1). The mean (SD) duration of disease was 10.18 (2.10) months. The response to TA was recorded based on patient distribution according to the site of melasma as well as history of diabetes mellitus and hypertension.
Topical TA was found to be efficacious for melasma in 50 (83.33%) patients. The mean (SD) baseline and week 12 MASI scores were 23.15 (5.02) and 12.71 (4.10)(P<.0001), respectively (Table 2). The stratification of efficacy with respect to age, sex, duration of melasma, site of melasma, and history of diabetes mellitus or hypertension is shown in the eTable. The site of melasma was significant with respect to stratification of efficacy. On the forehead, TA was found to be efficacious in 11 patients and nonefficacious in 0 patients (P=.036). In the malar region, it was efficacious in 16 patients and nonefficacious in 1 patient (P=.036). Finally, on the chin, it was efficacious in 23 patients and nonefficacious in 9 patients (P=.036).
Comment
Melasma Presentation and Development—Melasma is a chronic skin condition that more often affects patients with darker skin types. This condition is characterized by hyperpigmentation of skin that is directly exposed to the sun, such as the cheek, nose, forehead, and above the upper lip.17 Although the mechanism behind how melasma develops is unknown, one theory suggests that UV light can lead to increased plasmin in keratinocytes.18 This increased plasmin will thereby increase the arachidonic acid and α-MSH, leading to the observed uneven hyperpigmentation that is notable in melasma. Melasma is common in patients using oral contraceptives or expired cosmetic drugs; in those who are pregnant; and in those with liver dysfunction.18 Melasma has a negative impact on patients’ quality of life because of substantial psychological and social distress. Thus, finding an accessible treatment is imperative.19
Melasma Management—The most common treatments for melasma have been topical bleaching agents and photoprotection. Combination therapy options include chemical peels, dermabrasion, and laser treatments, though they present with limited efficacy.17,20 Because melasma focuses on pigmentation correction, topical treatments work to disturb melanocyte pigment production at the enzymatic level.21 Tyrosinase is rate limiting in melanin production, as it converts L-tyrosinase to L-3,4-dihydroxyphenylalanine, using copper to interact with L-3,4-dihydroxyphenylalanine as a cofactor in the active site.22 Therefore, tyrosine is a major target for many drugs that have been developed for melasma to decrease melaninization.21
Recently, research has focused on the effects of topical, intradermal, and oral TA for melasma.17 Tranexamic acid most commonly has been used in medicine as a fibrinolytic agent because of its antiplasmin properties. It has been hypothesized that TA can inhibit the release of paracrine melanogenic factors that normally act to stimulate melanocytes.17 Although studies have supported the safety and efficacy of TA, there remains a lack of clinical studies that are sufficiently powered. No definitive consensus on the use of TA for melasma currently exists, which indicates the need for large-scale, randomized, controlled trials.23
One trial (N=25) found that TA solution 5% achieved efficacy (>50% reduction in MASI score from baseline) in 86.1% of patients with melasma.24 In another study (N=18), topical TA 5% achieved efficacy (>50% reduction in MASI score) in 86% of patients with melasma.25
Melasma Comorbidities—To determine if certain comorbidities, such as diabetes mellitus or hypertension, influenced the progression of melasma, we stratified the efficacy results for patients with these 2 comorbidities, which showed no significant difference (P=.794 and P=.101, respectively). Thus, the relatively higher prevalence of diabetes mellitus (16 patients) and hypertension (11 patients) did not contribute to the efficacy of TA in lowering MASI scores over the 12-week period, which supports the findings of Doolan and Gupta,26 who investigated the endocrinologic conditions associated with melasma and found no such association with diabetes mellitus or hypertension.
TA Formulations for Melasma—The efficacy of topical TA has been explored in several studies. Six studies with sample sizes of 13 to 50 patients each showed statistically significant differences in MASI scores between baseline and following TA treatment (P<.001).27-32 Several formulations and regimens were utilized, including TA cream 3% for 12 weeks, TA gel 5% for 12 weeks, TA solution 3% for 12 weeks, TA liposome 5% for 12 weeks, and TA solution 2% for 12 weeks.18 Additionally, these studies found TA to be effective in limiting dyschromia and decreasing MASI scores. There were no statistically significant differences between formulations and method of application. Topical TA has been found to be just as effective as other treatments for melasma, including intradermal TA injections, topical hydroquinone, and a combination of topical hydroquinone and dexamethasone.18
Further study of the efficacy of intradermal TA is necessary because many human trials have lacked statistical significance or a control group. Lee et al32 conducted a trial of 100 female patients who received weekly intradermal TA microinjections for 12 weeks. After 8 and 12 weeks, MASI scores decreased significantly (P<.01).32 Similarly, Badran et al33 observed 60 female patients in 3 trial groups: group A received TA (4 mg/mL) intradermal injections every 2 weeks, group B received TA (10 mg/mL) intradermal injections every 2 weeks, and group C received TA cream 10% twice daily. Although all groups showed improvement in MASI, group B, which had the highest intradermal TA concentration, exhibited the most improvement. Thus, it was determined that intradermal application led to better results, but the cream was still effective.33
Saki et al34 conducted a randomized, split-face trial of 37 patients comparing the efficacy of intradermal TA and topical hydroquinone. Each group was treated with either monthly intradermal TA injections or nightly hydroquinone for 3 months. After 4 weeks of treatment, TA initially had a greater improvement. However, after 20 weeks, the overall changes were not significant between the 2 groups.34 Pazyar et al35 conducted a randomized, split-face trial of 49 patients comparing the efficacy of intradermal TA and hydroquinone cream. After 24 weeks of biweekly TA injections or twice-daily hydroquinone, there were no statistically significant differences in the decreased MASI scores between treatments.35 Additional large, double-blind, controlled trials are needed to thoroughly assess the role of intradermal TA in comparison to its treatment counterpart of hydroquinone.
Ebrahimi and Naeini29 conducted a 12-week, double-blind, split-phase trial of 50 Iranian melasma patients, which showed that 27.3% of patients rated the improvement in melasma as excellent, 42.4% as good, and 30.3% as fair after using TA solution 3%. Wu et al36 also showed a total melasma improvement rate of 80.9% in 256 patients with long-term oral use of TA. In a study by Kim et al31 (N=245), the mean MASI score considerably decreased after topical TA use, with a total response rate of 95.6%. In another study, Atefi et al37 presented significantly increased levels of satisfaction in patients treated with topical TA 5% vs hydroquinone (P=.015).
Melasma in Patients With Darker Skin Types—Special attention must be given to choosing the appropriate medication in melasma patients with darker skin types, as there is an increased risk for postinflammatory hyperpigmentation. Currently, few randomized controlled trials exist that fulfill the criteria of evaluating pharmacologic options for patients with melasma, and even fewer studies solely focus on patients with darker skin types.38 In addition to treatment advances, patients must be educated on the need to avoid sun exposure when possible or to use photoprotection, especially in the South Asian region, where these practices rarely are taught. Our study provided a unique analysis regarding the efficacy of TA solution 5% for the treatment of melasma in patients of South Asian descent. Clinicians can use these findings as a foundation for treating all patients with melasma but particularly those with darker skin types.
Study Limitations—Our study consisted of 60 patients; although our study had more patients than similar trials, larger studies are needed. Additionally, other variables were excluded from our analysis, such as comorbidities beyond diabetes mellitus and hypertension.
Conclusion
This study contributes to the growing field of melasma therapeutics by evaluating the efficacy of using TA solution 5% for the treatment of melasma in South Asian patients with darker skin types. Clinicians may use our study to broaden their treatment options for a common condition while also addressing the lack of clinical options for patients with darker skin types. Further studies investigating the effectiveness of TA in large clinical trials in humans are warranted to understand the efficacy and the risk for any complications.
- Espósito ACC, Brianezi G, De Souza NP, et al. Exploratory study of epidermis, basement membrane zone, upper dermis alterations and Wnt pathway activation in melasma compared to adjacent and retroauricular skin. Ann Dermatol. 2020;32:101-108.
- Janney MS, Subramaniyan R, Dabas R, et al. A randomized controlled study comparing the efficacy of topical 5% tranexamic acid solution versus 3% hydroquinone cream in melasma. J Cutan Aesthet Surg. 2019;12:63-67.
- Chalermchai T, Rummaneethorn P. Effects of a fractional picosecond 1,064 nm laser for the treatment of dermal and mixed type melasmaJ Cosmet Laser Ther. 2018;20:134-139.
- Grimes PE, Ijaz S, Nashawati R, et al. New oral and topical approaches for the treatment of melasma. Int J Womens Dermatol. 2019;5:30-36.
- Handel AC, Miot LDB, Miot HA. Melasma: a clinical and epidemiological review. An Bras Dermatol. 2014;89:771-782.
- Barankin B, Silver SG, Carruthers A. The skin in pregnancy. J Cutan Med Surg. 2002;6:236-240.
- Grimes PE, Yamada N, Bhawan J. Light microscopic, immunohistochemical, and ultrastructural alterations in patients with melasma. Am J Dermatopathol. 2005;27:96-101.
- Smith AG, Shuster S, Thody AJ, et al. Chloasma, oral contraceptives, and plasma immunoreactive beta-melanocyte-stimulating hormone. J Invest Dermatol. 1977;68:169-170.
- Ranson M, Posen S, Mason RS. Human melanocytes as a target tissue for hormones: in vitro studies with 1 alpha-25, dihydroxyvitamin D3, alpha-melanocyte stimulating hormone, and beta-estradiol. J Invest Dermatol. 1988;91:593-598.
- Kippenberger S, Loitsch S, Solano F, et al. Quantification of tyrosinase, TRP-1, and Trp-2 transcripts in human melanocytes by reverse transcriptase-competitive multiplex PCR—regulation by steroid hormones. J Invest Dermatol. 1998;110:364-367.
- McLeod SD, Ranson M, Mason RS. Effects of estrogens on human melanocytes in vitro. J Steroid Biochem Mol Biol. 1994;49:9-14.
- Chalermchai T, Rummaneethorn P. Effects of a fractional picosecond 1,064 nm laser for the treatment of dermal and mixed type melasma. J Cosmet Laser Ther. 2018;20:134-139.
- Sheu SL. Treatment of melasma using tranexamic acid: what’s known and what’s next. Cutis. 2018;101:E7-E8.
- Tian B. The Asian problem of frequent laser toning for melasma. J Clin Aesthet Dermatol. 2017;10:40-42.
- Zhang L, Tan WQ, Fang QQ, et al. Tranexamic acid for adults with melasma: a systematic review and meta-analysis. Biomed Res Int. 2018;2018:1683414.
- Zhu JW, Ni YJ, Tong XY, et al. Tranexamic acid inhibits angiogenesis and melanogenesis in vitro by targeting VEGF receptors. Int J Med Sci. 2020;17:903-911.
- Colferai MMT, Miquelin GM, Steiner D. Evaluation of oral tranexamic acid in the treatment of melasma. J Cosmet Dermatol. 2019;18:1495-1501.
- Taraz M, Niknam S, Ehsani AH. Tranexamic acid in treatment of melasma: a comprehensive review of clinical studies. Dermatol Ther. 2017;30:19-26.
- Yalamanchili R, Shastry V, Betkerur J. Clinico-epidemiological study and quality of life assessment in melasma. Indian J Dermatol. 2015;60:519.
- Kim HJ, Moon SH, Cho SH, et al. Efficacy and safety of tranexamic acid in melasma: a meta-analysis and systematic review. Acta Derm Venereol. 2017;97:776-781.
- Kim YJ, Kim MJ, Kweon DK, et al. Quantification of hypopigmentation activity in vitro. J Vis Exp. 2019;145:20-25.
- Cardoso R, Valente R, Souza da Costa CH, et al. Analysis of kojic acid derivatives as competitive inhibitors of tyrosinase: a molecular modeling approach. Molecules. 2021;26:2875.
- Bala HR, Lee S, Wong C, et al. Oral tranexamic acid for the treatment of melasma: a review. Dermatol Surg. 2018;44:814-825.
- Khuraiya S, Kachhawa D, Chouhan B, et al. A comparative study of topical 5% tranexamic acid and triple combination therapy for the treatment of melasma in Indian population. Pigment International. 2019;6:18-23.
- Steiner D, Feola C, Bialeski N, et al. Study evaluating the efficacy of topical and injected tranexamic acid in treatment of melasma. Surg Cosmet Dermatol. 2009;1:174-177.
- Doolan B, Gupta M. Melasma. Aust J Gen Pract. 2021;50:880-885.
- Banihashemi M, Zabolinejad N, Jaafari MR, et al. Comparison of therapeutic effects of liposomal tranexamic acid and conventional hydroquinone on melasma. J Cosmet Dermatol. 2015;14:174-177.
- Chung JY, Lee JH, Lee JH. Topical tranexamic acid as an adjuvant treatment in melasma: side-by-side comparison clinical study. J Dermatolog Treat. 2016;27:373-377.
- Ebrahimi B, Naeini FF. Topical tranexamic acid as a promising treatment for melasma. J Res Med Sci. 2014;19:753-757.
- Kanechorn Na Ayuthaya P, Niumphradit N, Manosroi A, et al. Topical 5% tranexamic acid for the treatment of melasma in Asians: a double-blind randomized controlled clinical trial. J Cosmet Laser Ther. 2012;14:150-154.
- Kim SJ, Park JY, Shibata T, et al. Efficacy and possible mechanisms of topical tranexamic acid in melasma. Clin Exp Dermatol. 2016;41:480-485.
- Lee JH, Park JG, Lim SH, et al. Localized intradermal microinjection of tranexamic acid for treatment of melasma in Asian patients: a preliminary clinical trial. Dermatol Surg. 2006;32:626-631.
- Badran AY, Ali AU, Gomaa AS. Efficacy of topical versus intradermal injection of tranexamic acid in Egyptian melasma patients: a randomised clinical trial. Australas J Dermatol. 2021;62:E373-E379.
- Saki N, Darayesh M, Heiran A. Comparing the efficacy of topical hydroquinone 2% versus intradermal tranexamic acid microinjections in treating melasma: a split-face controlled trial. J Dermatolog Treat. 2018;29:405-410.
- Pazyar N, Yaghoobi R, Zeynalie M, et al. Comparison of the efficacy of intradermal injected tranexamic acid vs hydroquinone cream in the treatment of melasma. Clin Cosmet Investig Dermatol. 2019;12:115-122.
- Wu S, Shi H, Wu H, et al. Treatment of melasma with oral administration of tranexamic acid. Aesthetic Plast Surg. 2012;36:964-970.
- Atefi N, Dalvand B, Ghassemi M, et al. Therapeutic effects of topical tranexamic acid in comparison with hydroquinone in treatment of women with melasma. Dermatol Ther (Heidelb). 2017;7:417-424.
- Cestari T, Arellano I, Hexsel D, et al. Melasma in Latin America: options for therapy and treatment algorithm. J Eur Acad Dermatol Venereol. 2009;23:760-772.
- Espósito ACC, Brianezi G, De Souza NP, et al. Exploratory study of epidermis, basement membrane zone, upper dermis alterations and Wnt pathway activation in melasma compared to adjacent and retroauricular skin. Ann Dermatol. 2020;32:101-108.
- Janney MS, Subramaniyan R, Dabas R, et al. A randomized controlled study comparing the efficacy of topical 5% tranexamic acid solution versus 3% hydroquinone cream in melasma. J Cutan Aesthet Surg. 2019;12:63-67.
- Chalermchai T, Rummaneethorn P. Effects of a fractional picosecond 1,064 nm laser for the treatment of dermal and mixed type melasmaJ Cosmet Laser Ther. 2018;20:134-139.
- Grimes PE, Ijaz S, Nashawati R, et al. New oral and topical approaches for the treatment of melasma. Int J Womens Dermatol. 2019;5:30-36.
- Handel AC, Miot LDB, Miot HA. Melasma: a clinical and epidemiological review. An Bras Dermatol. 2014;89:771-782.
- Barankin B, Silver SG, Carruthers A. The skin in pregnancy. J Cutan Med Surg. 2002;6:236-240.
- Grimes PE, Yamada N, Bhawan J. Light microscopic, immunohistochemical, and ultrastructural alterations in patients with melasma. Am J Dermatopathol. 2005;27:96-101.
- Smith AG, Shuster S, Thody AJ, et al. Chloasma, oral contraceptives, and plasma immunoreactive beta-melanocyte-stimulating hormone. J Invest Dermatol. 1977;68:169-170.
- Ranson M, Posen S, Mason RS. Human melanocytes as a target tissue for hormones: in vitro studies with 1 alpha-25, dihydroxyvitamin D3, alpha-melanocyte stimulating hormone, and beta-estradiol. J Invest Dermatol. 1988;91:593-598.
- Kippenberger S, Loitsch S, Solano F, et al. Quantification of tyrosinase, TRP-1, and Trp-2 transcripts in human melanocytes by reverse transcriptase-competitive multiplex PCR—regulation by steroid hormones. J Invest Dermatol. 1998;110:364-367.
- McLeod SD, Ranson M, Mason RS. Effects of estrogens on human melanocytes in vitro. J Steroid Biochem Mol Biol. 1994;49:9-14.
- Chalermchai T, Rummaneethorn P. Effects of a fractional picosecond 1,064 nm laser for the treatment of dermal and mixed type melasma. J Cosmet Laser Ther. 2018;20:134-139.
- Sheu SL. Treatment of melasma using tranexamic acid: what’s known and what’s next. Cutis. 2018;101:E7-E8.
- Tian B. The Asian problem of frequent laser toning for melasma. J Clin Aesthet Dermatol. 2017;10:40-42.
- Zhang L, Tan WQ, Fang QQ, et al. Tranexamic acid for adults with melasma: a systematic review and meta-analysis. Biomed Res Int. 2018;2018:1683414.
- Zhu JW, Ni YJ, Tong XY, et al. Tranexamic acid inhibits angiogenesis and melanogenesis in vitro by targeting VEGF receptors. Int J Med Sci. 2020;17:903-911.
- Colferai MMT, Miquelin GM, Steiner D. Evaluation of oral tranexamic acid in the treatment of melasma. J Cosmet Dermatol. 2019;18:1495-1501.
- Taraz M, Niknam S, Ehsani AH. Tranexamic acid in treatment of melasma: a comprehensive review of clinical studies. Dermatol Ther. 2017;30:19-26.
- Yalamanchili R, Shastry V, Betkerur J. Clinico-epidemiological study and quality of life assessment in melasma. Indian J Dermatol. 2015;60:519.
- Kim HJ, Moon SH, Cho SH, et al. Efficacy and safety of tranexamic acid in melasma: a meta-analysis and systematic review. Acta Derm Venereol. 2017;97:776-781.
- Kim YJ, Kim MJ, Kweon DK, et al. Quantification of hypopigmentation activity in vitro. J Vis Exp. 2019;145:20-25.
- Cardoso R, Valente R, Souza da Costa CH, et al. Analysis of kojic acid derivatives as competitive inhibitors of tyrosinase: a molecular modeling approach. Molecules. 2021;26:2875.
- Bala HR, Lee S, Wong C, et al. Oral tranexamic acid for the treatment of melasma: a review. Dermatol Surg. 2018;44:814-825.
- Khuraiya S, Kachhawa D, Chouhan B, et al. A comparative study of topical 5% tranexamic acid and triple combination therapy for the treatment of melasma in Indian population. Pigment International. 2019;6:18-23.
- Steiner D, Feola C, Bialeski N, et al. Study evaluating the efficacy of topical and injected tranexamic acid in treatment of melasma. Surg Cosmet Dermatol. 2009;1:174-177.
- Doolan B, Gupta M. Melasma. Aust J Gen Pract. 2021;50:880-885.
- Banihashemi M, Zabolinejad N, Jaafari MR, et al. Comparison of therapeutic effects of liposomal tranexamic acid and conventional hydroquinone on melasma. J Cosmet Dermatol. 2015;14:174-177.
- Chung JY, Lee JH, Lee JH. Topical tranexamic acid as an adjuvant treatment in melasma: side-by-side comparison clinical study. J Dermatolog Treat. 2016;27:373-377.
- Ebrahimi B, Naeini FF. Topical tranexamic acid as a promising treatment for melasma. J Res Med Sci. 2014;19:753-757.
- Kanechorn Na Ayuthaya P, Niumphradit N, Manosroi A, et al. Topical 5% tranexamic acid for the treatment of melasma in Asians: a double-blind randomized controlled clinical trial. J Cosmet Laser Ther. 2012;14:150-154.
- Kim SJ, Park JY, Shibata T, et al. Efficacy and possible mechanisms of topical tranexamic acid in melasma. Clin Exp Dermatol. 2016;41:480-485.
- Lee JH, Park JG, Lim SH, et al. Localized intradermal microinjection of tranexamic acid for treatment of melasma in Asian patients: a preliminary clinical trial. Dermatol Surg. 2006;32:626-631.
- Badran AY, Ali AU, Gomaa AS. Efficacy of topical versus intradermal injection of tranexamic acid in Egyptian melasma patients: a randomised clinical trial. Australas J Dermatol. 2021;62:E373-E379.
- Saki N, Darayesh M, Heiran A. Comparing the efficacy of topical hydroquinone 2% versus intradermal tranexamic acid microinjections in treating melasma: a split-face controlled trial. J Dermatolog Treat. 2018;29:405-410.
- Pazyar N, Yaghoobi R, Zeynalie M, et al. Comparison of the efficacy of intradermal injected tranexamic acid vs hydroquinone cream in the treatment of melasma. Clin Cosmet Investig Dermatol. 2019;12:115-122.
- Wu S, Shi H, Wu H, et al. Treatment of melasma with oral administration of tranexamic acid. Aesthetic Plast Surg. 2012;36:964-970.
- Atefi N, Dalvand B, Ghassemi M, et al. Therapeutic effects of topical tranexamic acid in comparison with hydroquinone in treatment of women with melasma. Dermatol Ther (Heidelb). 2017;7:417-424.
- Cestari T, Arellano I, Hexsel D, et al. Melasma in Latin America: options for therapy and treatment algorithm. J Eur Acad Dermatol Venereol. 2009;23:760-772.
PRATICE POINTS
- Tranexamic acid (TA) solution 5% is an efficacious treatment for skin of color patients with melasma.
- Topical TA is a treatment alternative for patients who may not be able to tolerate oral TA.
- Our study revealed the greatest efficacy for TA solution 5% was seen on the forehead and malar region, with less efficacy on the chin.
Endocrine Mucin-Producing Sweat Gland Carcinoma and Primary Cutaneous Mucinous Carcinoma: A Case Series
Endocrine mucin-producing sweat gland carcinoma (EMPSGC) and
Methods
Following institutional review board approval, we conducted a retrospective, single-institution case series. We searched electronic medical records dating from 2000 to 2019 for tumors diagnosed as PCMC or extramammary Paget disease treated with MMS. We gathered demographic, clinical, pathologic, and follow-up information from the electronic medical records for each case (Tables 1 and 2). Two dermatopathologists (B.P. and B.F.K.) reviewed the hematoxylin and eosin–stained slides of each tumor as well as all available immunohistochemical stains. One of the reviewers (B.F.K.) is a board-certified dermatologist, dermatopathologist, and fellowship-trained Mohs surgeon.
Results
Demographic and Clinical Information—We identified 2 cases of EMPSGC and 3 cases of PCMC diagnosed and treated at our institution; 4 of these cases had been treated within the last 2 years. One had been treated 18 years prior; case information was limited due to planned institutional record destruction. Three of the patients were female and 2 were male. The mean age at presentation was 71 years (range, 62–87 years). None had experienced recurrence or metastases after a mean follow-up of 30 months.
Case 1—A 68-year-old woman noted a slow-growing, flesh-colored papule measuring 12×10 mm on the right lower eyelid. An excisional biopsy was completed with 2-mm clinical margins, and the defect was closed in a linear fashion. Histologic sections demonstrated EMPSGC with uninvolved margins. The patient desired no further intervention and was clinically followed. Magnetic resonance imaging (MRI) of the head and neck found no evidence of metastasis. She has had no recurrence after 15 months.
Case 2—A 62-year-old man presented with a 7×5-mm, flesh-colored papule on the left lower eyelid margin (Figure 1). It was previously treated conservatively as a hordeolum but was biopsied after it failed to resolve with 3-mm margins. Histopathology demonstrated an EMPSGC (Figure 2). The lesion was treated with modified MMS with permanent en face section processing and cleared after 1 stage. Computed tomography of the head and neck showed no abnormalities. He has had no recurrence after 9 months.
Case 3—A 72-year-old man presented with a nontender papule near the right lateral canthus. A punch biopsy demonstrated PCMC. He was treated via modified MMS with permanent en face section processing. The tumor was cleared in 1 stage. He showed no evidence of recurrence after 112 months and died of unrelated causes. The rest of his clinical information was limited because of planned institutional destruction of records.
Case 4—An 87-year-old woman presented with a 25×25-mm, slow-growing mass of 12 months’ duration on the left lower abdomen (Figure 3). A biopsy demonstrated PCMC (Figure 4). Because of the size of the lesion, she underwent WLE with 20- to 30-mm margins by a general surgeon under general anesthesia. Positron emission tomography/computed tomography was unremarkable. She has remained disease free for 11 months.
Case 5—A 66-year-old woman presented for evaluation of a posterior scalp mass measuring 23×18 mm that had grown over the last 24 months. Biopsy showed mucinous carcinoma with lymphovascular invasion consistent with PCMC (Figure 5) confirmed on multiple tissue levels and with the aid of immunohistochemistry. She was sent for an MRI of the head, neck, chest, abdomen, and pelvis, which demonstrated 2 enlarged postauricular lymph nodes and raised suspicion for metastatic disease vs reactive lymphadenopathy. Mohs micrographic surgery with frozen sections was performed with 1- to 3-mm margins; the final layer was sent for permanent processing and confirmed negative margins. Sentinel lymph node biopsy and lymphadenectomy of the 2 nodes present on imaging showed no evidence of metastasis. The patient had no recurrence in 1 month.
Comment
Endocrine mucin-producing sweat gland carcinoma and PCMC are sweat gland malignancies that carry low metastatic potential but are locally aggressive. Endocrine mucin-producing sweat gland carcinoma has a strong predilection for the periorbital region, especially the lower eyelids of older women.3 Primary cutaneous mucinous carcinoma may arise on the eyelids, scalp, axillae, and trunk and has been reported more often in older men. These slow-growing tumors appear as nonspecific nodules.3 Lesions frequently are asymptomatic but rarely may cause pruritus and bleeding. Histologically, EMPSGC appears as solid or cystic nodules of cells with a papillary, cribriform, or pseudopapillary appearance. Intracellular or extracellular mucin as well as malignant spread of tumor cells along pre-existing ductlike structures make it difficult to histologically distinguish EMPSGC from ductal carcinoma in situ.3
A key histopathologic feature of PCMC is basophilic epithelioid cell nests in mucinous lakes.4 Rosettelike structures are seen within solid areas of the tumor. Fibrous septae separate individual collections of mucin, creating a lobulated appearance. The histopathologic differential diagnosis of EMPSGC and PCMC is broad, including basal cell carcinoma, hidradenoma, hidradenocarcinoma, apocrine adenoma, and dermal duct tumor. Positive expression of at least 1 neuroendocrine marker (ie, synaptophysin, neuron-specific enolase, chromogranin) and low-molecular cytokeratin (cytokeratin 7, CAM5.2, Ber-EP4) can aid in the diagnosis of both EMPSGC and PCMC.4 The use of p63 immunostaining is beneficial in delineating adnexal neoplasms. Adnexal tumors that stain positively with p63 are more likely to be of primary cutaneous origin, whereas lack of p63 staining usually denotes a secondary metastatic process. However, p63 staining is less reliable when distinguishing primary and metastatic mucinous neoplasms. Metastatic mucinous carcinomas often stain positive with p63, while PCMC usually stains negative despite its primary cutaneous origin, decreasing the clinical utility of p63. The tumor may be identical to metastatic mucinous adenocarcinoma of the breast, gastrointestinal tract, lung, ovary, and pancreas. Tumor islands floating in mucin are identified in both primary cutaneous and metastatic disease to the skin.3,6 Areas of tumor necrosis, notable atypia, and perineural or lymphovascular invasion are infrequently reported in EMPSGC or PCMC, though lymphatic invasion was identified in case 5 presented herein.
A metastatic workup is warranted in all cases of PCMC, including a thorough history, review of systems, breast examination, and imaging. A workup may be considered in cases of EMPSGC depending on histologic features or clinical history.
There is uncertainty regarding the optimal management of these slow-growing yet locally destructive tumors.5 The incidence of local recurrence of PCMC after WLE with narrow margins of at least 1 cm can be as high as 30% to 40%, especially on the eyelid.4 There is no consensus on surgical care for either of these tumors.5 Because of the high recurrence rate and the predilection for the eyelid and face, MMS provides an excellent alternative to WLE for tissue preservation and meticulous margin control. We advocate for the use of the Mohs technique with permanent sectioning, which may delay the repair, but reviewing tissue with permanent fixation improves the quality and accuracy of the margin evaluation because these tumors often are infiltrative and difficult to delineate under frozen section processing. Permanent en face sectioning allows the laboratory to utilize the full array of immunohistochemical stains for these tumors, providing accurate and timely results.
Limitations to our retrospective uncontrolled study include missing or incomplete data points and short follow-up time. Additionally, there was no standardization to the margins removed with MMS or WLE because of the limited available data that comment on appropriate margins.
- Held L, Ruetten A, Kutzner H, et al. Endocrine mucin‐producing sweat gland carcinoma: clinicopathologic, immunohistochemical and molecular analysis of 11 cases with emphasis on MYB immunoexpression. J Cutan Pathol. 2018;45:674-680.
- Navrazhina K, Petukhova T, Wildman HF, et al. Endocrine mucin-producing sweat gland carcinoma of the scalp treated with Mohs micrographic surgery. JAAD Case Rep. 2018;4:887-889.
- Scott BL, Anyanwu CO, Vandergriff T, et al. Endocrine mucin–producing sweat gland carcinoma treated with Mohs micrographic surgery. Dermatol Surg. 2017;43:1498-1500.
- Chang S, Shim SH, Joo M, et al. A case of endocrine mucin-producing sweat gland carcinoma co-existing with mucinous carcinoma: a case report. Korean J Pathol. 2010;44:97-100.
- Kamalpour L, Brindise RT, Nodzenski M, et al. Primary cutaneous mucinous carcinoma: a systematic review and meta-analysis of outcomes after surgery. JAMA Dermatol. 2014;150:380-384.
- Bulliard C, Murali R, Maloof A, et al. Endocrine mucin‐producing sweat gland carcinoma: report of a case and review of the literature. J Cutan Pathol. 2006;33:812-816.
Endocrine mucin-producing sweat gland carcinoma (EMPSGC) and
Methods
Following institutional review board approval, we conducted a retrospective, single-institution case series. We searched electronic medical records dating from 2000 to 2019 for tumors diagnosed as PCMC or extramammary Paget disease treated with MMS. We gathered demographic, clinical, pathologic, and follow-up information from the electronic medical records for each case (Tables 1 and 2). Two dermatopathologists (B.P. and B.F.K.) reviewed the hematoxylin and eosin–stained slides of each tumor as well as all available immunohistochemical stains. One of the reviewers (B.F.K.) is a board-certified dermatologist, dermatopathologist, and fellowship-trained Mohs surgeon.
Results
Demographic and Clinical Information—We identified 2 cases of EMPSGC and 3 cases of PCMC diagnosed and treated at our institution; 4 of these cases had been treated within the last 2 years. One had been treated 18 years prior; case information was limited due to planned institutional record destruction. Three of the patients were female and 2 were male. The mean age at presentation was 71 years (range, 62–87 years). None had experienced recurrence or metastases after a mean follow-up of 30 months.
Case 1—A 68-year-old woman noted a slow-growing, flesh-colored papule measuring 12×10 mm on the right lower eyelid. An excisional biopsy was completed with 2-mm clinical margins, and the defect was closed in a linear fashion. Histologic sections demonstrated EMPSGC with uninvolved margins. The patient desired no further intervention and was clinically followed. Magnetic resonance imaging (MRI) of the head and neck found no evidence of metastasis. She has had no recurrence after 15 months.
Case 2—A 62-year-old man presented with a 7×5-mm, flesh-colored papule on the left lower eyelid margin (Figure 1). It was previously treated conservatively as a hordeolum but was biopsied after it failed to resolve with 3-mm margins. Histopathology demonstrated an EMPSGC (Figure 2). The lesion was treated with modified MMS with permanent en face section processing and cleared after 1 stage. Computed tomography of the head and neck showed no abnormalities. He has had no recurrence after 9 months.
Case 3—A 72-year-old man presented with a nontender papule near the right lateral canthus. A punch biopsy demonstrated PCMC. He was treated via modified MMS with permanent en face section processing. The tumor was cleared in 1 stage. He showed no evidence of recurrence after 112 months and died of unrelated causes. The rest of his clinical information was limited because of planned institutional destruction of records.
Case 4—An 87-year-old woman presented with a 25×25-mm, slow-growing mass of 12 months’ duration on the left lower abdomen (Figure 3). A biopsy demonstrated PCMC (Figure 4). Because of the size of the lesion, she underwent WLE with 20- to 30-mm margins by a general surgeon under general anesthesia. Positron emission tomography/computed tomography was unremarkable. She has remained disease free for 11 months.
Case 5—A 66-year-old woman presented for evaluation of a posterior scalp mass measuring 23×18 mm that had grown over the last 24 months. Biopsy showed mucinous carcinoma with lymphovascular invasion consistent with PCMC (Figure 5) confirmed on multiple tissue levels and with the aid of immunohistochemistry. She was sent for an MRI of the head, neck, chest, abdomen, and pelvis, which demonstrated 2 enlarged postauricular lymph nodes and raised suspicion for metastatic disease vs reactive lymphadenopathy. Mohs micrographic surgery with frozen sections was performed with 1- to 3-mm margins; the final layer was sent for permanent processing and confirmed negative margins. Sentinel lymph node biopsy and lymphadenectomy of the 2 nodes present on imaging showed no evidence of metastasis. The patient had no recurrence in 1 month.
Comment
Endocrine mucin-producing sweat gland carcinoma and PCMC are sweat gland malignancies that carry low metastatic potential but are locally aggressive. Endocrine mucin-producing sweat gland carcinoma has a strong predilection for the periorbital region, especially the lower eyelids of older women.3 Primary cutaneous mucinous carcinoma may arise on the eyelids, scalp, axillae, and trunk and has been reported more often in older men. These slow-growing tumors appear as nonspecific nodules.3 Lesions frequently are asymptomatic but rarely may cause pruritus and bleeding. Histologically, EMPSGC appears as solid or cystic nodules of cells with a papillary, cribriform, or pseudopapillary appearance. Intracellular or extracellular mucin as well as malignant spread of tumor cells along pre-existing ductlike structures make it difficult to histologically distinguish EMPSGC from ductal carcinoma in situ.3
A key histopathologic feature of PCMC is basophilic epithelioid cell nests in mucinous lakes.4 Rosettelike structures are seen within solid areas of the tumor. Fibrous septae separate individual collections of mucin, creating a lobulated appearance. The histopathologic differential diagnosis of EMPSGC and PCMC is broad, including basal cell carcinoma, hidradenoma, hidradenocarcinoma, apocrine adenoma, and dermal duct tumor. Positive expression of at least 1 neuroendocrine marker (ie, synaptophysin, neuron-specific enolase, chromogranin) and low-molecular cytokeratin (cytokeratin 7, CAM5.2, Ber-EP4) can aid in the diagnosis of both EMPSGC and PCMC.4 The use of p63 immunostaining is beneficial in delineating adnexal neoplasms. Adnexal tumors that stain positively with p63 are more likely to be of primary cutaneous origin, whereas lack of p63 staining usually denotes a secondary metastatic process. However, p63 staining is less reliable when distinguishing primary and metastatic mucinous neoplasms. Metastatic mucinous carcinomas often stain positive with p63, while PCMC usually stains negative despite its primary cutaneous origin, decreasing the clinical utility of p63. The tumor may be identical to metastatic mucinous adenocarcinoma of the breast, gastrointestinal tract, lung, ovary, and pancreas. Tumor islands floating in mucin are identified in both primary cutaneous and metastatic disease to the skin.3,6 Areas of tumor necrosis, notable atypia, and perineural or lymphovascular invasion are infrequently reported in EMPSGC or PCMC, though lymphatic invasion was identified in case 5 presented herein.
A metastatic workup is warranted in all cases of PCMC, including a thorough history, review of systems, breast examination, and imaging. A workup may be considered in cases of EMPSGC depending on histologic features or clinical history.
There is uncertainty regarding the optimal management of these slow-growing yet locally destructive tumors.5 The incidence of local recurrence of PCMC after WLE with narrow margins of at least 1 cm can be as high as 30% to 40%, especially on the eyelid.4 There is no consensus on surgical care for either of these tumors.5 Because of the high recurrence rate and the predilection for the eyelid and face, MMS provides an excellent alternative to WLE for tissue preservation and meticulous margin control. We advocate for the use of the Mohs technique with permanent sectioning, which may delay the repair, but reviewing tissue with permanent fixation improves the quality and accuracy of the margin evaluation because these tumors often are infiltrative and difficult to delineate under frozen section processing. Permanent en face sectioning allows the laboratory to utilize the full array of immunohistochemical stains for these tumors, providing accurate and timely results.
Limitations to our retrospective uncontrolled study include missing or incomplete data points and short follow-up time. Additionally, there was no standardization to the margins removed with MMS or WLE because of the limited available data that comment on appropriate margins.
Endocrine mucin-producing sweat gland carcinoma (EMPSGC) and
Methods
Following institutional review board approval, we conducted a retrospective, single-institution case series. We searched electronic medical records dating from 2000 to 2019 for tumors diagnosed as PCMC or extramammary Paget disease treated with MMS. We gathered demographic, clinical, pathologic, and follow-up information from the electronic medical records for each case (Tables 1 and 2). Two dermatopathologists (B.P. and B.F.K.) reviewed the hematoxylin and eosin–stained slides of each tumor as well as all available immunohistochemical stains. One of the reviewers (B.F.K.) is a board-certified dermatologist, dermatopathologist, and fellowship-trained Mohs surgeon.
Results
Demographic and Clinical Information—We identified 2 cases of EMPSGC and 3 cases of PCMC diagnosed and treated at our institution; 4 of these cases had been treated within the last 2 years. One had been treated 18 years prior; case information was limited due to planned institutional record destruction. Three of the patients were female and 2 were male. The mean age at presentation was 71 years (range, 62–87 years). None had experienced recurrence or metastases after a mean follow-up of 30 months.
Case 1—A 68-year-old woman noted a slow-growing, flesh-colored papule measuring 12×10 mm on the right lower eyelid. An excisional biopsy was completed with 2-mm clinical margins, and the defect was closed in a linear fashion. Histologic sections demonstrated EMPSGC with uninvolved margins. The patient desired no further intervention and was clinically followed. Magnetic resonance imaging (MRI) of the head and neck found no evidence of metastasis. She has had no recurrence after 15 months.
Case 2—A 62-year-old man presented with a 7×5-mm, flesh-colored papule on the left lower eyelid margin (Figure 1). It was previously treated conservatively as a hordeolum but was biopsied after it failed to resolve with 3-mm margins. Histopathology demonstrated an EMPSGC (Figure 2). The lesion was treated with modified MMS with permanent en face section processing and cleared after 1 stage. Computed tomography of the head and neck showed no abnormalities. He has had no recurrence after 9 months.
Case 3—A 72-year-old man presented with a nontender papule near the right lateral canthus. A punch biopsy demonstrated PCMC. He was treated via modified MMS with permanent en face section processing. The tumor was cleared in 1 stage. He showed no evidence of recurrence after 112 months and died of unrelated causes. The rest of his clinical information was limited because of planned institutional destruction of records.
Case 4—An 87-year-old woman presented with a 25×25-mm, slow-growing mass of 12 months’ duration on the left lower abdomen (Figure 3). A biopsy demonstrated PCMC (Figure 4). Because of the size of the lesion, she underwent WLE with 20- to 30-mm margins by a general surgeon under general anesthesia. Positron emission tomography/computed tomography was unremarkable. She has remained disease free for 11 months.
Case 5—A 66-year-old woman presented for evaluation of a posterior scalp mass measuring 23×18 mm that had grown over the last 24 months. Biopsy showed mucinous carcinoma with lymphovascular invasion consistent with PCMC (Figure 5) confirmed on multiple tissue levels and with the aid of immunohistochemistry. She was sent for an MRI of the head, neck, chest, abdomen, and pelvis, which demonstrated 2 enlarged postauricular lymph nodes and raised suspicion for metastatic disease vs reactive lymphadenopathy. Mohs micrographic surgery with frozen sections was performed with 1- to 3-mm margins; the final layer was sent for permanent processing and confirmed negative margins. Sentinel lymph node biopsy and lymphadenectomy of the 2 nodes present on imaging showed no evidence of metastasis. The patient had no recurrence in 1 month.
Comment
Endocrine mucin-producing sweat gland carcinoma and PCMC are sweat gland malignancies that carry low metastatic potential but are locally aggressive. Endocrine mucin-producing sweat gland carcinoma has a strong predilection for the periorbital region, especially the lower eyelids of older women.3 Primary cutaneous mucinous carcinoma may arise on the eyelids, scalp, axillae, and trunk and has been reported more often in older men. These slow-growing tumors appear as nonspecific nodules.3 Lesions frequently are asymptomatic but rarely may cause pruritus and bleeding. Histologically, EMPSGC appears as solid or cystic nodules of cells with a papillary, cribriform, or pseudopapillary appearance. Intracellular or extracellular mucin as well as malignant spread of tumor cells along pre-existing ductlike structures make it difficult to histologically distinguish EMPSGC from ductal carcinoma in situ.3
A key histopathologic feature of PCMC is basophilic epithelioid cell nests in mucinous lakes.4 Rosettelike structures are seen within solid areas of the tumor. Fibrous septae separate individual collections of mucin, creating a lobulated appearance. The histopathologic differential diagnosis of EMPSGC and PCMC is broad, including basal cell carcinoma, hidradenoma, hidradenocarcinoma, apocrine adenoma, and dermal duct tumor. Positive expression of at least 1 neuroendocrine marker (ie, synaptophysin, neuron-specific enolase, chromogranin) and low-molecular cytokeratin (cytokeratin 7, CAM5.2, Ber-EP4) can aid in the diagnosis of both EMPSGC and PCMC.4 The use of p63 immunostaining is beneficial in delineating adnexal neoplasms. Adnexal tumors that stain positively with p63 are more likely to be of primary cutaneous origin, whereas lack of p63 staining usually denotes a secondary metastatic process. However, p63 staining is less reliable when distinguishing primary and metastatic mucinous neoplasms. Metastatic mucinous carcinomas often stain positive with p63, while PCMC usually stains negative despite its primary cutaneous origin, decreasing the clinical utility of p63. The tumor may be identical to metastatic mucinous adenocarcinoma of the breast, gastrointestinal tract, lung, ovary, and pancreas. Tumor islands floating in mucin are identified in both primary cutaneous and metastatic disease to the skin.3,6 Areas of tumor necrosis, notable atypia, and perineural or lymphovascular invasion are infrequently reported in EMPSGC or PCMC, though lymphatic invasion was identified in case 5 presented herein.
A metastatic workup is warranted in all cases of PCMC, including a thorough history, review of systems, breast examination, and imaging. A workup may be considered in cases of EMPSGC depending on histologic features or clinical history.
There is uncertainty regarding the optimal management of these slow-growing yet locally destructive tumors.5 The incidence of local recurrence of PCMC after WLE with narrow margins of at least 1 cm can be as high as 30% to 40%, especially on the eyelid.4 There is no consensus on surgical care for either of these tumors.5 Because of the high recurrence rate and the predilection for the eyelid and face, MMS provides an excellent alternative to WLE for tissue preservation and meticulous margin control. We advocate for the use of the Mohs technique with permanent sectioning, which may delay the repair, but reviewing tissue with permanent fixation improves the quality and accuracy of the margin evaluation because these tumors often are infiltrative and difficult to delineate under frozen section processing. Permanent en face sectioning allows the laboratory to utilize the full array of immunohistochemical stains for these tumors, providing accurate and timely results.
Limitations to our retrospective uncontrolled study include missing or incomplete data points and short follow-up time. Additionally, there was no standardization to the margins removed with MMS or WLE because of the limited available data that comment on appropriate margins.
- Held L, Ruetten A, Kutzner H, et al. Endocrine mucin‐producing sweat gland carcinoma: clinicopathologic, immunohistochemical and molecular analysis of 11 cases with emphasis on MYB immunoexpression. J Cutan Pathol. 2018;45:674-680.
- Navrazhina K, Petukhova T, Wildman HF, et al. Endocrine mucin-producing sweat gland carcinoma of the scalp treated with Mohs micrographic surgery. JAAD Case Rep. 2018;4:887-889.
- Scott BL, Anyanwu CO, Vandergriff T, et al. Endocrine mucin–producing sweat gland carcinoma treated with Mohs micrographic surgery. Dermatol Surg. 2017;43:1498-1500.
- Chang S, Shim SH, Joo M, et al. A case of endocrine mucin-producing sweat gland carcinoma co-existing with mucinous carcinoma: a case report. Korean J Pathol. 2010;44:97-100.
- Kamalpour L, Brindise RT, Nodzenski M, et al. Primary cutaneous mucinous carcinoma: a systematic review and meta-analysis of outcomes after surgery. JAMA Dermatol. 2014;150:380-384.
- Bulliard C, Murali R, Maloof A, et al. Endocrine mucin‐producing sweat gland carcinoma: report of a case and review of the literature. J Cutan Pathol. 2006;33:812-816.
- Held L, Ruetten A, Kutzner H, et al. Endocrine mucin‐producing sweat gland carcinoma: clinicopathologic, immunohistochemical and molecular analysis of 11 cases with emphasis on MYB immunoexpression. J Cutan Pathol. 2018;45:674-680.
- Navrazhina K, Petukhova T, Wildman HF, et al. Endocrine mucin-producing sweat gland carcinoma of the scalp treated with Mohs micrographic surgery. JAAD Case Rep. 2018;4:887-889.
- Scott BL, Anyanwu CO, Vandergriff T, et al. Endocrine mucin–producing sweat gland carcinoma treated with Mohs micrographic surgery. Dermatol Surg. 2017;43:1498-1500.
- Chang S, Shim SH, Joo M, et al. A case of endocrine mucin-producing sweat gland carcinoma co-existing with mucinous carcinoma: a case report. Korean J Pathol. 2010;44:97-100.
- Kamalpour L, Brindise RT, Nodzenski M, et al. Primary cutaneous mucinous carcinoma: a systematic review and meta-analysis of outcomes after surgery. JAMA Dermatol. 2014;150:380-384.
- Bulliard C, Murali R, Maloof A, et al. Endocrine mucin‐producing sweat gland carcinoma: report of a case and review of the literature. J Cutan Pathol. 2006;33:812-816.
Practice Points
- Endocrine mucin-producing sweat gland carcinoma and primary cutaneous mucinous carcinoma are rare low-grade neoplasms thought to arise from apocrine glands that are morphologically and immunohistochemically analogous to ductal carcinoma in situ and mucinous carcinoma of the breast, respectively.
- Management involves a metastatic workup and either wide local excision with margins greater than 5 mm or Mohs micrographic surgery in anatomically sensitive areas.
Rifampin for Prosthetic Joint Infections: Lessons Learned Over 20 Years at a VA Medical Center
Orthopedic implants are frequently used to repair fractures and replace joints. The number of total joint replacements is high, with > 1 million total hip (THA) and total knee (TKA) arthroplasties performed in the United States each year.1 While most joint arthroplasties are successful and significantly improve patient quality of life, a small proportion become infected.2 Prosthetic joint infection (PJI) causes substantial morbidity and mortality, particularly among older patients, and is difficult and costly to treat.3
The historic gold standard treatment for PJI is a 2-stage replacement, wherein the prosthesis is removed in one procedure and a new prosthesis is implanted in a second procedure after an extended course of antibiotics. This approach requires the patient to undergo 2 major procedures and spend considerable time without a functioning prosthesis, contributing to immobility and deconditioning. This option is difficult for frail or older patients and is associated with high medical costs.4
In 1998, a novel method of treatment known as debridement, antibiotics, and implant retention (DAIR) was evaluated in a small, randomized controlled trial.5 This study used a unique antimicrobial approach: the administration of ciprofloxacin plus either rifampin or placebo for 3 to 6 months, combined with a single surgical debridement. Eliminating a second surgical procedure and largely relying on oral antimicrobials reduces surgical risks and decreases costs.4 Current guidelines endorse DAIR with rifampin and a second antibiotic for patients diagnosed with PJI within about 30 days of prosthesis implantation who have a well-fixed implant without evidence of a sinus tract.6 Clinical trial data demonstrate that this approach is > 90% effective in patients with a well-fixed prosthesis and acute staphylococcal PJI.3,7
Thus far, clinical trials examining this approach have been small and did not include veterans who are typically older and have more comorbidities.8 The Minneapolis Veterans Affairs Health Care System (MVAHCS) infectious disease section has implemented the rifampin-based DAIR approach for orthopedic device-related infections since this approach was first described in 1998 but has not systematically evaluated its effectiveness or whether there are areas for improvement.
METHODS
We conducted a retrospective analysis of patients who underwent DAIR combined with a rifampin-containing regimen at the MVAHCS from January 1, 2001, through June 30, 2021. Inclusion required a diagnosis of orthopedic device-related infection and treatment with DAIR followed by antimicrobial therapy that included rifampin for 1 to 6 months. PJI was defined by meeting ≥ 1 of the following criteria: (1) isolation of the same microorganism from ≥ 2 cultures from joint aspirates or intraoperative tissue specimens; (2) purulence surrounding the prosthesis at the time of surgery; (3) acute inflammation consistent with infection on histopathological examination or periprosthetic tissue; or (4) presence of a sinus tract communicating with the prosthesis.
All cases of orthopedic device infection managed with DAIR and rifampin were included, regardless of implant stability, age of the implant at the time of symptom onset, presence of a sinus tract, or infecting microorganism. Exclusion criteria included patients who started or finished PJI treatment at another facility, were lost to follow-up, discontinued rifampin, died within 1 year of completing antibiotic therapy due to reasons unrelated to treatment failure, received rifampin for < 50% of their antimicrobial treatment course, had complete hardware removal, or had < 1 year between the completion of antimicrobial therapy and the time of data collection.
Management of DAIR procedures at the MVAHCS involves evaluating the fixation of the prosthesis, tissue sampling for microbiological analysis, and thorough debridement of infected tissue. Following debridement, a course of IV antibiotics is administered before initiating oral antibiotic therapy. To protect against resistance, rifampin is combined with another antibiotic typically from the fluoroquinolone, tetracycline, or cephalosporin class. Current guidelines suggest 3 and 6 months of oral antibiotics for prosthetic hip and knee infections, respectively.6
Treatment Outcomes
The primary outcome was treatment success, defined as meeting all of the following: (1) lack of clinical signs and symptoms of infection; (2) absence of radiological signs of loosening or infection within 1 year after the conclusion of treatment; and (3) absence of additional PJI treatment interventions for the prosthesis of concern within 1 year after completing the original antibiotic treatment.
Treatment failure was defined as meeting any of the following: (1) recurrence of PJI (original strain or different microorganism) within 1 year after the completion of antibiotic therapy; (2) death attributed to PJI anytime after the initial debridement; (3) removal of the prosthetic joint within 1 year after the completion of antibiotic therapy; or (4) long-term antibiotic use to suppress the PJI after the completion of the initial antibiotic therapy.
Statistical Analysis
Descriptive statistics were used to define the baseline characteristics of patients receiving rifampin therapy for orthopedic implant infections at the MVAHCS. Variables analyzed were age, sex, race and ethnicity, type of implant, age of implant, duration of symptoms, comorbidities (diabetes and rheumatoid arthritis), and presence of chronic infection. Patients were classified as having a chronic infection if they received previous infection treatment (antibiotics or surgery) for the orthopedic device in question. We created this category because patients with persistent infection after a medical or surgical attempt at treatment are likely to have a higher probability of treatment failure compared with those with no prior therapy. Charlson Comorbidity Index was calculated using clinical information present at the onset of infection.9 Fisher exact test was used to assess differences between categorical variables, and an independent t test was used to assess differences in continuous variables. P < .05 indicated statistical significance.
To assess the ability of a rifampin-based regimen to achieve a cure of PJI, we grouped participants into 2 categories: those with an intent to cure strategy and those without intent to cure based on documentation in the electronic health record (EHR). Participants who were prescribed rifampin with the documented goal of prosthesis retention with no further suppressive antibiotics were included in the intent-to-cure group, the primary focus of this study. Those excluded from the intent-to-cure group were given rifampin and another antibiotic, but there was a documented plan of either ongoing chronic suppression or eventual explantation; these participants were placed in the without-intent-to-cure group. Analysis of treatment success and failure was limited to the intent-to-cure group, whereas both groups were included for assessment of adverse effects (AEs) and treatment duration. This project was reviewed by the MVAHCS Institutional Review Board and determined to be a quality improvement initiative and to not meet the definition of research, and as such did not require review; it was reviewed and approved by the MVAHCS Research and Development Committee.
RESULTS
A total of 538 patients were identified who simultaneously received rifampin and another oral antibiotic between January 1, 2000, and June 30, 2021.
Forty-two participants (54%) had Staphylococcus aureus and 31 participants (40%) had coagulase-negative staphylococci infections, while 11 gram-negative organisms (14%) and 6 gram-positive anaerobic cocci (8%) infections were noted. Cutibacterium acnes and Streptococcus agalactiae were each found in 3 participants (4% of), and diphtheroids (not further identified) was found on 2 participants (3%). Candida albicans was identified in a single participant (1%), along with coagulase-negative staphylococci, and 2 participants (3%) had no identified organisms. There were multiple organisms isolated from 20 patients (26%).
Fifty participants had clear documentation in their EHR that cure of infection was the goal, meeting the criteria for the intent-to-cure group. The remaining 28 participants were placed in the without-intent-to-cure group. Success and failure rates were only measured in the intent-to-cure group, as by definition the without-intent-to-cure group patients would meet the criteria for failure (removal of prosthesis or long-term antibiotic use). The without-intent-to-cure group had a higher median age than the intent-to-cure group (69 years vs 64 years, P = .24) and a higher proportion of male participants (96% vs 80%, P = .09). The median (IQR) implant age of 11 months (1.0-50.5) in the without-intent-to-cure group was also higher than the median implant age of 1 month (0.6-22.0) in the primary group (P = .22). In the without-intent-to-cure group, 19 participants (68%) had a chronic infection, compared with 11 (22%) in the intent-to-cure group (P < .001).
The mean (SD) Charlson Comorbidity Index in the without-intent-to-cure group was 2.5 (1.3) compared with 1.9 (1.4) in the intent-to-cure group (P = .09). There was no significant difference in the type of implant or microbiology of the infecting organism between the 2 groups, although it should be noted that in the intent-to-cure group, 48 patients (96%) had Staphylococcus aureus or coagulase-negative staphylococci isolated.
The median (IQR) dosage of rifampin was 600 mg (300-900). The secondary oral antibiotics used most often were 36 fluoroquinolones (46%) followed by 20 tetracyclines (26%), 6 cephalosporins (8%), and 6 penicillins (8%). Additionally, 6 participants (8%) received IV vancomycin, and 1 participant (1%) was given an oral antifungal in addition to a fluoroquinolone because cultures revealed bacterial and fungal growth. The median (IQR) duration of antimicrobial therapy was 3 months (1.4-3.0). The mean (SD) duration of antimicrobial therapy was 3.6 (2.4) months for TKA infections and 2.4 (0.9) months for THA infections.
Clinical Outcome
Forty-one intent-to-cure group participants (82%) experienced treatment success. We further subdivided the intent-to-cure group by implant age. Participants whose implant was < 2 months old had a success rate of 93%, whereas patients whose implant was older had a success rate of 65% (P = .02).
Secondary Outcomes
The median (IQR) duration of antimicrobial treatment was 3 months (1.4-3.0) for the 38 patients with TKA-related infections and 3 months (1.4-6.0) for the 29 patients with THA infections. AEs were recorded in 24 (31%) of all study participants. Of those with AEs, the average number reported per patient was 1.6. Diarrhea, gastric upset, and nausea were each reported 7 times, accounting for 87% of all recorded AEs. Five participants reported having a rash while on antibiotics, and 2 experienced dysgeusia. One participant reported developing a yeast infection and another experienced vaginitis.
DISCUSSION
Among patients with orthopedic implant infections treated with intent to cure using a rifampin-containing antibiotic regimen at the MVAHCS, 82% had clinical success. Although this is lower than the success rates reported in clinical trials, this is not entirely unexpected.5,7 In most clinical trials studying DAIR and rifampin for PJI, patients are excluded if they do not have an acute staphylococcal infection in the setting of a well-fixed prosthesis without evidence of a sinus tract. Such exclusion criteria were not present in our retrospective study, which was designed to evaluate the real-world practice patterns at this facility. The population at the US Department of Veterans Affairs (VA) is older, more frail, and with more comorbid conditions than populations in prior studies. It is possible that patients with characteristics that would have caused them to be excluded from a clinical trial would be less likely to receive rifampin therapy with the intent to cure. This is suggested by the significantly higher prevalence of chronic infections (68%) in the without-intent-to-cure group compared with 22% in the intent-to-cure group. However, there were reasonably high proportions of participants included in the intent-to-cure group who did have conditions that would have led to their exclusion from prior trials, such as chronic infection (22%) and implant age ≥ 2 months (40%).
When evaluating participants by the age of their implant, treatment success rose to 93% for patients with implants < 2 months old compared with 65% for patients with older implants. This suggests that participants with a newer implant or more recent infection have a greater likelihood of successful treatment, which is consistent with the results of previous clinical trials.5,10 Considering how difficult multiple surgeries can be for older adult patients with comorbidities, we suggest that DAIR with a rifampin-containing regimen be considered as the primary treatment option for early PJIs at the MVAHCS. We also note inconsistent adherence to IDSA treatment guidelines on rifampin therapy, in that patients without intent to cure were prescribed a regimen including rifampin. This may reflect appropriate variability in the care of individual patients but may also offer an opportunity to change processes to improve care.
Limitations
Our analysis has limitations. As with any retrospective study evaluating the efficacy of a specific antibiotic, we were not able to attribute specific outcomes to the antibiotic of interest. Since the choice of antibiotics was left to the treating health care practitioner, therapy was not standardized, and because this was a retrospective study, causal relationships could not be inferred. Our analysis was also limited by the lack of intent to cure in 28 participants (36%), which could be an indication of practitioner bias in therapy selection or characteristic differences between the 2 groups. We looked for signs of infection failure 1 year after the completion of antimicrobial therapy, but longer follow-up could have led to higher rates of failure. Also, while participants’ infections were considered cured if they never sought further medical care for the infection at the MVAHCS, it is possible that patients could have sought care at another facility. We note that 9 patients were excluded because they were unable to complete a treatment course due to rifampin AEs, meaning that the success rates reported here reflect the success that may be expected if a patient can tolerate and complete a rifampin-based regimen. This study was conducted in a single VA hospital and may not be generalizable to nonveterans or veterans seeking care at other facilities.
Conclusions
DAIR followed by a short course of IV antibiotics and an oral regimen including rifampin and another antimicrobial is a reasonable option for veterans with acute staphylococcal orthopedic device infections at the MVAHCS. Patients with a well-placed prosthesis and an acute infection seem especially well suited for this treatment, and treatment with intent to cure should be pursued in patients who meet the criteria for rifampin therapy.
Acknowledgments
We thank Erik Stensgard, PharmD, for assistance in compiling the list of patients receiving rifampin and another antimicrobial.
1. Maradit Kremers H, Larson DR, Crowson CS, et al. Prevalence of total hip and knee replacement in the United States. J Bone Joint Surg Am. 2015;97(17):1386-1397. doi:10.2106/JBJS.N.01141
2. Kapadia BH, Berg RA, Daley JA, Fritz J, Bhave A, Mont MA. Periprosthetic joint infection. Lancet. 2016;387(10016):386-394. doi:10.1016/S0140-6736(14)61798-0
3. Zhan C, Kaczmarek R, Loyo-Berrios N, Sangl J, Bright RA. Incidence and short-term outcomes of primary and revision hip replacement in the United States. J Bone Joint Surg Am. 2007;89(3):526-533. doi:10.2106/JBJS.F.00952
4. Fisman DN, Reilly DT, Karchmer AW, Goldie SJ. Clinical effectiveness and cost-effectiveness of 2 management strategies for infected total hip arthroplasty in the elderly. Clin Infect Dis. 2001;32(3):419-430. doi:10.1086/318502
5. Zimmerli W, Widmer AF, Blatter M, Frei R, Ochsner PE. Role of rifampin for treatment of orthopedic implant-related staphylococcal infections: a randomized controlled trial. Foreign-Body Infection (FBI) Study Group. JAMA. 1998;279(19):1537-1541. doi:10.1001/jama.279.19.1537
6. Osmon DR, Berbari EF, Berendt AR, et al. Diagnosis and management of prosthetic joint infection: clinical practice guidelines by the Infectious Diseases Society of America. Clin Infect Dis. 2013;56(1):e1-e25. doi:10.1093/cid/cis803
7. Lora-Tamayo J, Euba G, Cobo J, et al. Short- versus long-duration levofloxacin plus rifampicin for acute staphylococcal prosthetic joint infection managed with implant retention: a randomised clinical trial. Int J Antimicrob Agents. 2016;48(3):310-316. doi:10.1016/j.ijantimicag.2016.05.021
8. Agha Z, Lofgren RP, VanRuiswyk JV, Layde PM. Are patients at Veterans Affairs medical centers sicker? A comparative analysis of health status and medical resource use. Arch Intern Med. 2000;160(21):3252-3257. doi:10.1001/archinte.160.21.3252
9. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383. doi:10.1016/0021-9681(87)90171-8
10. Vilchez F, Martínez-Pastor JC, García-Ramiro S, et al. Outcome and predictors of treatment failure in early post-surgical prosthetic joint infections due to Staphylococcus aureus treated with debridement. Clin Microbiol Infect. 2011;17(3):439-444. doi:10.1111/j.1469-0691.2010.03244.x
Orthopedic implants are frequently used to repair fractures and replace joints. The number of total joint replacements is high, with > 1 million total hip (THA) and total knee (TKA) arthroplasties performed in the United States each year.1 While most joint arthroplasties are successful and significantly improve patient quality of life, a small proportion become infected.2 Prosthetic joint infection (PJI) causes substantial morbidity and mortality, particularly among older patients, and is difficult and costly to treat.3
The historic gold standard treatment for PJI is a 2-stage replacement, wherein the prosthesis is removed in one procedure and a new prosthesis is implanted in a second procedure after an extended course of antibiotics. This approach requires the patient to undergo 2 major procedures and spend considerable time without a functioning prosthesis, contributing to immobility and deconditioning. This option is difficult for frail or older patients and is associated with high medical costs.4
In 1998, a novel method of treatment known as debridement, antibiotics, and implant retention (DAIR) was evaluated in a small, randomized controlled trial.5 This study used a unique antimicrobial approach: the administration of ciprofloxacin plus either rifampin or placebo for 3 to 6 months, combined with a single surgical debridement. Eliminating a second surgical procedure and largely relying on oral antimicrobials reduces surgical risks and decreases costs.4 Current guidelines endorse DAIR with rifampin and a second antibiotic for patients diagnosed with PJI within about 30 days of prosthesis implantation who have a well-fixed implant without evidence of a sinus tract.6 Clinical trial data demonstrate that this approach is > 90% effective in patients with a well-fixed prosthesis and acute staphylococcal PJI.3,7
Thus far, clinical trials examining this approach have been small and did not include veterans who are typically older and have more comorbidities.8 The Minneapolis Veterans Affairs Health Care System (MVAHCS) infectious disease section has implemented the rifampin-based DAIR approach for orthopedic device-related infections since this approach was first described in 1998 but has not systematically evaluated its effectiveness or whether there are areas for improvement.
METHODS
We conducted a retrospective analysis of patients who underwent DAIR combined with a rifampin-containing regimen at the MVAHCS from January 1, 2001, through June 30, 2021. Inclusion required a diagnosis of orthopedic device-related infection and treatment with DAIR followed by antimicrobial therapy that included rifampin for 1 to 6 months. PJI was defined by meeting ≥ 1 of the following criteria: (1) isolation of the same microorganism from ≥ 2 cultures from joint aspirates or intraoperative tissue specimens; (2) purulence surrounding the prosthesis at the time of surgery; (3) acute inflammation consistent with infection on histopathological examination or periprosthetic tissue; or (4) presence of a sinus tract communicating with the prosthesis.
All cases of orthopedic device infection managed with DAIR and rifampin were included, regardless of implant stability, age of the implant at the time of symptom onset, presence of a sinus tract, or infecting microorganism. Exclusion criteria included patients who started or finished PJI treatment at another facility, were lost to follow-up, discontinued rifampin, died within 1 year of completing antibiotic therapy due to reasons unrelated to treatment failure, received rifampin for < 50% of their antimicrobial treatment course, had complete hardware removal, or had < 1 year between the completion of antimicrobial therapy and the time of data collection.
Management of DAIR procedures at the MVAHCS involves evaluating the fixation of the prosthesis, tissue sampling for microbiological analysis, and thorough debridement of infected tissue. Following debridement, a course of IV antibiotics is administered before initiating oral antibiotic therapy. To protect against resistance, rifampin is combined with another antibiotic typically from the fluoroquinolone, tetracycline, or cephalosporin class. Current guidelines suggest 3 and 6 months of oral antibiotics for prosthetic hip and knee infections, respectively.6
Treatment Outcomes
The primary outcome was treatment success, defined as meeting all of the following: (1) lack of clinical signs and symptoms of infection; (2) absence of radiological signs of loosening or infection within 1 year after the conclusion of treatment; and (3) absence of additional PJI treatment interventions for the prosthesis of concern within 1 year after completing the original antibiotic treatment.
Treatment failure was defined as meeting any of the following: (1) recurrence of PJI (original strain or different microorganism) within 1 year after the completion of antibiotic therapy; (2) death attributed to PJI anytime after the initial debridement; (3) removal of the prosthetic joint within 1 year after the completion of antibiotic therapy; or (4) long-term antibiotic use to suppress the PJI after the completion of the initial antibiotic therapy.
Statistical Analysis
Descriptive statistics were used to define the baseline characteristics of patients receiving rifampin therapy for orthopedic implant infections at the MVAHCS. Variables analyzed were age, sex, race and ethnicity, type of implant, age of implant, duration of symptoms, comorbidities (diabetes and rheumatoid arthritis), and presence of chronic infection. Patients were classified as having a chronic infection if they received previous infection treatment (antibiotics or surgery) for the orthopedic device in question. We created this category because patients with persistent infection after a medical or surgical attempt at treatment are likely to have a higher probability of treatment failure compared with those with no prior therapy. Charlson Comorbidity Index was calculated using clinical information present at the onset of infection.9 Fisher exact test was used to assess differences between categorical variables, and an independent t test was used to assess differences in continuous variables. P < .05 indicated statistical significance.
To assess the ability of a rifampin-based regimen to achieve a cure of PJI, we grouped participants into 2 categories: those with an intent to cure strategy and those without intent to cure based on documentation in the electronic health record (EHR). Participants who were prescribed rifampin with the documented goal of prosthesis retention with no further suppressive antibiotics were included in the intent-to-cure group, the primary focus of this study. Those excluded from the intent-to-cure group were given rifampin and another antibiotic, but there was a documented plan of either ongoing chronic suppression or eventual explantation; these participants were placed in the without-intent-to-cure group. Analysis of treatment success and failure was limited to the intent-to-cure group, whereas both groups were included for assessment of adverse effects (AEs) and treatment duration. This project was reviewed by the MVAHCS Institutional Review Board and determined to be a quality improvement initiative and to not meet the definition of research, and as such did not require review; it was reviewed and approved by the MVAHCS Research and Development Committee.
RESULTS
A total of 538 patients were identified who simultaneously received rifampin and another oral antibiotic between January 1, 2000, and June 30, 2021.
Forty-two participants (54%) had Staphylococcus aureus and 31 participants (40%) had coagulase-negative staphylococci infections, while 11 gram-negative organisms (14%) and 6 gram-positive anaerobic cocci (8%) infections were noted. Cutibacterium acnes and Streptococcus agalactiae were each found in 3 participants (4% of), and diphtheroids (not further identified) was found on 2 participants (3%). Candida albicans was identified in a single participant (1%), along with coagulase-negative staphylococci, and 2 participants (3%) had no identified organisms. There were multiple organisms isolated from 20 patients (26%).
Fifty participants had clear documentation in their EHR that cure of infection was the goal, meeting the criteria for the intent-to-cure group. The remaining 28 participants were placed in the without-intent-to-cure group. Success and failure rates were only measured in the intent-to-cure group, as by definition the without-intent-to-cure group patients would meet the criteria for failure (removal of prosthesis or long-term antibiotic use). The without-intent-to-cure group had a higher median age than the intent-to-cure group (69 years vs 64 years, P = .24) and a higher proportion of male participants (96% vs 80%, P = .09). The median (IQR) implant age of 11 months (1.0-50.5) in the without-intent-to-cure group was also higher than the median implant age of 1 month (0.6-22.0) in the primary group (P = .22). In the without-intent-to-cure group, 19 participants (68%) had a chronic infection, compared with 11 (22%) in the intent-to-cure group (P < .001).
The mean (SD) Charlson Comorbidity Index in the without-intent-to-cure group was 2.5 (1.3) compared with 1.9 (1.4) in the intent-to-cure group (P = .09). There was no significant difference in the type of implant or microbiology of the infecting organism between the 2 groups, although it should be noted that in the intent-to-cure group, 48 patients (96%) had Staphylococcus aureus or coagulase-negative staphylococci isolated.
The median (IQR) dosage of rifampin was 600 mg (300-900). The secondary oral antibiotics used most often were 36 fluoroquinolones (46%) followed by 20 tetracyclines (26%), 6 cephalosporins (8%), and 6 penicillins (8%). Additionally, 6 participants (8%) received IV vancomycin, and 1 participant (1%) was given an oral antifungal in addition to a fluoroquinolone because cultures revealed bacterial and fungal growth. The median (IQR) duration of antimicrobial therapy was 3 months (1.4-3.0). The mean (SD) duration of antimicrobial therapy was 3.6 (2.4) months for TKA infections and 2.4 (0.9) months for THA infections.
Clinical Outcome
Forty-one intent-to-cure group participants (82%) experienced treatment success. We further subdivided the intent-to-cure group by implant age. Participants whose implant was < 2 months old had a success rate of 93%, whereas patients whose implant was older had a success rate of 65% (P = .02).
Secondary Outcomes
The median (IQR) duration of antimicrobial treatment was 3 months (1.4-3.0) for the 38 patients with TKA-related infections and 3 months (1.4-6.0) for the 29 patients with THA infections. AEs were recorded in 24 (31%) of all study participants. Of those with AEs, the average number reported per patient was 1.6. Diarrhea, gastric upset, and nausea were each reported 7 times, accounting for 87% of all recorded AEs. Five participants reported having a rash while on antibiotics, and 2 experienced dysgeusia. One participant reported developing a yeast infection and another experienced vaginitis.
DISCUSSION
Among patients with orthopedic implant infections treated with intent to cure using a rifampin-containing antibiotic regimen at the MVAHCS, 82% had clinical success. Although this is lower than the success rates reported in clinical trials, this is not entirely unexpected.5,7 In most clinical trials studying DAIR and rifampin for PJI, patients are excluded if they do not have an acute staphylococcal infection in the setting of a well-fixed prosthesis without evidence of a sinus tract. Such exclusion criteria were not present in our retrospective study, which was designed to evaluate the real-world practice patterns at this facility. The population at the US Department of Veterans Affairs (VA) is older, more frail, and with more comorbid conditions than populations in prior studies. It is possible that patients with characteristics that would have caused them to be excluded from a clinical trial would be less likely to receive rifampin therapy with the intent to cure. This is suggested by the significantly higher prevalence of chronic infections (68%) in the without-intent-to-cure group compared with 22% in the intent-to-cure group. However, there were reasonably high proportions of participants included in the intent-to-cure group who did have conditions that would have led to their exclusion from prior trials, such as chronic infection (22%) and implant age ≥ 2 months (40%).
When evaluating participants by the age of their implant, treatment success rose to 93% for patients with implants < 2 months old compared with 65% for patients with older implants. This suggests that participants with a newer implant or more recent infection have a greater likelihood of successful treatment, which is consistent with the results of previous clinical trials.5,10 Considering how difficult multiple surgeries can be for older adult patients with comorbidities, we suggest that DAIR with a rifampin-containing regimen be considered as the primary treatment option for early PJIs at the MVAHCS. We also note inconsistent adherence to IDSA treatment guidelines on rifampin therapy, in that patients without intent to cure were prescribed a regimen including rifampin. This may reflect appropriate variability in the care of individual patients but may also offer an opportunity to change processes to improve care.
Limitations
Our analysis has limitations. As with any retrospective study evaluating the efficacy of a specific antibiotic, we were not able to attribute specific outcomes to the antibiotic of interest. Since the choice of antibiotics was left to the treating health care practitioner, therapy was not standardized, and because this was a retrospective study, causal relationships could not be inferred. Our analysis was also limited by the lack of intent to cure in 28 participants (36%), which could be an indication of practitioner bias in therapy selection or characteristic differences between the 2 groups. We looked for signs of infection failure 1 year after the completion of antimicrobial therapy, but longer follow-up could have led to higher rates of failure. Also, while participants’ infections were considered cured if they never sought further medical care for the infection at the MVAHCS, it is possible that patients could have sought care at another facility. We note that 9 patients were excluded because they were unable to complete a treatment course due to rifampin AEs, meaning that the success rates reported here reflect the success that may be expected if a patient can tolerate and complete a rifampin-based regimen. This study was conducted in a single VA hospital and may not be generalizable to nonveterans or veterans seeking care at other facilities.
Conclusions
DAIR followed by a short course of IV antibiotics and an oral regimen including rifampin and another antimicrobial is a reasonable option for veterans with acute staphylococcal orthopedic device infections at the MVAHCS. Patients with a well-placed prosthesis and an acute infection seem especially well suited for this treatment, and treatment with intent to cure should be pursued in patients who meet the criteria for rifampin therapy.
Acknowledgments
We thank Erik Stensgard, PharmD, for assistance in compiling the list of patients receiving rifampin and another antimicrobial.
Orthopedic implants are frequently used to repair fractures and replace joints. The number of total joint replacements is high, with > 1 million total hip (THA) and total knee (TKA) arthroplasties performed in the United States each year.1 While most joint arthroplasties are successful and significantly improve patient quality of life, a small proportion become infected.2 Prosthetic joint infection (PJI) causes substantial morbidity and mortality, particularly among older patients, and is difficult and costly to treat.3
The historic gold standard treatment for PJI is a 2-stage replacement, wherein the prosthesis is removed in one procedure and a new prosthesis is implanted in a second procedure after an extended course of antibiotics. This approach requires the patient to undergo 2 major procedures and spend considerable time without a functioning prosthesis, contributing to immobility and deconditioning. This option is difficult for frail or older patients and is associated with high medical costs.4
In 1998, a novel method of treatment known as debridement, antibiotics, and implant retention (DAIR) was evaluated in a small, randomized controlled trial.5 This study used a unique antimicrobial approach: the administration of ciprofloxacin plus either rifampin or placebo for 3 to 6 months, combined with a single surgical debridement. Eliminating a second surgical procedure and largely relying on oral antimicrobials reduces surgical risks and decreases costs.4 Current guidelines endorse DAIR with rifampin and a second antibiotic for patients diagnosed with PJI within about 30 days of prosthesis implantation who have a well-fixed implant without evidence of a sinus tract.6 Clinical trial data demonstrate that this approach is > 90% effective in patients with a well-fixed prosthesis and acute staphylococcal PJI.3,7
Thus far, clinical trials examining this approach have been small and did not include veterans who are typically older and have more comorbidities.8 The Minneapolis Veterans Affairs Health Care System (MVAHCS) infectious disease section has implemented the rifampin-based DAIR approach for orthopedic device-related infections since this approach was first described in 1998 but has not systematically evaluated its effectiveness or whether there are areas for improvement.
METHODS
We conducted a retrospective analysis of patients who underwent DAIR combined with a rifampin-containing regimen at the MVAHCS from January 1, 2001, through June 30, 2021. Inclusion required a diagnosis of orthopedic device-related infection and treatment with DAIR followed by antimicrobial therapy that included rifampin for 1 to 6 months. PJI was defined by meeting ≥ 1 of the following criteria: (1) isolation of the same microorganism from ≥ 2 cultures from joint aspirates or intraoperative tissue specimens; (2) purulence surrounding the prosthesis at the time of surgery; (3) acute inflammation consistent with infection on histopathological examination or periprosthetic tissue; or (4) presence of a sinus tract communicating with the prosthesis.
All cases of orthopedic device infection managed with DAIR and rifampin were included, regardless of implant stability, age of the implant at the time of symptom onset, presence of a sinus tract, or infecting microorganism. Exclusion criteria included patients who started or finished PJI treatment at another facility, were lost to follow-up, discontinued rifampin, died within 1 year of completing antibiotic therapy due to reasons unrelated to treatment failure, received rifampin for < 50% of their antimicrobial treatment course, had complete hardware removal, or had < 1 year between the completion of antimicrobial therapy and the time of data collection.
Management of DAIR procedures at the MVAHCS involves evaluating the fixation of the prosthesis, tissue sampling for microbiological analysis, and thorough debridement of infected tissue. Following debridement, a course of IV antibiotics is administered before initiating oral antibiotic therapy. To protect against resistance, rifampin is combined with another antibiotic typically from the fluoroquinolone, tetracycline, or cephalosporin class. Current guidelines suggest 3 and 6 months of oral antibiotics for prosthetic hip and knee infections, respectively.6
Treatment Outcomes
The primary outcome was treatment success, defined as meeting all of the following: (1) lack of clinical signs and symptoms of infection; (2) absence of radiological signs of loosening or infection within 1 year after the conclusion of treatment; and (3) absence of additional PJI treatment interventions for the prosthesis of concern within 1 year after completing the original antibiotic treatment.
Treatment failure was defined as meeting any of the following: (1) recurrence of PJI (original strain or different microorganism) within 1 year after the completion of antibiotic therapy; (2) death attributed to PJI anytime after the initial debridement; (3) removal of the prosthetic joint within 1 year after the completion of antibiotic therapy; or (4) long-term antibiotic use to suppress the PJI after the completion of the initial antibiotic therapy.
Statistical Analysis
Descriptive statistics were used to define the baseline characteristics of patients receiving rifampin therapy for orthopedic implant infections at the MVAHCS. Variables analyzed were age, sex, race and ethnicity, type of implant, age of implant, duration of symptoms, comorbidities (diabetes and rheumatoid arthritis), and presence of chronic infection. Patients were classified as having a chronic infection if they received previous infection treatment (antibiotics or surgery) for the orthopedic device in question. We created this category because patients with persistent infection after a medical or surgical attempt at treatment are likely to have a higher probability of treatment failure compared with those with no prior therapy. Charlson Comorbidity Index was calculated using clinical information present at the onset of infection.9 Fisher exact test was used to assess differences between categorical variables, and an independent t test was used to assess differences in continuous variables. P < .05 indicated statistical significance.
To assess the ability of a rifampin-based regimen to achieve a cure of PJI, we grouped participants into 2 categories: those with an intent to cure strategy and those without intent to cure based on documentation in the electronic health record (EHR). Participants who were prescribed rifampin with the documented goal of prosthesis retention with no further suppressive antibiotics were included in the intent-to-cure group, the primary focus of this study. Those excluded from the intent-to-cure group were given rifampin and another antibiotic, but there was a documented plan of either ongoing chronic suppression or eventual explantation; these participants were placed in the without-intent-to-cure group. Analysis of treatment success and failure was limited to the intent-to-cure group, whereas both groups were included for assessment of adverse effects (AEs) and treatment duration. This project was reviewed by the MVAHCS Institutional Review Board and determined to be a quality improvement initiative and to not meet the definition of research, and as such did not require review; it was reviewed and approved by the MVAHCS Research and Development Committee.
RESULTS
A total of 538 patients were identified who simultaneously received rifampin and another oral antibiotic between January 1, 2000, and June 30, 2021.
Forty-two participants (54%) had Staphylococcus aureus and 31 participants (40%) had coagulase-negative staphylococci infections, while 11 gram-negative organisms (14%) and 6 gram-positive anaerobic cocci (8%) infections were noted. Cutibacterium acnes and Streptococcus agalactiae were each found in 3 participants (4% of), and diphtheroids (not further identified) was found on 2 participants (3%). Candida albicans was identified in a single participant (1%), along with coagulase-negative staphylococci, and 2 participants (3%) had no identified organisms. There were multiple organisms isolated from 20 patients (26%).
Fifty participants had clear documentation in their EHR that cure of infection was the goal, meeting the criteria for the intent-to-cure group. The remaining 28 participants were placed in the without-intent-to-cure group. Success and failure rates were only measured in the intent-to-cure group, as by definition the without-intent-to-cure group patients would meet the criteria for failure (removal of prosthesis or long-term antibiotic use). The without-intent-to-cure group had a higher median age than the intent-to-cure group (69 years vs 64 years, P = .24) and a higher proportion of male participants (96% vs 80%, P = .09). The median (IQR) implant age of 11 months (1.0-50.5) in the without-intent-to-cure group was also higher than the median implant age of 1 month (0.6-22.0) in the primary group (P = .22). In the without-intent-to-cure group, 19 participants (68%) had a chronic infection, compared with 11 (22%) in the intent-to-cure group (P < .001).
The mean (SD) Charlson Comorbidity Index in the without-intent-to-cure group was 2.5 (1.3) compared with 1.9 (1.4) in the intent-to-cure group (P = .09). There was no significant difference in the type of implant or microbiology of the infecting organism between the 2 groups, although it should be noted that in the intent-to-cure group, 48 patients (96%) had Staphylococcus aureus or coagulase-negative staphylococci isolated.
The median (IQR) dosage of rifampin was 600 mg (300-900). The secondary oral antibiotics used most often were 36 fluoroquinolones (46%) followed by 20 tetracyclines (26%), 6 cephalosporins (8%), and 6 penicillins (8%). Additionally, 6 participants (8%) received IV vancomycin, and 1 participant (1%) was given an oral antifungal in addition to a fluoroquinolone because cultures revealed bacterial and fungal growth. The median (IQR) duration of antimicrobial therapy was 3 months (1.4-3.0). The mean (SD) duration of antimicrobial therapy was 3.6 (2.4) months for TKA infections and 2.4 (0.9) months for THA infections.
Clinical Outcome
Forty-one intent-to-cure group participants (82%) experienced treatment success. We further subdivided the intent-to-cure group by implant age. Participants whose implant was < 2 months old had a success rate of 93%, whereas patients whose implant was older had a success rate of 65% (P = .02).
Secondary Outcomes
The median (IQR) duration of antimicrobial treatment was 3 months (1.4-3.0) for the 38 patients with TKA-related infections and 3 months (1.4-6.0) for the 29 patients with THA infections. AEs were recorded in 24 (31%) of all study participants. Of those with AEs, the average number reported per patient was 1.6. Diarrhea, gastric upset, and nausea were each reported 7 times, accounting for 87% of all recorded AEs. Five participants reported having a rash while on antibiotics, and 2 experienced dysgeusia. One participant reported developing a yeast infection and another experienced vaginitis.
DISCUSSION
Among patients with orthopedic implant infections treated with intent to cure using a rifampin-containing antibiotic regimen at the MVAHCS, 82% had clinical success. Although this is lower than the success rates reported in clinical trials, this is not entirely unexpected.5,7 In most clinical trials studying DAIR and rifampin for PJI, patients are excluded if they do not have an acute staphylococcal infection in the setting of a well-fixed prosthesis without evidence of a sinus tract. Such exclusion criteria were not present in our retrospective study, which was designed to evaluate the real-world practice patterns at this facility. The population at the US Department of Veterans Affairs (VA) is older, more frail, and with more comorbid conditions than populations in prior studies. It is possible that patients with characteristics that would have caused them to be excluded from a clinical trial would be less likely to receive rifampin therapy with the intent to cure. This is suggested by the significantly higher prevalence of chronic infections (68%) in the without-intent-to-cure group compared with 22% in the intent-to-cure group. However, there were reasonably high proportions of participants included in the intent-to-cure group who did have conditions that would have led to their exclusion from prior trials, such as chronic infection (22%) and implant age ≥ 2 months (40%).
When evaluating participants by the age of their implant, treatment success rose to 93% for patients with implants < 2 months old compared with 65% for patients with older implants. This suggests that participants with a newer implant or more recent infection have a greater likelihood of successful treatment, which is consistent with the results of previous clinical trials.5,10 Considering how difficult multiple surgeries can be for older adult patients with comorbidities, we suggest that DAIR with a rifampin-containing regimen be considered as the primary treatment option for early PJIs at the MVAHCS. We also note inconsistent adherence to IDSA treatment guidelines on rifampin therapy, in that patients without intent to cure were prescribed a regimen including rifampin. This may reflect appropriate variability in the care of individual patients but may also offer an opportunity to change processes to improve care.
Limitations
Our analysis has limitations. As with any retrospective study evaluating the efficacy of a specific antibiotic, we were not able to attribute specific outcomes to the antibiotic of interest. Since the choice of antibiotics was left to the treating health care practitioner, therapy was not standardized, and because this was a retrospective study, causal relationships could not be inferred. Our analysis was also limited by the lack of intent to cure in 28 participants (36%), which could be an indication of practitioner bias in therapy selection or characteristic differences between the 2 groups. We looked for signs of infection failure 1 year after the completion of antimicrobial therapy, but longer follow-up could have led to higher rates of failure. Also, while participants’ infections were considered cured if they never sought further medical care for the infection at the MVAHCS, it is possible that patients could have sought care at another facility. We note that 9 patients were excluded because they were unable to complete a treatment course due to rifampin AEs, meaning that the success rates reported here reflect the success that may be expected if a patient can tolerate and complete a rifampin-based regimen. This study was conducted in a single VA hospital and may not be generalizable to nonveterans or veterans seeking care at other facilities.
Conclusions
DAIR followed by a short course of IV antibiotics and an oral regimen including rifampin and another antimicrobial is a reasonable option for veterans with acute staphylococcal orthopedic device infections at the MVAHCS. Patients with a well-placed prosthesis and an acute infection seem especially well suited for this treatment, and treatment with intent to cure should be pursued in patients who meet the criteria for rifampin therapy.
Acknowledgments
We thank Erik Stensgard, PharmD, for assistance in compiling the list of patients receiving rifampin and another antimicrobial.
1. Maradit Kremers H, Larson DR, Crowson CS, et al. Prevalence of total hip and knee replacement in the United States. J Bone Joint Surg Am. 2015;97(17):1386-1397. doi:10.2106/JBJS.N.01141
2. Kapadia BH, Berg RA, Daley JA, Fritz J, Bhave A, Mont MA. Periprosthetic joint infection. Lancet. 2016;387(10016):386-394. doi:10.1016/S0140-6736(14)61798-0
3. Zhan C, Kaczmarek R, Loyo-Berrios N, Sangl J, Bright RA. Incidence and short-term outcomes of primary and revision hip replacement in the United States. J Bone Joint Surg Am. 2007;89(3):526-533. doi:10.2106/JBJS.F.00952
4. Fisman DN, Reilly DT, Karchmer AW, Goldie SJ. Clinical effectiveness and cost-effectiveness of 2 management strategies for infected total hip arthroplasty in the elderly. Clin Infect Dis. 2001;32(3):419-430. doi:10.1086/318502
5. Zimmerli W, Widmer AF, Blatter M, Frei R, Ochsner PE. Role of rifampin for treatment of orthopedic implant-related staphylococcal infections: a randomized controlled trial. Foreign-Body Infection (FBI) Study Group. JAMA. 1998;279(19):1537-1541. doi:10.1001/jama.279.19.1537
6. Osmon DR, Berbari EF, Berendt AR, et al. Diagnosis and management of prosthetic joint infection: clinical practice guidelines by the Infectious Diseases Society of America. Clin Infect Dis. 2013;56(1):e1-e25. doi:10.1093/cid/cis803
7. Lora-Tamayo J, Euba G, Cobo J, et al. Short- versus long-duration levofloxacin plus rifampicin for acute staphylococcal prosthetic joint infection managed with implant retention: a randomised clinical trial. Int J Antimicrob Agents. 2016;48(3):310-316. doi:10.1016/j.ijantimicag.2016.05.021
8. Agha Z, Lofgren RP, VanRuiswyk JV, Layde PM. Are patients at Veterans Affairs medical centers sicker? A comparative analysis of health status and medical resource use. Arch Intern Med. 2000;160(21):3252-3257. doi:10.1001/archinte.160.21.3252
9. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383. doi:10.1016/0021-9681(87)90171-8
10. Vilchez F, Martínez-Pastor JC, García-Ramiro S, et al. Outcome and predictors of treatment failure in early post-surgical prosthetic joint infections due to Staphylococcus aureus treated with debridement. Clin Microbiol Infect. 2011;17(3):439-444. doi:10.1111/j.1469-0691.2010.03244.x
1. Maradit Kremers H, Larson DR, Crowson CS, et al. Prevalence of total hip and knee replacement in the United States. J Bone Joint Surg Am. 2015;97(17):1386-1397. doi:10.2106/JBJS.N.01141
2. Kapadia BH, Berg RA, Daley JA, Fritz J, Bhave A, Mont MA. Periprosthetic joint infection. Lancet. 2016;387(10016):386-394. doi:10.1016/S0140-6736(14)61798-0
3. Zhan C, Kaczmarek R, Loyo-Berrios N, Sangl J, Bright RA. Incidence and short-term outcomes of primary and revision hip replacement in the United States. J Bone Joint Surg Am. 2007;89(3):526-533. doi:10.2106/JBJS.F.00952
4. Fisman DN, Reilly DT, Karchmer AW, Goldie SJ. Clinical effectiveness and cost-effectiveness of 2 management strategies for infected total hip arthroplasty in the elderly. Clin Infect Dis. 2001;32(3):419-430. doi:10.1086/318502
5. Zimmerli W, Widmer AF, Blatter M, Frei R, Ochsner PE. Role of rifampin for treatment of orthopedic implant-related staphylococcal infections: a randomized controlled trial. Foreign-Body Infection (FBI) Study Group. JAMA. 1998;279(19):1537-1541. doi:10.1001/jama.279.19.1537
6. Osmon DR, Berbari EF, Berendt AR, et al. Diagnosis and management of prosthetic joint infection: clinical practice guidelines by the Infectious Diseases Society of America. Clin Infect Dis. 2013;56(1):e1-e25. doi:10.1093/cid/cis803
7. Lora-Tamayo J, Euba G, Cobo J, et al. Short- versus long-duration levofloxacin plus rifampicin for acute staphylococcal prosthetic joint infection managed with implant retention: a randomised clinical trial. Int J Antimicrob Agents. 2016;48(3):310-316. doi:10.1016/j.ijantimicag.2016.05.021
8. Agha Z, Lofgren RP, VanRuiswyk JV, Layde PM. Are patients at Veterans Affairs medical centers sicker? A comparative analysis of health status and medical resource use. Arch Intern Med. 2000;160(21):3252-3257. doi:10.1001/archinte.160.21.3252
9. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383. doi:10.1016/0021-9681(87)90171-8
10. Vilchez F, Martínez-Pastor JC, García-Ramiro S, et al. Outcome and predictors of treatment failure in early post-surgical prosthetic joint infections due to Staphylococcus aureus treated with debridement. Clin Microbiol Infect. 2011;17(3):439-444. doi:10.1111/j.1469-0691.2010.03244.x
A worthwhile tool in evaluating worrisome lesions
ABSTRACT
Background: We sought to examine whether electrical impedance spectroscopy (EIS), a diagnostic tool approved by the US Food and Drug Administration for the evaluation of pigmented skin lesions (PSLs), is beneficial to primary care providers (PCPs) by comparing the accuracy of PCPs’ management decisions for PSLs based on visual examination alone with those based on concurrent visual and EIS evaluation.
Methods: Physicians and nurse practitioners (NPs) participated in an anonymous online survey in which they viewed clinical images of PSLs and were asked to make 2 clinical decisions before and after being provided an EIS score that indicated the likelihood that the lesion was a melanoma. They were asked (1) if they would biopsy the lesion/refer the patient out and (2) what they expected the pathology results would show.
Results: Forty-four physicians and 17 NPs participated, making clinical decisions for 1354 presented lesions. Overall, with the addition of EIS to visual inspection of clinical images, the sensitivity of biopsy/referral decisions for melanomas and severely dysplastic nevi (SDN) increased from 69.2% to 90.0% (P < .001), while specificity increased from 44.0% to 72.6% (P < .001). Physicians and NPs, regardless of years of experience, each saw significant improvements in sensitivity, specificity, and diagnostic accuracy with the addition of EIS scores.
Conclusions: The incorporation of EIS data into clinical decision-making by PCPs significantly increased the sensitivity and specificity of biopsy/referral decisions for melanomas and SDN and overall diagnostic accuracy compared with visual inspection alone. The results of this study suggest that diagnostic accuracy for PSLs by PCPs may be improved with adjunctive use of EIS with visual inspection.
Primary care providers (PCPs) are often the first line of defense in detecting skin cancers. For patients with concerning skin lesions, PCPs may choose to perform a biopsy or facilitate access to specialty services (eg, Dermatology). Consequently, PCPs play a critical role in the timely detection of skin cancers, and it is paramount to employ continually improving detection methods, such as the application of technologic advances.1
Differentiating benign nevi from melanoma and severely dysplastic nevi (SDN), both of which warrant excision, poses a unique challenge to clinicians examining pigmented skin lesions (PSLs). PCPs often rely on visual inspection to differentiate benign skin lesions from malignant skin cancers. In some primary care practices, dermoscopy, which involves using a handheld device to evaluate lesions with polarized light and magnification, is used to improve melanoma detection. However, while visual inspection and dermoscopy are valid, effective techniques for the diagnosis of melanocytic lesions, in many instances they still can lead to missed cancers or unnecessary biopsies and specialty referrals. Adjunctive use of dermoscopy with visual inspection has been shown to increase the probability of skin cancer detection, but it fails to achieve a near-100% success rate.2 Furthermore, dermoscopy is heavily user-dependent, requiring significant training and experience for appropriate use.3
Another option is an electrical impedance spectroscopy (EIS) device (Nevisense, Scibase, Stockholm, Sweden), which has been approved by the US Food and Drug Administration (FDA) to assist in the detection of melanoma and differentiation from benign PSLs.4 EIS is a noninvasive, rapidly applied technology designed to accompany the visual examination of melanocytic lesions in office, with or without dermoscopy. Still relatively new, the technology is employed today by many dermatologists, increasing diagnostic accuracy for PSLs.5 The lightweight and portable instrument features a handheld probe, which is held against a lesion to obtain a reading. EIS uses a low-voltage electrode to apply a harmless electrical current to the skin at various frequencies.6 As benign and malignant tissues vary in cell shape, size, and composition, EIS distinguishes differential electrical resistance of the tissue to aid in diagnosis.7
Continue to: EIS provides high-sensitivity...
EIS provides high-sensitivity melanoma diagnosis vs histopathologic confirmation from biopsies, with 1 study showing a 96.6% sensitivity rating, detecting 256 of 265 melanomas.4 The EIS device, by measuring differences in electrical resistance between benign and cancerous cells, outputs a simple integer score ranging from 0 to 10 associated with the likelihood of the lesion being a melanoma.8 Based on data from the Nevisense pivotal trial,4 Nevisense reports that scores of 0 to 3 carry a negative predictive value of 99% for melanoma, whereas scores of 4 to 10 signify increasingly greater positive predictive values from 7% to 61%.
We aimed to assess whether EIS may be beneficial to PCPs by comparing the accuracy of clinical decision-making for PSLs based on visual examination alone with that based on concurrent visual and EIS evaluation.
METHODS
A questionnaire was distributed via email to 142 clinicians at clinics affiliated with either of 2 organizations delivering care to the New York City area through a network of community health centers: the Institute for Family Health (IFH) and the Community Healthcare Network (CHN). Of these recipients, 72 were affiliated with IFH across 27 community health centers and 70 were affiliated with CHN across 14 community health centers. Recipients were physicians and nurse practitioners (NPs) practicing at primary health care facilities.
Survey instrument. The first section of the survey instrument (APPENDIX) solicited demographic information and explained how to apply the EIS scores for diagnostic decision-making. The second featured images of 12 randomly selected, histologically confirmed, and EIS-evaluated PSLs from a previously published prospective blinded trial of 2416 lesions.4 The Institutional Review Board of the Icahn School of Medicine at Mount Sinai reviewed and approved the study and survey instrument.
Clinical images of these lesions, comprising 4 melanocytic nevi, 4 dysplastic nevi (including 3 mild-moderately dysplastic and 1 severely dysplastic nevus), and 4 melanomas
Continue to: Analysis
Analysis. A biopsy or referral rating of 4 or 5 was considered a decision to biopsy or refer (ie, a diagnostic decision consistent with melanoma or SDN warranting excision), whereas a selection of 1 to 3 was considered a decision not to biopsy or refer (ie, a diagnostic decision consistent with a benign PSL). The sensitivity and specificity of biopsy/referral decisions for melanomas and SDN, the proportion of missed melanomas and SDN, and the proportion of biopsy/referral decisions for benign lesions were separately determined for visual inspection alone and visual inspection with EIS score. Similarly, diagnostic accuracy was calculated for these clinical scenarios. These metrics were further stratified among different subsets of the respondent population. Differences in sensitivity, specificity, biopsy/referral decision proportions, and diagnostic accuracy were calculated using McNemar’s test for paired proportions.
RESULTS
Sixty-one respondents, comprising 44 physicians and 17 NPs, completed the survey, yielding a response rate of 43% (TABLE 1). In total, 1354 clinical decisions (677 based on visual inspection alone and 677 based on visual inspection plus EIS) were made. A biopsy/referral decision was made after assessing 416 of 677 cases (61%) with visual inspection alone and 360 of 677 cases (53%) when relying on visual inspection plus EIS. None of the respondents reported any prior experience with EIS.
When incorporating EIS scores, respondents’ mean sensitivity for melanomas and SDN increased from 69.2% to 90.0% (P < .001) and specificity from 44.0% to 72.6% (P < .001; TABLE 2). At baseline, physicians demonstrated a sensitivity and specificity of 74.6% and 46.5%, respectively, while NPs demonstrated a sensitivity and specificity of 56.1% and 37.9%, respectively.
All respondent subgroups stratified by occupation and years of experience saw significant increases in both sensitivity and specificity upon the incorporation of EIS scores, with NPs seeing a greater increase in sensitivity (56.1% vs 85.4%; P < .001) and specificity (37.9% vs 69.0%; P < .001) than physicians (sensitivity: 74.6% vs 91.9%; P < .001; specificity: 46.5% vs 74.1%; P < .001). The only difference in diagnostic performance based on years of experience was a greater pre-EIS sensitivity by clinicians who had been in practice for ≥ 15 years, compared with those in practice for shorter periods (TABLE 2).
Diagnostic accuracy increased significantly from 48% when based on visual inspection alone to 73% with the addition of EIS scores (P < .001; TABLE 3). Physicians and NPs each significantly increased their diagnostic accuracy upon the incorporation of EIS, with NPs exhibiting the greatest increase (from 36.9% to 65.7%; P < .001). PCPs with 6 to 14 years of experience saw the greatest increase in diagnostic accuracy when adding EIS (45.9% vs 76.4%; P < .001). Overall, the addition of EIS scores resulted in 58 fewer missed melanomas and SDN and 114 fewer benign referrals or biopsies (TABLE 4).
Continue to: DISCUSSION
DISCUSSION
Primary care evaluation plays a significant role in the diagnosis and management of PSLs, ultimately shaping outcomes for patients with melanoma. Improved accuracy of PSL classification could yield greater sensitivity for the diagnosis of melanomas and high-risk melanocytic lesions at earlier stages, while also reducing the number of unnecessary biopsies and referrals—leading to decreased patient morbidity and mortality and reduced health care spending.9
Diagnostic tools are valuable insofar as they can improve accuracy and positively impact clinical management and patient outcomes.10 In this case, increased sensitivity reduced missed melanoma diagnoses, while increased specificity avoided the additional costs and patient toll associated with a biopsy or referral for a benign lesion.
Dermoscopy has been shown to improve the sensitivity and specificity of PSL diagnosis compared with visual inspection alone; however, without substantial training and experience, accuracy with dermoscopy can be no better than examination with the naked eye.3,11,12 The dropout rates are high for training PCPs in its use, given that several months of training may be needed for competent use.13,14 To improve the clinical management of PSLs broadly in primary care, a need exists for easy-to-use adjunctive tools that increase diagnostic accuracy.15
In this study, with only a brief explanation of how to interpret EIS scores, clinicians without any prior experience using EIS demonstrated significantly improved accuracy in deciding appropriate management and classifying melanocytic lesions with the addition of EIS to visual inspection. These improvements, seen in clinicians of varying training and experience, suggest that the learning curve of EIS may not be as steep as that of dermoscopy.
The greater baseline sensitivity, specificity, and diagnostic accuracy of physicians’ clinical decision-making compared with NPs before the incorporation of EIS in the study may be a product of comparatively more extensive medical training. In addition, EIS yielded a greater benefit to NPs than to physicians, with greater increases in sensitivity and specificity noted. This suggests that the use of EIS is particularly advantageous to clinicians who are less proficient in assessing melanocytic lesions. Using visual inspection alone, more experienced respondents made biopsy/referral decisions with greater sensitivity but similar specificity to those with less experience. With the incorporation of EIS scores, the sensitivity and specificity of respondents’ clinical decision-making rose to comparable levels across all experience groups, providing further indication of EIS’s particular value to clinicians who are less proficient in PSL evaluation.
Continue to: This technology holds the potential...
This technology holds the potential to be seamlessly implemented into primary care practice, given that dermatology expertise training is not required to use the EIS device; this could allow for EIS measurement of lesions to be delegated to office staff (eg, nurses, medical assistants).16 Future studies are needed to assess EIS use among PCPs in a real-world setting, where factors such as its application on nonmelanocytic lesions (eg, seborrheic keratoses) and its pairing with patient historical data could produce varying results.
Limitations. While revealing, this study had its limitations. Respondents did not have access to additional pertinent clinical information, such as patients’ histories and risk factors. Clinical decisions in this survey were made based on digital images rather than in vivo examination. This may not represent a real-life evaluation; there is the potential for minimization of the true consequences of a missed melanoma or unnecessary biopsy in the minds of participants, and this does not factor in the operation of the actual EIS device. The Hawthorne effect may also have influenced PCPs’ diagnostic selections. Also, the limited sample size constitutes another limitation.
Of note, in this survey format, respondents rated their inclination to biopsy or refer each lesion from 1 to 5. For statistical analyses, lesions rated 1 to 3 were considered as not biopsied/referred and those rated 4 to 5 as biopsied/referred. The sensitivity and specificity values observed, for both visual examination and concurrent visual and EIS evaluation, are therefore based on this classification system of participants’ provided ratings. It is conceivable that differing sensitivity and specificity values might have been detected if clinicians were instead given a binary choice for referral/biopsy decisions.
CONCLUSIONS
Among PCPs tasked with evaluating melanocytic lesions, the incorporation of EIS data into clinical decision-making in this study significantly increased the sensitivity, specificity, and overall diagnostic accuracy of biopsy or referral decisions for melanomas and SDN compared with visual inspection alone. Overall, the results of this preliminary study suggest that diagnostic accuracy for PSLs by PCPs may be improved with the adjunctive use of EIS with visual inspection. This would ultimately improve patient care and reduce the morbidity and mortality of a melanoma diagnosis.
CORRESPONDENCE
Jonathan Ungar, MD, Kimberly and Eric J. Waldman Department of Dermatology, Icahn School of Medicine at Mount Sinai, 5 East 98th Street, 5th Floor, New York, NY 10029; jonathan.ungar@mountsinai.org
1. Goetsch NJ, Hoehns JD, Sutherland JE, et al. Assessment of postgraduate skin lesion education among Iowa family physicians. SAGE Open Med. 2017;5:2050312117691392. doi: 10.1177/2050312117691392
2. Dinnes J, Deeks JJ, Chuchu N, et al. Dermoscopy, with and without visual inspection, for diagnosing melanoma in adults. Cochrane Database Syst Rev. 2018;12:CD011902. doi: 10.1002/14651858.CD011902.pub2
3. Jones OT, Jurascheck LC, van Melle MA, et al. Dermoscopy for melanoma detection and triage in primary care: a systematic review. BMJ Open. 2019;9:e027529. doi: 10.1136/bmjopen-2018-027529
4. Malvehy J, Hauschild A, Curiel-Lewandrowski C, et al. Clinical performance of the Nevisense system in cutaneous melanoma detection: an international, multicentre, prospective and blinded clinical trial on efficacy and safety. Br J Dermatol. 2014;171:1099-1107. doi: 10.1111/bjd.13121
5. Svoboda RM, Prado G, Mirsky RS, et al. Assessment of clinician accuracy for diagnosing melanoma on the basis of electrical impedance spectroscopy score plus morphology versus lesion morphology alone. J Am Acad Dermatol. 2019;80:285-287. doi: 10.1016/j.jaad.2018.08.048
6. Mohr P, Birgersson U, Berking C, et al. Electrical impedance spectroscopy as a potential adjunct diagnostic tool for cutaneous melanoma. Skin Res Technol. 2013;19:75-83. doi: 10.1111/srt.12008
7. Rocha L, Menzies SW, Lo S, et al. Analysis of an electrical impedance spectroscopy system in short-term digital dermoscopy imaging of melanocytic lesions. Br J Dermatol. 2017;177:1432-1438. doi: 10.1111/bjd.15595
8. Litchman GH, Teplitz RW, Marson JW, et al. Impact of electrical impedance spectroscopy on dermatologists’ number needed to biopsy metric and biopsy decisions for pigmented skin lesions. J Am Acad Dermatol. 2021;85:976-979. doi: 10.1016/j.jaad.2020.09.011
9. Greenwood-Lee J, Jewett L, Woodhouse L, et al. A categorisation of problems and solutions to improve patient referrals from primary to specialty care. BMC Health Serv Res. 2018;18:1-16. doi: 10.1186/s12913-018-3745-y
10. Bossuyt PM, Reitsma JB, Linnet K, et al. Beyond diagnostic accuracy: the clinical utility of diagnostic tests. Clin Chem. 2012;58:1636-1643. doi: 10.1373/clinchem.2012.182576
11. Argenziano G, Cerroni L, Zalaudek I , et al. Accuracy in melanoma detection: a 10-year multicenter survey. J Am Acad Dermatol. 2012;67:54-59. doi: 10.1016/j.jaad.2011.07.019
12. Menzies SW, Vestergaard ME, Macaskill P, et al. Dermoscopy compared with naked eye examination for the diagnosis of primary melanoma: a meta-analysis of studies performed in a clinical setting. Br J Dermatol. 2008;159:669-676. doi: 10.1111/j.1365-2133.2008.08713.x
13. Menzies SW, Emery J, Staples M, et al. Impact of dermoscopy and short-term sequential digital dermoscopy imaging for the management of pigmented lesions in primary care: a sequential intervention trial. Br J Dermatol. 2009;161:1270-1277. doi: 10.1111/j.1365-2133.2009.09374.x
14. Noor O, Nanda A, Rao BK. A dermoscopy survey to assess who is using it and why it is or is not being used. Int J Dermatol. 2009;48:951-952. doi: 10.1111/j.1365-4632.2009.04095.x
15. Weigl BH, Boyle DS, de los Santos T, et al. Simplicity of use: a critical feature for widespread adoption of diagnostic technologies in low-resource settings. Expert Rev Med Devices. 2009;6:461-464. doi: 10.1586/erd.09.31
16. Sarac E, Meiwes A, Eigentler T, et al. Diagnostic accuracy of electrical impedance spectroscopy in non-melanoma skin cancer. Acta Derm Venereol. 2020;100:adv00328. doi: 10.2340/00015555-3689
ABSTRACT
Background: We sought to examine whether electrical impedance spectroscopy (EIS), a diagnostic tool approved by the US Food and Drug Administration for the evaluation of pigmented skin lesions (PSLs), is beneficial to primary care providers (PCPs) by comparing the accuracy of PCPs’ management decisions for PSLs based on visual examination alone with those based on concurrent visual and EIS evaluation.
Methods: Physicians and nurse practitioners (NPs) participated in an anonymous online survey in which they viewed clinical images of PSLs and were asked to make 2 clinical decisions before and after being provided an EIS score that indicated the likelihood that the lesion was a melanoma. They were asked (1) if they would biopsy the lesion/refer the patient out and (2) what they expected the pathology results would show.
Results: Forty-four physicians and 17 NPs participated, making clinical decisions for 1354 presented lesions. Overall, with the addition of EIS to visual inspection of clinical images, the sensitivity of biopsy/referral decisions for melanomas and severely dysplastic nevi (SDN) increased from 69.2% to 90.0% (P < .001), while specificity increased from 44.0% to 72.6% (P < .001). Physicians and NPs, regardless of years of experience, each saw significant improvements in sensitivity, specificity, and diagnostic accuracy with the addition of EIS scores.
Conclusions: The incorporation of EIS data into clinical decision-making by PCPs significantly increased the sensitivity and specificity of biopsy/referral decisions for melanomas and SDN and overall diagnostic accuracy compared with visual inspection alone. The results of this study suggest that diagnostic accuracy for PSLs by PCPs may be improved with adjunctive use of EIS with visual inspection.
Primary care providers (PCPs) are often the first line of defense in detecting skin cancers. For patients with concerning skin lesions, PCPs may choose to perform a biopsy or facilitate access to specialty services (eg, Dermatology). Consequently, PCPs play a critical role in the timely detection of skin cancers, and it is paramount to employ continually improving detection methods, such as the application of technologic advances.1
Differentiating benign nevi from melanoma and severely dysplastic nevi (SDN), both of which warrant excision, poses a unique challenge to clinicians examining pigmented skin lesions (PSLs). PCPs often rely on visual inspection to differentiate benign skin lesions from malignant skin cancers. In some primary care practices, dermoscopy, which involves using a handheld device to evaluate lesions with polarized light and magnification, is used to improve melanoma detection. However, while visual inspection and dermoscopy are valid, effective techniques for the diagnosis of melanocytic lesions, in many instances they still can lead to missed cancers or unnecessary biopsies and specialty referrals. Adjunctive use of dermoscopy with visual inspection has been shown to increase the probability of skin cancer detection, but it fails to achieve a near-100% success rate.2 Furthermore, dermoscopy is heavily user-dependent, requiring significant training and experience for appropriate use.3
Another option is an electrical impedance spectroscopy (EIS) device (Nevisense, Scibase, Stockholm, Sweden), which has been approved by the US Food and Drug Administration (FDA) to assist in the detection of melanoma and differentiation from benign PSLs.4 EIS is a noninvasive, rapidly applied technology designed to accompany the visual examination of melanocytic lesions in office, with or without dermoscopy. Still relatively new, the technology is employed today by many dermatologists, increasing diagnostic accuracy for PSLs.5 The lightweight and portable instrument features a handheld probe, which is held against a lesion to obtain a reading. EIS uses a low-voltage electrode to apply a harmless electrical current to the skin at various frequencies.6 As benign and malignant tissues vary in cell shape, size, and composition, EIS distinguishes differential electrical resistance of the tissue to aid in diagnosis.7
Continue to: EIS provides high-sensitivity...
EIS provides high-sensitivity melanoma diagnosis vs histopathologic confirmation from biopsies, with 1 study showing a 96.6% sensitivity rating, detecting 256 of 265 melanomas.4 The EIS device, by measuring differences in electrical resistance between benign and cancerous cells, outputs a simple integer score ranging from 0 to 10 associated with the likelihood of the lesion being a melanoma.8 Based on data from the Nevisense pivotal trial,4 Nevisense reports that scores of 0 to 3 carry a negative predictive value of 99% for melanoma, whereas scores of 4 to 10 signify increasingly greater positive predictive values from 7% to 61%.
We aimed to assess whether EIS may be beneficial to PCPs by comparing the accuracy of clinical decision-making for PSLs based on visual examination alone with that based on concurrent visual and EIS evaluation.
METHODS
A questionnaire was distributed via email to 142 clinicians at clinics affiliated with either of 2 organizations delivering care to the New York City area through a network of community health centers: the Institute for Family Health (IFH) and the Community Healthcare Network (CHN). Of these recipients, 72 were affiliated with IFH across 27 community health centers and 70 were affiliated with CHN across 14 community health centers. Recipients were physicians and nurse practitioners (NPs) practicing at primary health care facilities.
Survey instrument. The first section of the survey instrument (APPENDIX) solicited demographic information and explained how to apply the EIS scores for diagnostic decision-making. The second featured images of 12 randomly selected, histologically confirmed, and EIS-evaluated PSLs from a previously published prospective blinded trial of 2416 lesions.4 The Institutional Review Board of the Icahn School of Medicine at Mount Sinai reviewed and approved the study and survey instrument.
Clinical images of these lesions, comprising 4 melanocytic nevi, 4 dysplastic nevi (including 3 mild-moderately dysplastic and 1 severely dysplastic nevus), and 4 melanomas
Continue to: Analysis
Analysis. A biopsy or referral rating of 4 or 5 was considered a decision to biopsy or refer (ie, a diagnostic decision consistent with melanoma or SDN warranting excision), whereas a selection of 1 to 3 was considered a decision not to biopsy or refer (ie, a diagnostic decision consistent with a benign PSL). The sensitivity and specificity of biopsy/referral decisions for melanomas and SDN, the proportion of missed melanomas and SDN, and the proportion of biopsy/referral decisions for benign lesions were separately determined for visual inspection alone and visual inspection with EIS score. Similarly, diagnostic accuracy was calculated for these clinical scenarios. These metrics were further stratified among different subsets of the respondent population. Differences in sensitivity, specificity, biopsy/referral decision proportions, and diagnostic accuracy were calculated using McNemar’s test for paired proportions.
RESULTS
Sixty-one respondents, comprising 44 physicians and 17 NPs, completed the survey, yielding a response rate of 43% (TABLE 1). In total, 1354 clinical decisions (677 based on visual inspection alone and 677 based on visual inspection plus EIS) were made. A biopsy/referral decision was made after assessing 416 of 677 cases (61%) with visual inspection alone and 360 of 677 cases (53%) when relying on visual inspection plus EIS. None of the respondents reported any prior experience with EIS.
When incorporating EIS scores, respondents’ mean sensitivity for melanomas and SDN increased from 69.2% to 90.0% (P < .001) and specificity from 44.0% to 72.6% (P < .001; TABLE 2). At baseline, physicians demonstrated a sensitivity and specificity of 74.6% and 46.5%, respectively, while NPs demonstrated a sensitivity and specificity of 56.1% and 37.9%, respectively.
All respondent subgroups stratified by occupation and years of experience saw significant increases in both sensitivity and specificity upon the incorporation of EIS scores, with NPs seeing a greater increase in sensitivity (56.1% vs 85.4%; P < .001) and specificity (37.9% vs 69.0%; P < .001) than physicians (sensitivity: 74.6% vs 91.9%; P < .001; specificity: 46.5% vs 74.1%; P < .001). The only difference in diagnostic performance based on years of experience was a greater pre-EIS sensitivity by clinicians who had been in practice for ≥ 15 years, compared with those in practice for shorter periods (TABLE 2).
Diagnostic accuracy increased significantly from 48% when based on visual inspection alone to 73% with the addition of EIS scores (P < .001; TABLE 3). Physicians and NPs each significantly increased their diagnostic accuracy upon the incorporation of EIS, with NPs exhibiting the greatest increase (from 36.9% to 65.7%; P < .001). PCPs with 6 to 14 years of experience saw the greatest increase in diagnostic accuracy when adding EIS (45.9% vs 76.4%; P < .001). Overall, the addition of EIS scores resulted in 58 fewer missed melanomas and SDN and 114 fewer benign referrals or biopsies (TABLE 4).
Continue to: DISCUSSION
DISCUSSION
Primary care evaluation plays a significant role in the diagnosis and management of PSLs, ultimately shaping outcomes for patients with melanoma. Improved accuracy of PSL classification could yield greater sensitivity for the diagnosis of melanomas and high-risk melanocytic lesions at earlier stages, while also reducing the number of unnecessary biopsies and referrals—leading to decreased patient morbidity and mortality and reduced health care spending.9
Diagnostic tools are valuable insofar as they can improve accuracy and positively impact clinical management and patient outcomes.10 In this case, increased sensitivity reduced missed melanoma diagnoses, while increased specificity avoided the additional costs and patient toll associated with a biopsy or referral for a benign lesion.
Dermoscopy has been shown to improve the sensitivity and specificity of PSL diagnosis compared with visual inspection alone; however, without substantial training and experience, accuracy with dermoscopy can be no better than examination with the naked eye.3,11,12 The dropout rates are high for training PCPs in its use, given that several months of training may be needed for competent use.13,14 To improve the clinical management of PSLs broadly in primary care, a need exists for easy-to-use adjunctive tools that increase diagnostic accuracy.15
In this study, with only a brief explanation of how to interpret EIS scores, clinicians without any prior experience using EIS demonstrated significantly improved accuracy in deciding appropriate management and classifying melanocytic lesions with the addition of EIS to visual inspection. These improvements, seen in clinicians of varying training and experience, suggest that the learning curve of EIS may not be as steep as that of dermoscopy.
The greater baseline sensitivity, specificity, and diagnostic accuracy of physicians’ clinical decision-making compared with NPs before the incorporation of EIS in the study may be a product of comparatively more extensive medical training. In addition, EIS yielded a greater benefit to NPs than to physicians, with greater increases in sensitivity and specificity noted. This suggests that the use of EIS is particularly advantageous to clinicians who are less proficient in assessing melanocytic lesions. Using visual inspection alone, more experienced respondents made biopsy/referral decisions with greater sensitivity but similar specificity to those with less experience. With the incorporation of EIS scores, the sensitivity and specificity of respondents’ clinical decision-making rose to comparable levels across all experience groups, providing further indication of EIS’s particular value to clinicians who are less proficient in PSL evaluation.
Continue to: This technology holds the potential...
This technology holds the potential to be seamlessly implemented into primary care practice, given that dermatology expertise training is not required to use the EIS device; this could allow for EIS measurement of lesions to be delegated to office staff (eg, nurses, medical assistants).16 Future studies are needed to assess EIS use among PCPs in a real-world setting, where factors such as its application on nonmelanocytic lesions (eg, seborrheic keratoses) and its pairing with patient historical data could produce varying results.
Limitations. While revealing, this study had its limitations. Respondents did not have access to additional pertinent clinical information, such as patients’ histories and risk factors. Clinical decisions in this survey were made based on digital images rather than in vivo examination. This may not represent a real-life evaluation; there is the potential for minimization of the true consequences of a missed melanoma or unnecessary biopsy in the minds of participants, and this does not factor in the operation of the actual EIS device. The Hawthorne effect may also have influenced PCPs’ diagnostic selections. Also, the limited sample size constitutes another limitation.
Of note, in this survey format, respondents rated their inclination to biopsy or refer each lesion from 1 to 5. For statistical analyses, lesions rated 1 to 3 were considered as not biopsied/referred and those rated 4 to 5 as biopsied/referred. The sensitivity and specificity values observed, for both visual examination and concurrent visual and EIS evaluation, are therefore based on this classification system of participants’ provided ratings. It is conceivable that differing sensitivity and specificity values might have been detected if clinicians were instead given a binary choice for referral/biopsy decisions.
CONCLUSIONS
Among PCPs tasked with evaluating melanocytic lesions, the incorporation of EIS data into clinical decision-making in this study significantly increased the sensitivity, specificity, and overall diagnostic accuracy of biopsy or referral decisions for melanomas and SDN compared with visual inspection alone. Overall, the results of this preliminary study suggest that diagnostic accuracy for PSLs by PCPs may be improved with the adjunctive use of EIS with visual inspection. This would ultimately improve patient care and reduce the morbidity and mortality of a melanoma diagnosis.
CORRESPONDENCE
Jonathan Ungar, MD, Kimberly and Eric J. Waldman Department of Dermatology, Icahn School of Medicine at Mount Sinai, 5 East 98th Street, 5th Floor, New York, NY 10029; jonathan.ungar@mountsinai.org
ABSTRACT
Background: We sought to examine whether electrical impedance spectroscopy (EIS), a diagnostic tool approved by the US Food and Drug Administration for the evaluation of pigmented skin lesions (PSLs), is beneficial to primary care providers (PCPs) by comparing the accuracy of PCPs’ management decisions for PSLs based on visual examination alone with those based on concurrent visual and EIS evaluation.
Methods: Physicians and nurse practitioners (NPs) participated in an anonymous online survey in which they viewed clinical images of PSLs and were asked to make 2 clinical decisions before and after being provided an EIS score that indicated the likelihood that the lesion was a melanoma. They were asked (1) if they would biopsy the lesion/refer the patient out and (2) what they expected the pathology results would show.
Results: Forty-four physicians and 17 NPs participated, making clinical decisions for 1354 presented lesions. Overall, with the addition of EIS to visual inspection of clinical images, the sensitivity of biopsy/referral decisions for melanomas and severely dysplastic nevi (SDN) increased from 69.2% to 90.0% (P < .001), while specificity increased from 44.0% to 72.6% (P < .001). Physicians and NPs, regardless of years of experience, each saw significant improvements in sensitivity, specificity, and diagnostic accuracy with the addition of EIS scores.
Conclusions: The incorporation of EIS data into clinical decision-making by PCPs significantly increased the sensitivity and specificity of biopsy/referral decisions for melanomas and SDN and overall diagnostic accuracy compared with visual inspection alone. The results of this study suggest that diagnostic accuracy for PSLs by PCPs may be improved with adjunctive use of EIS with visual inspection.
Primary care providers (PCPs) are often the first line of defense in detecting skin cancers. For patients with concerning skin lesions, PCPs may choose to perform a biopsy or facilitate access to specialty services (eg, Dermatology). Consequently, PCPs play a critical role in the timely detection of skin cancers, and it is paramount to employ continually improving detection methods, such as the application of technologic advances.1
Differentiating benign nevi from melanoma and severely dysplastic nevi (SDN), both of which warrant excision, poses a unique challenge to clinicians examining pigmented skin lesions (PSLs). PCPs often rely on visual inspection to differentiate benign skin lesions from malignant skin cancers. In some primary care practices, dermoscopy, which involves using a handheld device to evaluate lesions with polarized light and magnification, is used to improve melanoma detection. However, while visual inspection and dermoscopy are valid, effective techniques for the diagnosis of melanocytic lesions, in many instances they still can lead to missed cancers or unnecessary biopsies and specialty referrals. Adjunctive use of dermoscopy with visual inspection has been shown to increase the probability of skin cancer detection, but it fails to achieve a near-100% success rate.2 Furthermore, dermoscopy is heavily user-dependent, requiring significant training and experience for appropriate use.3
Another option is an electrical impedance spectroscopy (EIS) device (Nevisense, Scibase, Stockholm, Sweden), which has been approved by the US Food and Drug Administration (FDA) to assist in the detection of melanoma and differentiation from benign PSLs.4 EIS is a noninvasive, rapidly applied technology designed to accompany the visual examination of melanocytic lesions in office, with or without dermoscopy. Still relatively new, the technology is employed today by many dermatologists, increasing diagnostic accuracy for PSLs.5 The lightweight and portable instrument features a handheld probe, which is held against a lesion to obtain a reading. EIS uses a low-voltage electrode to apply a harmless electrical current to the skin at various frequencies.6 As benign and malignant tissues vary in cell shape, size, and composition, EIS distinguishes differential electrical resistance of the tissue to aid in diagnosis.7
Continue to: EIS provides high-sensitivity...
EIS provides high-sensitivity melanoma diagnosis vs histopathologic confirmation from biopsies, with 1 study showing a 96.6% sensitivity rating, detecting 256 of 265 melanomas.4 The EIS device, by measuring differences in electrical resistance between benign and cancerous cells, outputs a simple integer score ranging from 0 to 10 associated with the likelihood of the lesion being a melanoma.8 Based on data from the Nevisense pivotal trial,4 Nevisense reports that scores of 0 to 3 carry a negative predictive value of 99% for melanoma, whereas scores of 4 to 10 signify increasingly greater positive predictive values from 7% to 61%.
We aimed to assess whether EIS may be beneficial to PCPs by comparing the accuracy of clinical decision-making for PSLs based on visual examination alone with that based on concurrent visual and EIS evaluation.
METHODS
A questionnaire was distributed via email to 142 clinicians at clinics affiliated with either of 2 organizations delivering care to the New York City area through a network of community health centers: the Institute for Family Health (IFH) and the Community Healthcare Network (CHN). Of these recipients, 72 were affiliated with IFH across 27 community health centers and 70 were affiliated with CHN across 14 community health centers. Recipients were physicians and nurse practitioners (NPs) practicing at primary health care facilities.
Survey instrument. The first section of the survey instrument (APPENDIX) solicited demographic information and explained how to apply the EIS scores for diagnostic decision-making. The second featured images of 12 randomly selected, histologically confirmed, and EIS-evaluated PSLs from a previously published prospective blinded trial of 2416 lesions.4 The Institutional Review Board of the Icahn School of Medicine at Mount Sinai reviewed and approved the study and survey instrument.
Clinical images of these lesions, comprising 4 melanocytic nevi, 4 dysplastic nevi (including 3 mild-moderately dysplastic and 1 severely dysplastic nevus), and 4 melanomas
Continue to: Analysis
Analysis. A biopsy or referral rating of 4 or 5 was considered a decision to biopsy or refer (ie, a diagnostic decision consistent with melanoma or SDN warranting excision), whereas a selection of 1 to 3 was considered a decision not to biopsy or refer (ie, a diagnostic decision consistent with a benign PSL). The sensitivity and specificity of biopsy/referral decisions for melanomas and SDN, the proportion of missed melanomas and SDN, and the proportion of biopsy/referral decisions for benign lesions were separately determined for visual inspection alone and visual inspection with EIS score. Similarly, diagnostic accuracy was calculated for these clinical scenarios. These metrics were further stratified among different subsets of the respondent population. Differences in sensitivity, specificity, biopsy/referral decision proportions, and diagnostic accuracy were calculated using McNemar’s test for paired proportions.
RESULTS
Sixty-one respondents, comprising 44 physicians and 17 NPs, completed the survey, yielding a response rate of 43% (TABLE 1). In total, 1354 clinical decisions (677 based on visual inspection alone and 677 based on visual inspection plus EIS) were made. A biopsy/referral decision was made after assessing 416 of 677 cases (61%) with visual inspection alone and 360 of 677 cases (53%) when relying on visual inspection plus EIS. None of the respondents reported any prior experience with EIS.
When incorporating EIS scores, respondents’ mean sensitivity for melanomas and SDN increased from 69.2% to 90.0% (P < .001) and specificity from 44.0% to 72.6% (P < .001; TABLE 2). At baseline, physicians demonstrated a sensitivity and specificity of 74.6% and 46.5%, respectively, while NPs demonstrated a sensitivity and specificity of 56.1% and 37.9%, respectively.
All respondent subgroups stratified by occupation and years of experience saw significant increases in both sensitivity and specificity upon the incorporation of EIS scores, with NPs seeing a greater increase in sensitivity (56.1% vs 85.4%; P < .001) and specificity (37.9% vs 69.0%; P < .001) than physicians (sensitivity: 74.6% vs 91.9%; P < .001; specificity: 46.5% vs 74.1%; P < .001). The only difference in diagnostic performance based on years of experience was a greater pre-EIS sensitivity by clinicians who had been in practice for ≥ 15 years, compared with those in practice for shorter periods (TABLE 2).
Diagnostic accuracy increased significantly from 48% when based on visual inspection alone to 73% with the addition of EIS scores (P < .001; TABLE 3). Physicians and NPs each significantly increased their diagnostic accuracy upon the incorporation of EIS, with NPs exhibiting the greatest increase (from 36.9% to 65.7%; P < .001). PCPs with 6 to 14 years of experience saw the greatest increase in diagnostic accuracy when adding EIS (45.9% vs 76.4%; P < .001). Overall, the addition of EIS scores resulted in 58 fewer missed melanomas and SDN and 114 fewer benign referrals or biopsies (TABLE 4).
Continue to: DISCUSSION
DISCUSSION
Primary care evaluation plays a significant role in the diagnosis and management of PSLs, ultimately shaping outcomes for patients with melanoma. Improved accuracy of PSL classification could yield greater sensitivity for the diagnosis of melanomas and high-risk melanocytic lesions at earlier stages, while also reducing the number of unnecessary biopsies and referrals—leading to decreased patient morbidity and mortality and reduced health care spending.9
Diagnostic tools are valuable insofar as they can improve accuracy and positively impact clinical management and patient outcomes.10 In this case, increased sensitivity reduced missed melanoma diagnoses, while increased specificity avoided the additional costs and patient toll associated with a biopsy or referral for a benign lesion.
Dermoscopy has been shown to improve the sensitivity and specificity of PSL diagnosis compared with visual inspection alone; however, without substantial training and experience, accuracy with dermoscopy can be no better than examination with the naked eye.3,11,12 The dropout rates are high for training PCPs in its use, given that several months of training may be needed for competent use.13,14 To improve the clinical management of PSLs broadly in primary care, a need exists for easy-to-use adjunctive tools that increase diagnostic accuracy.15
In this study, with only a brief explanation of how to interpret EIS scores, clinicians without any prior experience using EIS demonstrated significantly improved accuracy in deciding appropriate management and classifying melanocytic lesions with the addition of EIS to visual inspection. These improvements, seen in clinicians of varying training and experience, suggest that the learning curve of EIS may not be as steep as that of dermoscopy.
The greater baseline sensitivity, specificity, and diagnostic accuracy of physicians’ clinical decision-making compared with NPs before the incorporation of EIS in the study may be a product of comparatively more extensive medical training. In addition, EIS yielded a greater benefit to NPs than to physicians, with greater increases in sensitivity and specificity noted. This suggests that the use of EIS is particularly advantageous to clinicians who are less proficient in assessing melanocytic lesions. Using visual inspection alone, more experienced respondents made biopsy/referral decisions with greater sensitivity but similar specificity to those with less experience. With the incorporation of EIS scores, the sensitivity and specificity of respondents’ clinical decision-making rose to comparable levels across all experience groups, providing further indication of EIS’s particular value to clinicians who are less proficient in PSL evaluation.
Continue to: This technology holds the potential...
This technology holds the potential to be seamlessly implemented into primary care practice, given that dermatology expertise training is not required to use the EIS device; this could allow for EIS measurement of lesions to be delegated to office staff (eg, nurses, medical assistants).16 Future studies are needed to assess EIS use among PCPs in a real-world setting, where factors such as its application on nonmelanocytic lesions (eg, seborrheic keratoses) and its pairing with patient historical data could produce varying results.
Limitations. While revealing, this study had its limitations. Respondents did not have access to additional pertinent clinical information, such as patients’ histories and risk factors. Clinical decisions in this survey were made based on digital images rather than in vivo examination. This may not represent a real-life evaluation; there is the potential for minimization of the true consequences of a missed melanoma or unnecessary biopsy in the minds of participants, and this does not factor in the operation of the actual EIS device. The Hawthorne effect may also have influenced PCPs’ diagnostic selections. Also, the limited sample size constitutes another limitation.
Of note, in this survey format, respondents rated their inclination to biopsy or refer each lesion from 1 to 5. For statistical analyses, lesions rated 1 to 3 were considered as not biopsied/referred and those rated 4 to 5 as biopsied/referred. The sensitivity and specificity values observed, for both visual examination and concurrent visual and EIS evaluation, are therefore based on this classification system of participants’ provided ratings. It is conceivable that differing sensitivity and specificity values might have been detected if clinicians were instead given a binary choice for referral/biopsy decisions.
CONCLUSIONS
Among PCPs tasked with evaluating melanocytic lesions, the incorporation of EIS data into clinical decision-making in this study significantly increased the sensitivity, specificity, and overall diagnostic accuracy of biopsy or referral decisions for melanomas and SDN compared with visual inspection alone. Overall, the results of this preliminary study suggest that diagnostic accuracy for PSLs by PCPs may be improved with the adjunctive use of EIS with visual inspection. This would ultimately improve patient care and reduce the morbidity and mortality of a melanoma diagnosis.
CORRESPONDENCE
Jonathan Ungar, MD, Kimberly and Eric J. Waldman Department of Dermatology, Icahn School of Medicine at Mount Sinai, 5 East 98th Street, 5th Floor, New York, NY 10029; jonathan.ungar@mountsinai.org
1. Goetsch NJ, Hoehns JD, Sutherland JE, et al. Assessment of postgraduate skin lesion education among Iowa family physicians. SAGE Open Med. 2017;5:2050312117691392. doi: 10.1177/2050312117691392
2. Dinnes J, Deeks JJ, Chuchu N, et al. Dermoscopy, with and without visual inspection, for diagnosing melanoma in adults. Cochrane Database Syst Rev. 2018;12:CD011902. doi: 10.1002/14651858.CD011902.pub2
3. Jones OT, Jurascheck LC, van Melle MA, et al. Dermoscopy for melanoma detection and triage in primary care: a systematic review. BMJ Open. 2019;9:e027529. doi: 10.1136/bmjopen-2018-027529
4. Malvehy J, Hauschild A, Curiel-Lewandrowski C, et al. Clinical performance of the Nevisense system in cutaneous melanoma detection: an international, multicentre, prospective and blinded clinical trial on efficacy and safety. Br J Dermatol. 2014;171:1099-1107. doi: 10.1111/bjd.13121
5. Svoboda RM, Prado G, Mirsky RS, et al. Assessment of clinician accuracy for diagnosing melanoma on the basis of electrical impedance spectroscopy score plus morphology versus lesion morphology alone. J Am Acad Dermatol. 2019;80:285-287. doi: 10.1016/j.jaad.2018.08.048
6. Mohr P, Birgersson U, Berking C, et al. Electrical impedance spectroscopy as a potential adjunct diagnostic tool for cutaneous melanoma. Skin Res Technol. 2013;19:75-83. doi: 10.1111/srt.12008
7. Rocha L, Menzies SW, Lo S, et al. Analysis of an electrical impedance spectroscopy system in short-term digital dermoscopy imaging of melanocytic lesions. Br J Dermatol. 2017;177:1432-1438. doi: 10.1111/bjd.15595
8. Litchman GH, Teplitz RW, Marson JW, et al. Impact of electrical impedance spectroscopy on dermatologists’ number needed to biopsy metric and biopsy decisions for pigmented skin lesions. J Am Acad Dermatol. 2021;85:976-979. doi: 10.1016/j.jaad.2020.09.011
9. Greenwood-Lee J, Jewett L, Woodhouse L, et al. A categorisation of problems and solutions to improve patient referrals from primary to specialty care. BMC Health Serv Res. 2018;18:1-16. doi: 10.1186/s12913-018-3745-y
10. Bossuyt PM, Reitsma JB, Linnet K, et al. Beyond diagnostic accuracy: the clinical utility of diagnostic tests. Clin Chem. 2012;58:1636-1643. doi: 10.1373/clinchem.2012.182576
11. Argenziano G, Cerroni L, Zalaudek I , et al. Accuracy in melanoma detection: a 10-year multicenter survey. J Am Acad Dermatol. 2012;67:54-59. doi: 10.1016/j.jaad.2011.07.019
12. Menzies SW, Vestergaard ME, Macaskill P, et al. Dermoscopy compared with naked eye examination for the diagnosis of primary melanoma: a meta-analysis of studies performed in a clinical setting. Br J Dermatol. 2008;159:669-676. doi: 10.1111/j.1365-2133.2008.08713.x
13. Menzies SW, Emery J, Staples M, et al. Impact of dermoscopy and short-term sequential digital dermoscopy imaging for the management of pigmented lesions in primary care: a sequential intervention trial. Br J Dermatol. 2009;161:1270-1277. doi: 10.1111/j.1365-2133.2009.09374.x
14. Noor O, Nanda A, Rao BK. A dermoscopy survey to assess who is using it and why it is or is not being used. Int J Dermatol. 2009;48:951-952. doi: 10.1111/j.1365-4632.2009.04095.x
15. Weigl BH, Boyle DS, de los Santos T, et al. Simplicity of use: a critical feature for widespread adoption of diagnostic technologies in low-resource settings. Expert Rev Med Devices. 2009;6:461-464. doi: 10.1586/erd.09.31
16. Sarac E, Meiwes A, Eigentler T, et al. Diagnostic accuracy of electrical impedance spectroscopy in non-melanoma skin cancer. Acta Derm Venereol. 2020;100:adv00328. doi: 10.2340/00015555-3689
1. Goetsch NJ, Hoehns JD, Sutherland JE, et al. Assessment of postgraduate skin lesion education among Iowa family physicians. SAGE Open Med. 2017;5:2050312117691392. doi: 10.1177/2050312117691392
2. Dinnes J, Deeks JJ, Chuchu N, et al. Dermoscopy, with and without visual inspection, for diagnosing melanoma in adults. Cochrane Database Syst Rev. 2018;12:CD011902. doi: 10.1002/14651858.CD011902.pub2
3. Jones OT, Jurascheck LC, van Melle MA, et al. Dermoscopy for melanoma detection and triage in primary care: a systematic review. BMJ Open. 2019;9:e027529. doi: 10.1136/bmjopen-2018-027529
4. Malvehy J, Hauschild A, Curiel-Lewandrowski C, et al. Clinical performance of the Nevisense system in cutaneous melanoma detection: an international, multicentre, prospective and blinded clinical trial on efficacy and safety. Br J Dermatol. 2014;171:1099-1107. doi: 10.1111/bjd.13121
5. Svoboda RM, Prado G, Mirsky RS, et al. Assessment of clinician accuracy for diagnosing melanoma on the basis of electrical impedance spectroscopy score plus morphology versus lesion morphology alone. J Am Acad Dermatol. 2019;80:285-287. doi: 10.1016/j.jaad.2018.08.048
6. Mohr P, Birgersson U, Berking C, et al. Electrical impedance spectroscopy as a potential adjunct diagnostic tool for cutaneous melanoma. Skin Res Technol. 2013;19:75-83. doi: 10.1111/srt.12008
7. Rocha L, Menzies SW, Lo S, et al. Analysis of an electrical impedance spectroscopy system in short-term digital dermoscopy imaging of melanocytic lesions. Br J Dermatol. 2017;177:1432-1438. doi: 10.1111/bjd.15595
8. Litchman GH, Teplitz RW, Marson JW, et al. Impact of electrical impedance spectroscopy on dermatologists’ number needed to biopsy metric and biopsy decisions for pigmented skin lesions. J Am Acad Dermatol. 2021;85:976-979. doi: 10.1016/j.jaad.2020.09.011
9. Greenwood-Lee J, Jewett L, Woodhouse L, et al. A categorisation of problems and solutions to improve patient referrals from primary to specialty care. BMC Health Serv Res. 2018;18:1-16. doi: 10.1186/s12913-018-3745-y
10. Bossuyt PM, Reitsma JB, Linnet K, et al. Beyond diagnostic accuracy: the clinical utility of diagnostic tests. Clin Chem. 2012;58:1636-1643. doi: 10.1373/clinchem.2012.182576
11. Argenziano G, Cerroni L, Zalaudek I , et al. Accuracy in melanoma detection: a 10-year multicenter survey. J Am Acad Dermatol. 2012;67:54-59. doi: 10.1016/j.jaad.2011.07.019
12. Menzies SW, Vestergaard ME, Macaskill P, et al. Dermoscopy compared with naked eye examination for the diagnosis of primary melanoma: a meta-analysis of studies performed in a clinical setting. Br J Dermatol. 2008;159:669-676. doi: 10.1111/j.1365-2133.2008.08713.x
13. Menzies SW, Emery J, Staples M, et al. Impact of dermoscopy and short-term sequential digital dermoscopy imaging for the management of pigmented lesions in primary care: a sequential intervention trial. Br J Dermatol. 2009;161:1270-1277. doi: 10.1111/j.1365-2133.2009.09374.x
14. Noor O, Nanda A, Rao BK. A dermoscopy survey to assess who is using it and why it is or is not being used. Int J Dermatol. 2009;48:951-952. doi: 10.1111/j.1365-4632.2009.04095.x
15. Weigl BH, Boyle DS, de los Santos T, et al. Simplicity of use: a critical feature for widespread adoption of diagnostic technologies in low-resource settings. Expert Rev Med Devices. 2009;6:461-464. doi: 10.1586/erd.09.31
16. Sarac E, Meiwes A, Eigentler T, et al. Diagnostic accuracy of electrical impedance spectroscopy in non-melanoma skin cancer. Acta Derm Venereol. 2020;100:adv00328. doi: 10.2340/00015555-3689
Is the Altman Rule a proxy for glycemic load?
ABSTRACT
Background: The Altman Rule, a simple tool for consumers seeking to make healthier packaged food choices at the point of sale, applies to packaged carbohydrates. According to the Altman Rule, a food is a healthier option if it has at least 3 g of fiber per serving and the grams of fiber plus the grams of protein exceed the grams of sugar per serving. This study sought to evaluate whether the Altman Rule is a valid proxy for glycemic load (GL).
Methods: We compared the binary outcome of whether a food item meets the Altman Rule with the GL of all foods categorized as cereals, chips, crackers, and granola bars in the Nutrition Data System for Research Database (University of Minnesota, Version 2010). We examined the percentage of foods in low-, medium-, and high-GL categories that met the Altman Rule.
Results: There were 1235 foods (342 cereals, 305 chips, 379 crackers, and 209 granola bars) in this analysis. There was a significant relationship between the GL of foods and the Altman Rule (P < .001) in that most low-GL (68%), almost half of medium-GL (48%), and very few high-GL (7%) foods met the criteria of the rule.
Conclusions: The Altman Rule is a reasonable proxy for GL and can be a useful and accessible tool for consumers interested in buying healthier packaged carbohydrate foods.
Nutrition can be complicated for consumers interested in making healthier choices at the grocery store. Consumers may have difficulty identifying more nutritious options, especially when food labels are adorned with claims such as “Good Source of Fiber” or “Heart Healthy.”1 In addition, when reading food labels, consumers may find it difficult to decipher which data to prioritize when carbohydrates, total sugars, added sugars, total dietary fiber, soluble fiber, and insoluble fiber are all listed.
The concept of glycemic load (GL) is an important consideration, especially for people with diabetes. GL approximates the blood sugar response to different foods. A food with a high GL is digested quickly, and its carbohydrates are taken into the bloodstream rapidly. This leads to a spike and subsequent drop in blood sugars, which can cause symptoms of hyperglycemia and hypoglycemia in a person with diabetes.2,3 Despite its usefulness, GL may be too complicated for a consumer to understand, and it does not appear anywhere on the food label. Since GL is calculated using pooled blood sugar response from individuals after the ingestion of the particular food, estimation of the GL is not intuitable.4
Point-of-sale tools. People seeking to lose weight, control diabetes, improve dyslipidemia and/or blood pressure, and/or decrease their risk for heart disease may benefit from point-of-sale tools such as the Altman Rule, which simplifies and encourages the selection of more nutritious foods.1 Other tools—such as Guiding Stars (https://guidingstars.com), NuVal (www.nuval.com), and different variations of traffic lights—have been created to help consumers make more informed and healthier food choices.5-8 However, Guiding Stars and NuVal are based on complicated algorithms that are not entirely transparent and not accessible to the average consumer.6,7 Evaluations of these nutrition tools indicate that consumers tend to underrate the healthiness of some foods, such as raw almonds and salmon, and overrate the healthiness of others, such as fruit punch and diet soda, when using traffic light systems.6 Furthermore, these nutrition tools are not available in many supermarkets. Previous research suggests that the use of point-of-sale nutrition apps decreases with the time and effort involved in using an app.9
Continue to: The Altman Rule
The Altman Rule was developed by a family physician (author WA) to provide a more accessible tool for people interested in choosing healthier prepackaged carbohydrate foods while shopping. Since the user does not need to have a smartphone, and they are not required to download or understand an app for each purchase, the Altman Rule may be more usable compared with more complicated alternatives.
The Altman Rule can be used with nutrition labels that feature serving information and calories in enlarged and bold type, in compliance with the most recent US Food and Drug Administration (FDA) guideline from 2016. Many foods with high fiber also have high amounts of sugar, so the criteria of the Altman Rule includes a 2-step process requiring (1) a minimum of 3 g of total dietary fiber per serving and (2) the sum of the grams of fiber plus the grams of protein per serving to be greater than the total grams of sugar (not grams of added sugar or grams of carbohydrate) per serving (FIGURE 1A). Unlike the relatively complicated formula related to GL, this 2-part rule can be applied in seconds while shopping (FIGURE 1B).
The rule is intended only to be used for packaged carbohydrate products, such as bread, muffins, bagels, pasta, rice, oatmeal, cereals, snack bars, chips, and crackers. It does not apply to whole foods, such as meat, dairy, fruits, or vegetables. These foods are excluded to prevent any consumer confusion related to the nutritional content of whole foods (eg, an apple may have more sugar than fiber and protein combined, but it is still a nutritious option).
This study aimed to determine if the Altman Rule is a reasonable proxy for the more complicated concept of GL. We calculated the relationship between the GL of commercially available packaged carbohydrate foods and whether those foods met the Altman Rule.
METHODS
The Altman Rule was tested by comparing the binary outcome of the rule (meets/does not meet) with data on all foods categorized as cereals, chips, crackers, and granola bars in the Nutrition Data System for Research (NDSR) Database (University of Minnesota, Version 2010).
Continue to: To account for differences...
To account for differences in serving size, we used the standard of 50 g for each product as 1 serving. We used 50 g (about 1.7 oz) to help compare the different foods and between foods within the same group. Additionally, 50 g is close to 1 serving for most foods in these groups; it is about the size of a typical granola bar, three-quarters to 2 cups of cereal, 10 to 12 crackers, and 15 to 25 chips. We determined the GL for each product by multiplying the number of available carbohydrates (total carbohydrate – dietary fiber) by the product’s glycemic index/100. In general, GL is categorized as low (≤ 10), medium (11-19), or high (≥ 20).
We applied the Altman Rule to categorize each product as meeting or not meeting the rule. We compared the proportion of foods meeting the Altman Rule, stratified by GL and by specific foods, and used chi-square to determine if differences were statistically significant. These data were collected and analyzed in the summer of 2019.
RESULTS
There were 1235 foods (342 breakfast cereals, 305 chips, 379 crackers, and 209 granola bars) used for this analysis. There is a significant relationship between the GL of foods and the Altman Rule in that most low-GL (68%), almost half of medium-GL (48%), and only a few high-GL foods (7%) met the rule (P < .001) (TABLE 1). There was also a significant relationship between “meeting the Altman Rule” and GL within each food type (P < .001) (TABLE 2).
The medium-GL foods were the second largest category of foods we calculated; thus we further broke them into binary categories of
Foods that met the rule were more likely to be low GL and foods that did not pass the rule were more likely high GL. Within the medium-GL category, foods that met the rule were more likely to be low-medium GL.
Continue to: The findings within food categories...
The findings within food categories showed that very few cereals, chips, crackers, and granola bars were low GL. For every food category, except granola bars, far more low-GL foods met the Altman Rule than those that did not. At the same time, very few high-GL foods met the Altman Rule. The category with the most individual high-GL food items meeting the Altman Rule was cereal. This was also the subcategory with the largest percentage of high-GL food items meeting the Altman Rule. Thirty-nine cereals that were high GL met the rule, but more than 4 times as many high-GL cereals did not (n = 190).
DISCUSSION
Marketing and nutrition messaging create consumer confusion that makes it challenging to identify packaged food items that are more nutrient dense. The Altman Rule simplifies food choices that have become unnecessarily complex. Our findings suggest this 2-step rule is a reasonable proxy for the more complicated and less accessible GL for packaged carbohydrates, such as cereals, chips, crackers, and snack bars. Foods that meet the rule are likely low or low-medium GL and thus are foods that are likely to be healthier choices.
Of note, only 9% of chips (n = 27) passed the Altman Rule, likely due to their low dietary fiber content, which was typical of chips. If a food item does not have at least 3 grams of total dietary fiber per serving, it does not pass the Altman Rule, regardless of how much protein or sugar is in the product. This may be considered a strength or a weakness of the Altman Rule. Few nutrition-dense foods are low in fiber, but some foods could be nutritious but do not meet the Altman Rule due to having < 3 g of fiber.
With the high prevalence of chronic diseases such as hypertension, diabetes, hyperlipidemia, and cardiovascular disease, it is essential to help consumers prevent chronic disease altogether or manage their chronic disease by providing tools to identify healthier food choices. The tool also has a place in clinical medicine for use by physicians and other health care professionals. Research shows that physicians find both time and lack of knowledge/resources to be a barrier to providing nutritional counseling to patients.10 Since the Altman Rule can be shared and explained with very little time and without extensive nutritional knowledge, it meets these needs.
Limitations
Glycemic load. We acknowledge that the Altman Rule is not foolproof and that assessing this rule based on GL has some limitations. GL is not a perfect or comprehensive way to measure the nutritional value of a food. For example, fruits such as watermelon and grapes are nutritionally dense. However, they contain high amounts of natural sugars—and as such, their GL is relatively high, which could lead a consumer to perceive them as unhealthy. Nevertheless, GL is both a useful and accepted tool and a reasonable way to assess the validity of the rule, specifically when assessing packaged carbohydrates. The simplicity of the Altman Rule and its relationship with GL makes it such that consumers are more likely to make a healthier food choice using it.9
Continue to: Specificity and sensitivity
Specificity and sensitivity. There are other limitations to the Altman Rule, given that a small number of high-GL foods meet the rule. For example, some granola bars had high dietary protein, which offset a high sugar content just enough to pass the rule despite a higher GL. As such, concluding that a snack bar is a healthier choice because it meets the Altman Rule when it has high amounts of sugar may not be appropriate. This limitation could be considered a lack of specificity (the rule includes food it ought not to include). Another limitation to consider would be a lack of sensitivity, given that only 68% of low-GL foods passed the Altman Rule. Since GL is associated with carbohydrate content, foods with a low carbohydrate count often have little to no fiber and thus would fall into the category of foods that did not meet the Altman Rule but had low GL. In this case, however, the low amount of fiber may render the Altman Rule a better indicator of a healthier food choice than the GL.
Hidden sugars. Foods with sugar alcohols and artificial sweeteners may be as deleterious as caloric alternatives while not being accounted for when reporting the grams of sugar per serving on the nutrition label.7 This may represent an exception to the Altman Rule, as foods that are not healthier choices may pass the rule because the sugar content on the nutrition label is, in a sense, artificially lowered. Future research may investigate the hypothesis that these foods are nutritionally inferior despite meeting the Altman Rule.
The sample. Our study also was limited to working only with foods that were included in the NDSR database up to 2010. This limitation is mitigated by the fact that the sample size was large (> 1000 packaged food items were included in our analyses). The study also could be limited by the food categories that were analyzed; food categories such as bread, rice, pasta, and bagels were not included.
The objective of this research was to investigate the relationship between GL and the Altman Rule, rather than to conduct an exhaustive analysis of the Altman Rule for every possible food category. Studying the relationship between the Altman Rule and GL in other categories of food is an objective for future research. The data so far support a relationship between these entities. The likelihood of the nutrition facts of foods changing without the GL changing (or vice versa) is very low. As such, the Altman Rule still seems to be a reasonable proxy of GL.
CONCLUSIONS
Research indicates that point-of-sale tools, such as Guiding Stars, NuVal, and other stoplight tools, can successfully alter consumers’ behaviors.9 These tools can be helpful but are not available in many supermarkets. Despite the limitations, the Altman Rule is a useful decision aid that is accessible to all consumers no matter where they live or shop and is easy to use and remember.
The Altman rule can be used in clinical practice by health care professionals, such as physicians, nurse practitioners, physician assistants, dietitians, and health coaches. It also has the potential to be used in commercial settings, such as grocery stores, to help consumers easily identify healthier convenience foods. This has public health implications, as the rule can both empower consumers and potentially incentivize food manufacturers to upgrade their products nutritionally.
Additional research would be useful to evaluate consumers’ preferences and perceptions about how user-friendly the Altman Rule is at the point of sale with packaged carbohydrate foods. This would help to further understand how the use of information on food packaging can motivate healthier decisions—thereby helping to alleviate the burden of chronic disease.
CORRESPONDENCE
Kimberly R. Dong, DrPH, MS, RDN, Tufts University School of Medicine, Department of Public Health and Community Medicine, 136 Harrison Avenue, MV Building, Boston, MA 02111; kimberly.dong@tufts.edu
1. Hersey JC, Wohlgenant KC, Arsenault JE, et al. Effects of front-of-package and shelf nutrition labeling systems on consumers. Nutr Rev. 2013;71:1-14. doi: 10.1111/nure.12000
2. Jenkins DJA, Dehghan M, Mente A, et al. Glycemic index, glycemic load, and cardiovascular disease and mortality. N Engl J Med. 2021;384:1312-1322. doi: 10.1056/NEJMoa2007123
3. Brand-Miller J, Hayne S, Petocz P, et al. Low–glycemic index diets in the management of diabetes. Diabetes Care. 2003;26:2261-2267. doi: 10.2337/diacare.26.8.2261
4. Matthan NR, Ausman LM, Meng H, et al. Estimating the reliability of glycemic index values and potential sources of methodological and biological variability. Am J Clin Nutr. 2016;104:1004-1013. doi: 10.3945/ajcn.116.137208
5. Sonnenberg L, Gelsomin E, Levy DE, et al. A traffic light food labeling intervention increases consumer awareness of health and healthy choices at the point-of-purchase. Prev Med. 2013;57:253-257. doi: 10.1016/j.ypmed.2013.07.001
6. Savoie N, Barlow K, Harvey KL, et al. Consumer perceptions of front-of-package labelling systems and healthiness of foods. Can J Public Health. 2013;104:e359-e363. doi: 10.17269/cjph.104.4027
7. Fischer LM, Sutherland LA, Kaley LA, et al. Development and implementation of the Guiding Stars nutrition guidance program. Am J Health Promot. 2011;26:e55-e63. doi: 10.4278/ajhp.100709-QUAL-238
8. Maubach N, Hoek J, Mather D. Interpretive front-of-pack nutrition labels. Comparing competing recommendations. Appetite. 2014;82:67-77. doi: 10.1016/j.appet.2014.07.006
9. Chan J, McMahon E, Brimblecombe J. Point‐of‐sale nutrition information interventions in food retail stores to promote healthier food purchase and intake: a systematic review. Obes Rev. 2021;22. doi: 10.1111/obr.13311
10. Mathioudakis N, Bashura H, Boyér L, et al. Development, implementation, and evaluation of a physician-targeted inpatient glycemic management curriculum. J Med Educ Curric Dev. 2019;6:238212051986134. doi: 10.1177/2382120519861342
ABSTRACT
Background: The Altman Rule, a simple tool for consumers seeking to make healthier packaged food choices at the point of sale, applies to packaged carbohydrates. According to the Altman Rule, a food is a healthier option if it has at least 3 g of fiber per serving and the grams of fiber plus the grams of protein exceed the grams of sugar per serving. This study sought to evaluate whether the Altman Rule is a valid proxy for glycemic load (GL).
Methods: We compared the binary outcome of whether a food item meets the Altman Rule with the GL of all foods categorized as cereals, chips, crackers, and granola bars in the Nutrition Data System for Research Database (University of Minnesota, Version 2010). We examined the percentage of foods in low-, medium-, and high-GL categories that met the Altman Rule.
Results: There were 1235 foods (342 cereals, 305 chips, 379 crackers, and 209 granola bars) in this analysis. There was a significant relationship between the GL of foods and the Altman Rule (P < .001) in that most low-GL (68%), almost half of medium-GL (48%), and very few high-GL (7%) foods met the criteria of the rule.
Conclusions: The Altman Rule is a reasonable proxy for GL and can be a useful and accessible tool for consumers interested in buying healthier packaged carbohydrate foods.
Nutrition can be complicated for consumers interested in making healthier choices at the grocery store. Consumers may have difficulty identifying more nutritious options, especially when food labels are adorned with claims such as “Good Source of Fiber” or “Heart Healthy.”1 In addition, when reading food labels, consumers may find it difficult to decipher which data to prioritize when carbohydrates, total sugars, added sugars, total dietary fiber, soluble fiber, and insoluble fiber are all listed.
The concept of glycemic load (GL) is an important consideration, especially for people with diabetes. GL approximates the blood sugar response to different foods. A food with a high GL is digested quickly, and its carbohydrates are taken into the bloodstream rapidly. This leads to a spike and subsequent drop in blood sugars, which can cause symptoms of hyperglycemia and hypoglycemia in a person with diabetes.2,3 Despite its usefulness, GL may be too complicated for a consumer to understand, and it does not appear anywhere on the food label. Since GL is calculated using pooled blood sugar response from individuals after the ingestion of the particular food, estimation of the GL is not intuitable.4
Point-of-sale tools. People seeking to lose weight, control diabetes, improve dyslipidemia and/or blood pressure, and/or decrease their risk for heart disease may benefit from point-of-sale tools such as the Altman Rule, which simplifies and encourages the selection of more nutritious foods.1 Other tools—such as Guiding Stars (https://guidingstars.com), NuVal (www.nuval.com), and different variations of traffic lights—have been created to help consumers make more informed and healthier food choices.5-8 However, Guiding Stars and NuVal are based on complicated algorithms that are not entirely transparent and not accessible to the average consumer.6,7 Evaluations of these nutrition tools indicate that consumers tend to underrate the healthiness of some foods, such as raw almonds and salmon, and overrate the healthiness of others, such as fruit punch and diet soda, when using traffic light systems.6 Furthermore, these nutrition tools are not available in many supermarkets. Previous research suggests that the use of point-of-sale nutrition apps decreases with the time and effort involved in using an app.9
Continue to: The Altman Rule
The Altman Rule was developed by a family physician (author WA) to provide a more accessible tool for people interested in choosing healthier prepackaged carbohydrate foods while shopping. Since the user does not need to have a smartphone, and they are not required to download or understand an app for each purchase, the Altman Rule may be more usable compared with more complicated alternatives.
The Altman Rule can be used with nutrition labels that feature serving information and calories in enlarged and bold type, in compliance with the most recent US Food and Drug Administration (FDA) guideline from 2016. Many foods with high fiber also have high amounts of sugar, so the criteria of the Altman Rule includes a 2-step process requiring (1) a minimum of 3 g of total dietary fiber per serving and (2) the sum of the grams of fiber plus the grams of protein per serving to be greater than the total grams of sugar (not grams of added sugar or grams of carbohydrate) per serving (FIGURE 1A). Unlike the relatively complicated formula related to GL, this 2-part rule can be applied in seconds while shopping (FIGURE 1B).
The rule is intended only to be used for packaged carbohydrate products, such as bread, muffins, bagels, pasta, rice, oatmeal, cereals, snack bars, chips, and crackers. It does not apply to whole foods, such as meat, dairy, fruits, or vegetables. These foods are excluded to prevent any consumer confusion related to the nutritional content of whole foods (eg, an apple may have more sugar than fiber and protein combined, but it is still a nutritious option).
This study aimed to determine if the Altman Rule is a reasonable proxy for the more complicated concept of GL. We calculated the relationship between the GL of commercially available packaged carbohydrate foods and whether those foods met the Altman Rule.
METHODS
The Altman Rule was tested by comparing the binary outcome of the rule (meets/does not meet) with data on all foods categorized as cereals, chips, crackers, and granola bars in the Nutrition Data System for Research (NDSR) Database (University of Minnesota, Version 2010).
Continue to: To account for differences...
To account for differences in serving size, we used the standard of 50 g for each product as 1 serving. We used 50 g (about 1.7 oz) to help compare the different foods and between foods within the same group. Additionally, 50 g is close to 1 serving for most foods in these groups; it is about the size of a typical granola bar, three-quarters to 2 cups of cereal, 10 to 12 crackers, and 15 to 25 chips. We determined the GL for each product by multiplying the number of available carbohydrates (total carbohydrate – dietary fiber) by the product’s glycemic index/100. In general, GL is categorized as low (≤ 10), medium (11-19), or high (≥ 20).
We applied the Altman Rule to categorize each product as meeting or not meeting the rule. We compared the proportion of foods meeting the Altman Rule, stratified by GL and by specific foods, and used chi-square to determine if differences were statistically significant. These data were collected and analyzed in the summer of 2019.
RESULTS
There were 1235 foods (342 breakfast cereals, 305 chips, 379 crackers, and 209 granola bars) used for this analysis. There is a significant relationship between the GL of foods and the Altman Rule in that most low-GL (68%), almost half of medium-GL (48%), and only a few high-GL foods (7%) met the rule (P < .001) (TABLE 1). There was also a significant relationship between “meeting the Altman Rule” and GL within each food type (P < .001) (TABLE 2).
The medium-GL foods were the second largest category of foods we calculated; thus we further broke them into binary categories of
Foods that met the rule were more likely to be low GL and foods that did not pass the rule were more likely high GL. Within the medium-GL category, foods that met the rule were more likely to be low-medium GL.
Continue to: The findings within food categories...
The findings within food categories showed that very few cereals, chips, crackers, and granola bars were low GL. For every food category, except granola bars, far more low-GL foods met the Altman Rule than those that did not. At the same time, very few high-GL foods met the Altman Rule. The category with the most individual high-GL food items meeting the Altman Rule was cereal. This was also the subcategory with the largest percentage of high-GL food items meeting the Altman Rule. Thirty-nine cereals that were high GL met the rule, but more than 4 times as many high-GL cereals did not (n = 190).
DISCUSSION
Marketing and nutrition messaging create consumer confusion that makes it challenging to identify packaged food items that are more nutrient dense. The Altman Rule simplifies food choices that have become unnecessarily complex. Our findings suggest this 2-step rule is a reasonable proxy for the more complicated and less accessible GL for packaged carbohydrates, such as cereals, chips, crackers, and snack bars. Foods that meet the rule are likely low or low-medium GL and thus are foods that are likely to be healthier choices.
Of note, only 9% of chips (n = 27) passed the Altman Rule, likely due to their low dietary fiber content, which was typical of chips. If a food item does not have at least 3 grams of total dietary fiber per serving, it does not pass the Altman Rule, regardless of how much protein or sugar is in the product. This may be considered a strength or a weakness of the Altman Rule. Few nutrition-dense foods are low in fiber, but some foods could be nutritious but do not meet the Altman Rule due to having < 3 g of fiber.
With the high prevalence of chronic diseases such as hypertension, diabetes, hyperlipidemia, and cardiovascular disease, it is essential to help consumers prevent chronic disease altogether or manage their chronic disease by providing tools to identify healthier food choices. The tool also has a place in clinical medicine for use by physicians and other health care professionals. Research shows that physicians find both time and lack of knowledge/resources to be a barrier to providing nutritional counseling to patients.10 Since the Altman Rule can be shared and explained with very little time and without extensive nutritional knowledge, it meets these needs.
Limitations
Glycemic load. We acknowledge that the Altman Rule is not foolproof and that assessing this rule based on GL has some limitations. GL is not a perfect or comprehensive way to measure the nutritional value of a food. For example, fruits such as watermelon and grapes are nutritionally dense. However, they contain high amounts of natural sugars—and as such, their GL is relatively high, which could lead a consumer to perceive them as unhealthy. Nevertheless, GL is both a useful and accepted tool and a reasonable way to assess the validity of the rule, specifically when assessing packaged carbohydrates. The simplicity of the Altman Rule and its relationship with GL makes it such that consumers are more likely to make a healthier food choice using it.9
Continue to: Specificity and sensitivity
Specificity and sensitivity. There are other limitations to the Altman Rule, given that a small number of high-GL foods meet the rule. For example, some granola bars had high dietary protein, which offset a high sugar content just enough to pass the rule despite a higher GL. As such, concluding that a snack bar is a healthier choice because it meets the Altman Rule when it has high amounts of sugar may not be appropriate. This limitation could be considered a lack of specificity (the rule includes food it ought not to include). Another limitation to consider would be a lack of sensitivity, given that only 68% of low-GL foods passed the Altman Rule. Since GL is associated with carbohydrate content, foods with a low carbohydrate count often have little to no fiber and thus would fall into the category of foods that did not meet the Altman Rule but had low GL. In this case, however, the low amount of fiber may render the Altman Rule a better indicator of a healthier food choice than the GL.
Hidden sugars. Foods with sugar alcohols and artificial sweeteners may be as deleterious as caloric alternatives while not being accounted for when reporting the grams of sugar per serving on the nutrition label.7 This may represent an exception to the Altman Rule, as foods that are not healthier choices may pass the rule because the sugar content on the nutrition label is, in a sense, artificially lowered. Future research may investigate the hypothesis that these foods are nutritionally inferior despite meeting the Altman Rule.
The sample. Our study also was limited to working only with foods that were included in the NDSR database up to 2010. This limitation is mitigated by the fact that the sample size was large (> 1000 packaged food items were included in our analyses). The study also could be limited by the food categories that were analyzed; food categories such as bread, rice, pasta, and bagels were not included.
The objective of this research was to investigate the relationship between GL and the Altman Rule, rather than to conduct an exhaustive analysis of the Altman Rule for every possible food category. Studying the relationship between the Altman Rule and GL in other categories of food is an objective for future research. The data so far support a relationship between these entities. The likelihood of the nutrition facts of foods changing without the GL changing (or vice versa) is very low. As such, the Altman Rule still seems to be a reasonable proxy of GL.
CONCLUSIONS
Research indicates that point-of-sale tools, such as Guiding Stars, NuVal, and other stoplight tools, can successfully alter consumers’ behaviors.9 These tools can be helpful but are not available in many supermarkets. Despite the limitations, the Altman Rule is a useful decision aid that is accessible to all consumers no matter where they live or shop and is easy to use and remember.
The Altman rule can be used in clinical practice by health care professionals, such as physicians, nurse practitioners, physician assistants, dietitians, and health coaches. It also has the potential to be used in commercial settings, such as grocery stores, to help consumers easily identify healthier convenience foods. This has public health implications, as the rule can both empower consumers and potentially incentivize food manufacturers to upgrade their products nutritionally.
Additional research would be useful to evaluate consumers’ preferences and perceptions about how user-friendly the Altman Rule is at the point of sale with packaged carbohydrate foods. This would help to further understand how the use of information on food packaging can motivate healthier decisions—thereby helping to alleviate the burden of chronic disease.
CORRESPONDENCE
Kimberly R. Dong, DrPH, MS, RDN, Tufts University School of Medicine, Department of Public Health and Community Medicine, 136 Harrison Avenue, MV Building, Boston, MA 02111; kimberly.dong@tufts.edu
ABSTRACT
Background: The Altman Rule, a simple tool for consumers seeking to make healthier packaged food choices at the point of sale, applies to packaged carbohydrates. According to the Altman Rule, a food is a healthier option if it has at least 3 g of fiber per serving and the grams of fiber plus the grams of protein exceed the grams of sugar per serving. This study sought to evaluate whether the Altman Rule is a valid proxy for glycemic load (GL).
Methods: We compared the binary outcome of whether a food item meets the Altman Rule with the GL of all foods categorized as cereals, chips, crackers, and granola bars in the Nutrition Data System for Research Database (University of Minnesota, Version 2010). We examined the percentage of foods in low-, medium-, and high-GL categories that met the Altman Rule.
Results: There were 1235 foods (342 cereals, 305 chips, 379 crackers, and 209 granola bars) in this analysis. There was a significant relationship between the GL of foods and the Altman Rule (P < .001) in that most low-GL (68%), almost half of medium-GL (48%), and very few high-GL (7%) foods met the criteria of the rule.
Conclusions: The Altman Rule is a reasonable proxy for GL and can be a useful and accessible tool for consumers interested in buying healthier packaged carbohydrate foods.
Nutrition can be complicated for consumers interested in making healthier choices at the grocery store. Consumers may have difficulty identifying more nutritious options, especially when food labels are adorned with claims such as “Good Source of Fiber” or “Heart Healthy.”1 In addition, when reading food labels, consumers may find it difficult to decipher which data to prioritize when carbohydrates, total sugars, added sugars, total dietary fiber, soluble fiber, and insoluble fiber are all listed.
The concept of glycemic load (GL) is an important consideration, especially for people with diabetes. GL approximates the blood sugar response to different foods. A food with a high GL is digested quickly, and its carbohydrates are taken into the bloodstream rapidly. This leads to a spike and subsequent drop in blood sugars, which can cause symptoms of hyperglycemia and hypoglycemia in a person with diabetes.2,3 Despite its usefulness, GL may be too complicated for a consumer to understand, and it does not appear anywhere on the food label. Since GL is calculated using pooled blood sugar response from individuals after the ingestion of the particular food, estimation of the GL is not intuitable.4
Point-of-sale tools. People seeking to lose weight, control diabetes, improve dyslipidemia and/or blood pressure, and/or decrease their risk for heart disease may benefit from point-of-sale tools such as the Altman Rule, which simplifies and encourages the selection of more nutritious foods.1 Other tools—such as Guiding Stars (https://guidingstars.com), NuVal (www.nuval.com), and different variations of traffic lights—have been created to help consumers make more informed and healthier food choices.5-8 However, Guiding Stars and NuVal are based on complicated algorithms that are not entirely transparent and not accessible to the average consumer.6,7 Evaluations of these nutrition tools indicate that consumers tend to underrate the healthiness of some foods, such as raw almonds and salmon, and overrate the healthiness of others, such as fruit punch and diet soda, when using traffic light systems.6 Furthermore, these nutrition tools are not available in many supermarkets. Previous research suggests that the use of point-of-sale nutrition apps decreases with the time and effort involved in using an app.9
Continue to: The Altman Rule
The Altman Rule was developed by a family physician (author WA) to provide a more accessible tool for people interested in choosing healthier prepackaged carbohydrate foods while shopping. Since the user does not need to have a smartphone, and they are not required to download or understand an app for each purchase, the Altman Rule may be more usable compared with more complicated alternatives.
The Altman Rule can be used with nutrition labels that feature serving information and calories in enlarged and bold type, in compliance with the most recent US Food and Drug Administration (FDA) guideline from 2016. Many foods with high fiber also have high amounts of sugar, so the criteria of the Altman Rule includes a 2-step process requiring (1) a minimum of 3 g of total dietary fiber per serving and (2) the sum of the grams of fiber plus the grams of protein per serving to be greater than the total grams of sugar (not grams of added sugar or grams of carbohydrate) per serving (FIGURE 1A). Unlike the relatively complicated formula related to GL, this 2-part rule can be applied in seconds while shopping (FIGURE 1B).
The rule is intended only to be used for packaged carbohydrate products, such as bread, muffins, bagels, pasta, rice, oatmeal, cereals, snack bars, chips, and crackers. It does not apply to whole foods, such as meat, dairy, fruits, or vegetables. These foods are excluded to prevent any consumer confusion related to the nutritional content of whole foods (eg, an apple may have more sugar than fiber and protein combined, but it is still a nutritious option).
This study aimed to determine if the Altman Rule is a reasonable proxy for the more complicated concept of GL. We calculated the relationship between the GL of commercially available packaged carbohydrate foods and whether those foods met the Altman Rule.
METHODS
The Altman Rule was tested by comparing the binary outcome of the rule (meets/does not meet) with data on all foods categorized as cereals, chips, crackers, and granola bars in the Nutrition Data System for Research (NDSR) Database (University of Minnesota, Version 2010).
Continue to: To account for differences...
To account for differences in serving size, we used the standard of 50 g for each product as 1 serving. We used 50 g (about 1.7 oz) to help compare the different foods and between foods within the same group. Additionally, 50 g is close to 1 serving for most foods in these groups; it is about the size of a typical granola bar, three-quarters to 2 cups of cereal, 10 to 12 crackers, and 15 to 25 chips. We determined the GL for each product by multiplying the number of available carbohydrates (total carbohydrate – dietary fiber) by the product’s glycemic index/100. In general, GL is categorized as low (≤ 10), medium (11-19), or high (≥ 20).
We applied the Altman Rule to categorize each product as meeting or not meeting the rule. We compared the proportion of foods meeting the Altman Rule, stratified by GL and by specific foods, and used chi-square to determine if differences were statistically significant. These data were collected and analyzed in the summer of 2019.
RESULTS
There were 1235 foods (342 breakfast cereals, 305 chips, 379 crackers, and 209 granola bars) used for this analysis. There is a significant relationship between the GL of foods and the Altman Rule in that most low-GL (68%), almost half of medium-GL (48%), and only a few high-GL foods (7%) met the rule (P < .001) (TABLE 1). There was also a significant relationship between “meeting the Altman Rule” and GL within each food type (P < .001) (TABLE 2).
The medium-GL foods were the second largest category of foods we calculated; thus we further broke them into binary categories of
Foods that met the rule were more likely to be low GL and foods that did not pass the rule were more likely high GL. Within the medium-GL category, foods that met the rule were more likely to be low-medium GL.
Continue to: The findings within food categories...
The findings within food categories showed that very few cereals, chips, crackers, and granola bars were low GL. For every food category, except granola bars, far more low-GL foods met the Altman Rule than those that did not. At the same time, very few high-GL foods met the Altman Rule. The category with the most individual high-GL food items meeting the Altman Rule was cereal. This was also the subcategory with the largest percentage of high-GL food items meeting the Altman Rule. Thirty-nine cereals that were high GL met the rule, but more than 4 times as many high-GL cereals did not (n = 190).
DISCUSSION
Marketing and nutrition messaging create consumer confusion that makes it challenging to identify packaged food items that are more nutrient dense. The Altman Rule simplifies food choices that have become unnecessarily complex. Our findings suggest this 2-step rule is a reasonable proxy for the more complicated and less accessible GL for packaged carbohydrates, such as cereals, chips, crackers, and snack bars. Foods that meet the rule are likely low or low-medium GL and thus are foods that are likely to be healthier choices.
Of note, only 9% of chips (n = 27) passed the Altman Rule, likely due to their low dietary fiber content, which was typical of chips. If a food item does not have at least 3 grams of total dietary fiber per serving, it does not pass the Altman Rule, regardless of how much protein or sugar is in the product. This may be considered a strength or a weakness of the Altman Rule. Few nutrition-dense foods are low in fiber, but some foods could be nutritious but do not meet the Altman Rule due to having < 3 g of fiber.
With the high prevalence of chronic diseases such as hypertension, diabetes, hyperlipidemia, and cardiovascular disease, it is essential to help consumers prevent chronic disease altogether or manage their chronic disease by providing tools to identify healthier food choices. The tool also has a place in clinical medicine for use by physicians and other health care professionals. Research shows that physicians find both time and lack of knowledge/resources to be a barrier to providing nutritional counseling to patients.10 Since the Altman Rule can be shared and explained with very little time and without extensive nutritional knowledge, it meets these needs.
Limitations
Glycemic load. We acknowledge that the Altman Rule is not foolproof and that assessing this rule based on GL has some limitations. GL is not a perfect or comprehensive way to measure the nutritional value of a food. For example, fruits such as watermelon and grapes are nutritionally dense. However, they contain high amounts of natural sugars—and as such, their GL is relatively high, which could lead a consumer to perceive them as unhealthy. Nevertheless, GL is both a useful and accepted tool and a reasonable way to assess the validity of the rule, specifically when assessing packaged carbohydrates. The simplicity of the Altman Rule and its relationship with GL makes it such that consumers are more likely to make a healthier food choice using it.9
Continue to: Specificity and sensitivity
Specificity and sensitivity. There are other limitations to the Altman Rule, given that a small number of high-GL foods meet the rule. For example, some granola bars had high dietary protein, which offset a high sugar content just enough to pass the rule despite a higher GL. As such, concluding that a snack bar is a healthier choice because it meets the Altman Rule when it has high amounts of sugar may not be appropriate. This limitation could be considered a lack of specificity (the rule includes food it ought not to include). Another limitation to consider would be a lack of sensitivity, given that only 68% of low-GL foods passed the Altman Rule. Since GL is associated with carbohydrate content, foods with a low carbohydrate count often have little to no fiber and thus would fall into the category of foods that did not meet the Altman Rule but had low GL. In this case, however, the low amount of fiber may render the Altman Rule a better indicator of a healthier food choice than the GL.
Hidden sugars. Foods with sugar alcohols and artificial sweeteners may be as deleterious as caloric alternatives while not being accounted for when reporting the grams of sugar per serving on the nutrition label.7 This may represent an exception to the Altman Rule, as foods that are not healthier choices may pass the rule because the sugar content on the nutrition label is, in a sense, artificially lowered. Future research may investigate the hypothesis that these foods are nutritionally inferior despite meeting the Altman Rule.
The sample. Our study also was limited to working only with foods that were included in the NDSR database up to 2010. This limitation is mitigated by the fact that the sample size was large (> 1000 packaged food items were included in our analyses). The study also could be limited by the food categories that were analyzed; food categories such as bread, rice, pasta, and bagels were not included.
The objective of this research was to investigate the relationship between GL and the Altman Rule, rather than to conduct an exhaustive analysis of the Altman Rule for every possible food category. Studying the relationship between the Altman Rule and GL in other categories of food is an objective for future research. The data so far support a relationship between these entities. The likelihood of the nutrition facts of foods changing without the GL changing (or vice versa) is very low. As such, the Altman Rule still seems to be a reasonable proxy of GL.
CONCLUSIONS
Research indicates that point-of-sale tools, such as Guiding Stars, NuVal, and other stoplight tools, can successfully alter consumers’ behaviors.9 These tools can be helpful but are not available in many supermarkets. Despite the limitations, the Altman Rule is a useful decision aid that is accessible to all consumers no matter where they live or shop and is easy to use and remember.
The Altman rule can be used in clinical practice by health care professionals, such as physicians, nurse practitioners, physician assistants, dietitians, and health coaches. It also has the potential to be used in commercial settings, such as grocery stores, to help consumers easily identify healthier convenience foods. This has public health implications, as the rule can both empower consumers and potentially incentivize food manufacturers to upgrade their products nutritionally.
Additional research would be useful to evaluate consumers’ preferences and perceptions about how user-friendly the Altman Rule is at the point of sale with packaged carbohydrate foods. This would help to further understand how the use of information on food packaging can motivate healthier decisions—thereby helping to alleviate the burden of chronic disease.
CORRESPONDENCE
Kimberly R. Dong, DrPH, MS, RDN, Tufts University School of Medicine, Department of Public Health and Community Medicine, 136 Harrison Avenue, MV Building, Boston, MA 02111; kimberly.dong@tufts.edu
1. Hersey JC, Wohlgenant KC, Arsenault JE, et al. Effects of front-of-package and shelf nutrition labeling systems on consumers. Nutr Rev. 2013;71:1-14. doi: 10.1111/nure.12000
2. Jenkins DJA, Dehghan M, Mente A, et al. Glycemic index, glycemic load, and cardiovascular disease and mortality. N Engl J Med. 2021;384:1312-1322. doi: 10.1056/NEJMoa2007123
3. Brand-Miller J, Hayne S, Petocz P, et al. Low–glycemic index diets in the management of diabetes. Diabetes Care. 2003;26:2261-2267. doi: 10.2337/diacare.26.8.2261
4. Matthan NR, Ausman LM, Meng H, et al. Estimating the reliability of glycemic index values and potential sources of methodological and biological variability. Am J Clin Nutr. 2016;104:1004-1013. doi: 10.3945/ajcn.116.137208
5. Sonnenberg L, Gelsomin E, Levy DE, et al. A traffic light food labeling intervention increases consumer awareness of health and healthy choices at the point-of-purchase. Prev Med. 2013;57:253-257. doi: 10.1016/j.ypmed.2013.07.001
6. Savoie N, Barlow K, Harvey KL, et al. Consumer perceptions of front-of-package labelling systems and healthiness of foods. Can J Public Health. 2013;104:e359-e363. doi: 10.17269/cjph.104.4027
7. Fischer LM, Sutherland LA, Kaley LA, et al. Development and implementation of the Guiding Stars nutrition guidance program. Am J Health Promot. 2011;26:e55-e63. doi: 10.4278/ajhp.100709-QUAL-238
8. Maubach N, Hoek J, Mather D. Interpretive front-of-pack nutrition labels. Comparing competing recommendations. Appetite. 2014;82:67-77. doi: 10.1016/j.appet.2014.07.006
9. Chan J, McMahon E, Brimblecombe J. Point‐of‐sale nutrition information interventions in food retail stores to promote healthier food purchase and intake: a systematic review. Obes Rev. 2021;22. doi: 10.1111/obr.13311
10. Mathioudakis N, Bashura H, Boyér L, et al. Development, implementation, and evaluation of a physician-targeted inpatient glycemic management curriculum. J Med Educ Curric Dev. 2019;6:238212051986134. doi: 10.1177/2382120519861342
1. Hersey JC, Wohlgenant KC, Arsenault JE, et al. Effects of front-of-package and shelf nutrition labeling systems on consumers. Nutr Rev. 2013;71:1-14. doi: 10.1111/nure.12000
2. Jenkins DJA, Dehghan M, Mente A, et al. Glycemic index, glycemic load, and cardiovascular disease and mortality. N Engl J Med. 2021;384:1312-1322. doi: 10.1056/NEJMoa2007123
3. Brand-Miller J, Hayne S, Petocz P, et al. Low–glycemic index diets in the management of diabetes. Diabetes Care. 2003;26:2261-2267. doi: 10.2337/diacare.26.8.2261
4. Matthan NR, Ausman LM, Meng H, et al. Estimating the reliability of glycemic index values and potential sources of methodological and biological variability. Am J Clin Nutr. 2016;104:1004-1013. doi: 10.3945/ajcn.116.137208
5. Sonnenberg L, Gelsomin E, Levy DE, et al. A traffic light food labeling intervention increases consumer awareness of health and healthy choices at the point-of-purchase. Prev Med. 2013;57:253-257. doi: 10.1016/j.ypmed.2013.07.001
6. Savoie N, Barlow K, Harvey KL, et al. Consumer perceptions of front-of-package labelling systems and healthiness of foods. Can J Public Health. 2013;104:e359-e363. doi: 10.17269/cjph.104.4027
7. Fischer LM, Sutherland LA, Kaley LA, et al. Development and implementation of the Guiding Stars nutrition guidance program. Am J Health Promot. 2011;26:e55-e63. doi: 10.4278/ajhp.100709-QUAL-238
8. Maubach N, Hoek J, Mather D. Interpretive front-of-pack nutrition labels. Comparing competing recommendations. Appetite. 2014;82:67-77. doi: 10.1016/j.appet.2014.07.006
9. Chan J, McMahon E, Brimblecombe J. Point‐of‐sale nutrition information interventions in food retail stores to promote healthier food purchase and intake: a systematic review. Obes Rev. 2021;22. doi: 10.1111/obr.13311
10. Mathioudakis N, Bashura H, Boyér L, et al. Development, implementation, and evaluation of a physician-targeted inpatient glycemic management curriculum. J Med Educ Curric Dev. 2019;6:238212051986134. doi: 10.1177/2382120519861342