The health care industry is undergoing a significant transformation due to rapid technological advancements, particularly the rise of telehealth []. Telehealth includes a range of remote health care services, bridging gaps between health care demand and supply by providing convenient access to consultations, remote monitoring, and timely interventions []. In the United States, telehealth adoption accelerated during the COVID-19 pandemic, highlighting its role in improving access to care [].
Health care resources include medical personnel, equipment, and facilities required for service delivery []. For telehealth, resources also include digital platforms, communication devices, and technical support, while office visits rely on clinical spaces and on-site staff []. Resource use measures how efficiently these resources are managed, with studies indicating that telehealth can reduce wait times, increase visit rates, and minimize missed appointments [,-], enhancing care continuity.
Telehealth services were provided by a diverse range of health care facilities. Some adopted telehealth for the first time in response to the pandemic, while others, which already had telehealth services in place, experienced increased demand over the past 4 years. Consequently, facilities had to allocate resources—such as doctors and nurses—across multiple types of visits, including office visits, telehealth appointments, and emergency calls. A study found that larger facilities delivered a higher proportion of their visits through telehealth compared to smaller facilities with fewer doctors and nurses []. This suggests that health care facilities vary in their resource use and staffing needs.
Identifying the factors that influence telehealth use and evaluating its impact on health care resources are essential for effective resource planning. Several studies indicate that factors such as technological infrastructure, patient acceptance, and broadband access determine the feasibility and effectiveness of implementing telehealth [-]. These studies also highlight variations in adoption across different health care specialties and patient demographics.
One recent study reported that telehealth services improved resource use, as telehealth visits were associated with shorter waiting times and appointment durations compared to office visits in certain specialties. Additionally, both waiting times and appointment durations for telehealth services decreased over the study period []. Another study identified disparities in telehealth use, consistent with findings from various data sources and periods []. It calls for further research to determine whether differences in telehealth service proportions are linked to hospital or community telehealth capacity and to understand why these service proportions vary across demographic characteristics. These findings highlight the importance of nationwide analyses to assess telehealth use comprehensively.
This study aims to identify factors influencing resource use in health care facilities in the United States and to compare patterns across different settings (telehealth and office visits) and time frames. We focus on 2 metrics, patient-to-provider ratio and appointment duration, as key indicators of how health care resources are allocated and managed. The patient-to-provider ratio reflects the distribution of patient load among providers, serving as an indicator of resource allocation within health care systems. Higher ratios may indicate overburdened providers and diminished quality of care, whereas lower ratios suggest more manageable workloads and potentially improved patient attention. Appointment duration captures how providers allocate time per visit, offering insight into time management and the depth of patient engagement. Together, these metrics provide a comprehensive view of health care resource use in both telehealth and office-based contexts. Using national data from 2018 to 2023, the study seeks to generate new insights into telehealth adoption and resource use, offering a broader perspective than previous research [-]. In doing so, it provides evidence to inform health care providers, policymakers, and other stakeholders in planning for sustainable telehealth integration.
Building on these objectives, we focus on 4 specialties, psychiatry, behavioral health, bariatrics, and sleep medicine, due to their relatively higher levels of telehealth use. Specifically, the study examines whether telehealth use and patient- and facility-related factors are associated with variations in patient-to-provider ratios and appointment durations. It also explores whether clusters of health care facilities demonstrate distinct patterns in telehealth adoption, patient-to-provider ratios, and appointment durations over time.
We hypothesize that (1) telehealth visits are associated with lower patient-to-provider ratios and shorter appointment durations than office visits, (2) patient- and facility-related factors significantly impact resource use across care settings, and (3) clusters of health care facilities demonstrate meaningful differences in telehealth adoption and performance metrics (patients per provider and appointment duration) over time.
Data collection was approved by the institutional review board of the University of Arkansas for Medical Sciences (IRB 276339). It was determined as nonhuman subject research. The data were deidentified, and there was no interaction with individuals.
Data CollectionPatient data for this study were sourced from Epic Cosmos (hereafter referred to as Cosmos), a robust platform offering a diverse and representative sample of patients across various demographics, including race, sex, age, geographic settings, and insurance types []. The dataset spanned patient visits from health care facilities across 48 US states and was fully deidentified to ensure confidentiality. Access to the dataset was facilitated through the Epic Systems Cosmos research platform via a Microsoft SQL Server database.
This study focused on telehealth (audiovisual) and office (face-to-face) visits in the United States between 2018 and 2023, capturing pre-, mid-, and postpandemic periods to evaluate the impact of the COVID-19 pandemic on telehealth adoption and resource use.
Our study used the following data tables from Cosmos:
Encounter table: Details visit-related information, including encounter type (eg, office visits and telehealth), provider details, health care specialties, and appointment times.Patient table: Provides patient-related demographics and enables analysis of telehealth access across various populations (eg, older people and rural residents). It also includes the Centers for Disease Control and Prevention or the Agency for Toxic Substances and Disease Registry Social Vulnerability Index (SVI), offering insights into education, financial resources, and transportation needs.Insurance table: Contains coverage details for each visit, allowing identification of primary insurance providers.Hospital table: Includes data transfer start and end dates for each hospital, aiding in data cleaning processes.Data PreprocessingOverviewData preprocessing, including data cleaning and feature engineering, ensured and improved model performance []. This study focused on patient- and facility-related factors and on specialties with the highest rates of telehealth adoption to identify best practices for virtual care.
We processed visit-level data for these specialties using features derived from patient, encounter, insurance, and hospital tables. Recent studies frequently examined demographic factors, including age, sex, race, and rurality [,-]. Age was a key variable, with research comparing use rates across age groups to understand adoption trends. Sex and racial disparities were also explored to identify inequities in telehealth access, while rurality was examined to evaluate telehealth’s role in addressing health care gaps in remote areas. Additionally, insurance coverage was a critical determinant, influencing telehealth access through policy and reimbursement mechanisms. These demographic insights informed strategies to enhance equitable access.
We also incorporated the SVI percentiles from the Centers for Disease Control and Prevention to assess the impact of social vulnerability on telehealth use [,]. The SVI ranked census tracts based on socioeconomic status, household characteristics, minority and language status, and housing type and transportation. The SVI metric aided in resource allocation for vulnerable populations, enhancing public health preparedness and response efforts [].
Data CleaningThe process used to define the subset of visits included in our study is illustrated in . We excluded visits without a valid PatientID or transfer date to ensure data completeness. Because facilities periodically transfer electronic health record data, only visits within valid transfer start and end dates were included. We analyzed only same-day outpatient visits and focused on age, race, sex, rurality, insurance, and SVI percentiles. Preprocessing steps for these factors are shown in .
Figure 1. Flow diagram for data preprocessing steps. RUCA: rural-urban commuting area; SVI: Social Vulnerability Index. Data were aggregated at the hospital level by month across 72 periods (2018-2023). lists the variables, including temporal features (year, quarter, and month), to capture time trends. For demographics (race, sex, and insurance), we calculated the proportion of telehealth versus office visits; for age, rural-urban commuting area (RUCA), and SVI, we computed median age, average RUCA, and average SVI scores. To evaluate resource use, we calculated the patient-to-provider ratio and appointment duration for telehealth and office visits at each hospital and during each period. Appointment durations shorter than 10 minutes or longer than 100 minutes were excluded to reduce the influence of outliers and potential data entry errors. While such durations may occur in rare cases, they fall outside the typical range for standard provider-patient interactions and could skew the analysis. Finally, we scaled the training data before applying machine learning (ML) models [].
Table 1. Variables and feature types.Independent variablesFeature typeYearTemporalQuarterTemporalMonthTemporalAgePatient-relatedRUCAa codePatient-relatedSexPatient-relatedRacePatient-relatedSVIb householdPatient-relatedSVI housingPatient-relatedSVI minorityPatient-relatedSVI socioeconomicPatient-relatedPublic insurancePatient-relatedPrivate insurancePatient-relatedSelf-payPatient-relatedVisit typePatient-relatedNumber of visitsFacility-relatedNumber of providersFacility-relatedTelehealth useFacility-relatedRuralityFacility-relatedTotal visitsFacility-relatedaRUCA: rural-urban commuting area.
bSVI: Social Vulnerability Index.
Model DevelopmentOverviewWe conducted a comparative analysis of data from multiple health care facilities (2018-2023) to evaluate the impact of patient characteristics and health care settings on resource use and to identify key drivers. Given the large number of facilities, we used clustering to group similar facilities into meaningful profiles. This approach reduced complexity and highlighted common patterns across facility types, such as those with similar size, service mix, or adoption levels. By comparing these clusters, we provided insights that go beyond individual facilities and offered a clearer understanding of how telehealth influences resource use. In practice, this allowed facilities to benchmark their performance against comparable peers and consider alternative strategies. After clustering, we compared resource use between facility groups during and after the pandemic. illustrates the overall methodology, with details provided in the subsequent section.
Figure 2. Flow diagram of the proposed methodology. Learning AlgorithmsTo examine the influence of patient- and facility-related factors on resource use, specifically the patient-to-provider ratio and appointment durations, we used nationwide temporal data and applied ML techniques. Prior work showed that ML improved performance with large health care datasets by uncovering links between demographics, clinical characteristics, and service types [,].
We evaluated 3 ML regression models: random forests, extreme gradient boosting (XGBoost), and deep neural networks (DNNs). These models were selected because they are widely used and complementary—random forests provide robustness and interpretability, XGBoost is highly efficient for structured tabular data, and DNNs are well-suited for capturing complex nonlinear relationships in large-scale datasets [,]. To optimize performance, we used Optuna, a hyperparameter tuning framework, to efficiently explore the parameter space and identify the best configuration for each specialty []. Model performance was assessed using R2, which measures how well predicted outcomes align with actual values and enables comparison across models and datasets []. To determine the relative importance of features on resource use outcomes, we used Shapley Additive Explanations (SHAP), which computes SHAP values to provide interpretable insights into feature contributions [].
Clustering ModelAfter evaluating factors influencing resource use, we clustered health care facilities based on shared characteristics so that similar facilities can be analyzed collectively. The model included patient- and facility-related features and resource use metrics. To prepare the data for clustering, we aggregated it at the hospital level using the following methodology:
Weighted averages: We calculated weighted averages using the number of visits as the weighting factor for the following patient-related factors: age, RUCA code, SVI household, SVI housing, SVI minority, SVI socioeconomic, race, sex, and payment methods (public insurance, private insurance, and self-pay).Patient-to-provider ratio: We calculated the weighted average of this ratio with the number of providers as the weighting factor.Telehealth use: We summed the total number of telehealth visits and overall visits for each hospital. The proportion of telehealth visits was then determined by dividing the number of telehealth visits by the total number of visits.Provider counts: We aggregated the total number of providers offering telehealth and office visits for each hospital.Rurality variable: This variable remained unchanged, as it was already defined at the hospital level.The features initially considered in the clustering model are detailed in . We used k-means clustering, a widely used algorithm for partitioning datasets into distinct groups based on feature similarity []. First, we evaluated feature correlations and excluded those with a correlation coefficient equal to or higher than 0.70 []. Then, we further refined the clustering process by removing outliers using the isolation forest algorithm [].
Textbox 1. Variables for the clustering model.Independent variables
AgeRural-urban commuting area codeSocial Vulnerability Index (SVI) householdSVI housingSVI minoritySVI socioeconomicRaceSexPublic insurancePrivate insuranceSelf-payAppointment durationsRuralityTotal visitsNumber of telehealth visitsNumber of providersPatient-to-provider ratioTelehealth useTo determine the optimal number of clusters, we applied the elbow method []. The elbow point marked the balance between minimizing inertia and avoiding excessive clustering. Additionally, we considered cluster evaluation metrics, including the Davies-Bouldin Index, where lower values indicate better-defined clusters [], and the Calinski-Harabasz Index, which favors higher values for compact and well-separated clusters []. Once the optimal number of clusters was identified for each specialty, we executed separate clustering models for each specialty. Finally, we assessed feature importance within the clustering model using the variance explanation method [].
Comparative AnalysisAfter clustering health care facilities, we analyzed disparities in patient-to-provider ratios and appointment durations. Although we collected data for 3 time periods: pre–COVID-19 (2018-2019), during COVID-19 (2020-2021), and post–COVID-19 (2022-2023), the prepandemic years are excluded due to the limited telehealth adoption.
To investigate whether statistically significant differences exist in appointment durations or patient-to-provider ratios among the clusters of health care facilities, we applied 1-way ANOVA []. These analyses are conducted separately by specialty, visit type (telehealth and office visits), and time period (during and postpandemic). The hypotheses are defined as follows:
H0: There are no statistically significant differences in patient-to-provider ratios or appointment durations for telehealth or office visits during the pandemic or postpandemic periods among the clusters of health care facilities.Ha: There are statistically significant differences in patient-to-provider ratios or appointment durations for telehealth or office visits during the pandemic or postpandemic periods among the clusters of health care facilities.To further explore temporal differences, we compared patient-to-provider ratios and appointment durations within each cluster between the during pandemic and postpandemic periods using paired 2-tailed t tests []. These tests were also conducted separately by specialty and visit type. The paired 2-tailed t test hypotheses are as follows:
H0: There is no statistically significant difference in mean appointment durations or patient-to-provider ratios for telehealth or office visits within each cluster between the during pandemic and postpandemic periods.Ha: There is a statistically significant difference in mean appointment durations or patient-to-provider ratios for telehealth or office visits within each cluster between the during pandemic and postpandemic periods.Before performing these tests, we assessed the assumptions of normality and homogeneity of variance using the Shapiro-Wilk test and the Levene test [,]. If these assumptions were violated, alternative nonparametric methods were used: the Kruskal-Wallis test replaced 1-way ANOVA, and the Wilcoxon signed rank test substituted for the paired 2-tailed t test. Post-hoc comparisons were conducted using Tukey honestly significant difference for ANOVA [] and Dunn test with Bonferroni correction for the Kruskal-Wallis test [].
This section presents the results in alignment with . We begin with an exploratory data analysis to highlight telehealth trends and resource use patterns. We then present the results of the learning algorithm and identify the factors influencing resource use. Next, we describe the clustering of health care facilities and report the comparative analyses of resource use across these clusters.
These results directly address the study objectives by examining how telehealth use and patient- and facility-related factors affect patient-to-provider ratios and appointment durations and by exploring how clusters of health care facilities differ in telehealth adoption and performance metrics over time.
Trends in Telehealth and Office VisitsOverviewpresents the quarterly number of telehealth and office visits from 2018 to 2023. A significant decline in total visits is observed during the early months of the pandemic (Q1 and Q2 2020), accompanied by a sharp increase in telehealth visits, peaking in Q2 2020. Following this surge, telehealth use gradually declined but remained consistently used, underscoring its sustained role in health care delivery through the end of 2023.
Table 2. Number of telehealth and office visits (2018-2023).Quarter, yearTelehealth visits, nOffice visits, nQ1, 201831,65038,361,217Q2, 201834,20839,240,189Q3, 201838,99238,748,133Q4, 201849,53340,941,939Q1, 201952,59343,038,467Q2, 201957,86944,101,303Q3, 201960,09244,731,534Q4, 201974,95346,290,450Q1, 20201,001,36044,025,023Q2, 20209,566,51524,844,182Q3, 20206,069,12639,700,887Q4, 20206,426,34240,275,185Q1, 20216,562,66742,052,938Q2, 20214,748,01647,461,381Q3, 20214,194,78450,377,723Q4, 20214,142,67651,070,485Q1, 20225,011,82151,412,365Q2, 20223,993,35954,568,671Q3, 20223,874,79955,658,385Q4, 20224,123,91457,645,034Q1, 20234,318,72761,454,472Q2, 20233,909,78262,379,729Q3, 20233,814,82761,818,757Q4, 20234,207,16464,956,975Evaluating Telehealth Use Across Health Care SpecialtiesWe analyzed health care specialties with the highest telehealth adoption to identify best practices for virtual care (). Significant variations were observed, with psychiatry leading telehealth visits at 36.9% (4,543,021/12,311,710), followed by behavioral health (5,321,099/22,264,013, 23.9%), bariatrics (924,333/4,360,061, 21.2%), and sleep medicine (851,803/5,070,256, 16.8%). These adoption rates highlight strong telehealth integration within these specialties, offering valuable insights into factors driving adoption and associated outcomes. Visit trends for these specialties can be seen in .
Figure 3. Percentage and number of telehealth visits (2018-2023). Data PreprocessingOverviewshows the number of visits before and after data preprocessing and outlier removal, broken down by specialty as well as telehealth and office visits. The counts and percentages of visits excluded at each step are provided in Tables S1 and S2 in .
Table 3. Number of visits before and after data preprocessing and excluding outliers.SpecialtyTelehealthOffice- display the proportion of telehealth visits across RUCA codes from 2018 to 2023 for the selected 4 specialties.
Figure 4. Proportion of telehealth visits over time in psychiatry. RUCA: rural-urban commuting area.
Figure 5. Proportion of telehealth visits over time in behavioral health. RUCA: rural-urban commuting area.
Figure 6. Proportion of telehealth visits over time in bariatrics. RUCA: rural-urban commuting area.
Figure 7. Proportion of telehealth visits over time in sleep medicine. RUCA: rural-urban commuting area. In psychiatry, telehealth use was minimal before 2020 but increased sharply, particularly in rural areas (RUCA codes 8 and 10). The highest use occurred in 2021, with RUCA code 10 reaching 0.587 and RUCA code 1 at 0.583. RUCA code 10 maintained the highest telehealth use from 2021 to 2023, with RUCA code 1 consistently following. This trend highlights substantial telehealth adoption in psychiatry across both highly rural and urban areas. Behavioral health exhibited a similar trajectory, with negligible telehealth use before 2020. RUCA code 9 peaked at 0.466 in 2021, underscoring telehealth’s significance in areas with low commuting rates [].
In bariatrics, telehealth adoption began in 2020, with RUCA code 10 showing the highest rate (0.307). From 2021 to 2023, RUCA code 1 led telehealth use, reflecting its essential role in both urban and rural settings for bariatric services. Sleep medicine experienced gradual growth in telehealth adoption, with RUCA code 10 reaching its highest rate (0.262) in 2021. By 2023, telehealth use in rural areas remained high.
Our analysis revealed a consistent upward trend in telehealth visits across all 4 specialties from 2018 to 2023, particularly pronounced in specific RUCA codes. This underscored the growing acceptance and importance of telehealth services in both the most urban and rural areas. Although telehealth use peaked during the pandemic, it remained widely adopted across regions. However, the absence of uniform patterns across all RUCA codes for every specialty highlighted the unique telehealth use trends within each specialty, emphasizing the need for tailored analyses.
Resource Use Over Timeillustrates the trends in patient-to-provider ratios for telehealth and office visits across specialties from 2018 to 2023. The COVID-19 pandemic in 2020 caused a marked increase in telehealth ratios, as the health care system adapted to unprecedented challenges. In the postpandemic period, these ratios either stabilized or continued to grow, reflecting sustained telehealth adoption and ongoing adjustments in provider practices.
Figure 8. Patient-to-provider ratio over time. highlights appointment duration trends over the same period. During the pandemic, telehealth appointments were generally longer but eventually stabilized or decreased in the postpandemic period. Telehealth appointments were consistently shorter than office visits, with the exception of behavioral health in 2018.
Figure 9. Appointment durations over time. Distinct trends across specialties underscore the necessity of analyzing factors influencing patient-to-provider ratios and appointment durations. The substantial variations during and after the pandemic also call for a detailed comparison of these periods.
Given the large number of health care facilities in this study, we applied a clustering approach to group facilities with similar characteristics, allowing more comprehensive comparisons and clearer interpretation of patterns. Building on this, we developed models to examine resource use, its influencing factors, followed by a comparative analysis.
Learning AlgorithmsTo analyze the impact of patient- and facility-related factors on resource use, we evaluated 3 ML regression models, including random forest, XGBoost, and DNN models (see Tables S1-S3 in for model performances and hyperparameters). Ultimately, XGBoost was chosen due to its superior predictive performance.
XGBoost Regression ModelSeparate models were developed for each specialty, with hyperparameters optimized using Optuna. Typically, data were divided into an 80% training and 20% testing split to balance learning and evaluation []. However, to improve model performance, we adjusted the splits to 90%-10% split based on R2 results. shows the number of training and testing rows per specialty.
Table 4. Training and testing splits for the models.SpecialtyTrainingTestingPsychiatry93351038Behavioral health8464941Bariatrics8892989Sleep medicine94271049R2 values were calculated to evaluate model performance, showing strong predictive power for patient-to-provider ratios and moderate power for appointment durations (). All results were obtained within 4 seconds.
Table 5. R2 values for extreme gradient boosting regression models.SpecialtyPatient-to-provider ratio, R2Appointment duration, R2Psychiatry0.99350.6923Behavioral health0.96890.6673Bariatrics0.99480.6519Sleep medicine0.99340.6101Feature ImportanceIn SHAP summary plots, red points indicate higher feature values and blue points indicate lower values; their position on the x-axis shows whether they increase (right) or decrease (left) the prediction.
A-10D displays SHAP summary plots for output 1 (patient-to-provider ratio) across the selected specialties. Across all specialties, telehealth use emerges as an important factor, with higher telehealth use generally resulting in fewer patients per provider.
Figure 10. SHAP summary plots—patient-to-provider ratio. (A) Psychiatry, (B) behavioral health, (C) bariatrics, and (D) sleep medicine. RUCA: rural-urban commuting area; SHAP: Shapley Additive Explanations; SVI: Social Vulnerability Index. In psychiatry, SVI household and race significantly influence patient loads, reflecting socioeconomic and demographic factors. Higher SVI household values correspond to increased patient loads in psychiatry, behavioral health, and sleep medicine, while in bariatrics, no consistent trend is observed. Similarly, race shows mixed effects: health care facilities with a higher proportion of visits by White patients (red points) are linked to more patients per provider in psychiatry, but with fewer patients per provider in behavioral health and sleep medicine.
In behavioral health, sex is a significant factor, with facilities having a lower proportion of visits by female patients (blue points) associated with higher patient loads. Self-pay is also significant; however, it shows mixed effects. In bariatrics, visit type emerges as the most critical feature, with telehealth visits (red points) generally associated with fewer patients per provider compared to office visits (blue points), similar to sleep medicine. Telehealth use is the second most influential factor, where higher telehealth adoption correlates with decreased patient loads, again consistent with findings in sleep medicine.
In summary, while features such as telehealth use, visit type, SVI household, and race are universally important, the varying influence of demographic and socioeconomic factors underscores the diverse drivers of patient distribution across specialties.
A-11D displays SHAP summary plots for output 2 (appointment durations). Across specialties, visit type is an important factor: telehealth visits typically result in shorter durations, while office visits are associated with longer durations. Rurality also emerges as a significant factor but exhibits mixed effects across the 4 specialties. Year is another influential variable, with higher values (2021-2023) linked to shorter appointment durations, and lower values (2018-2020) linked to longer durations.
Figure 11. SHAP summary plots—appointment duration. (A) Psychiatry, (B) behavioral health, (C) bariatrics, and (D) sleep medicine. RUCA: rural-urban commuting area; SHAP: Shapley Additive Explanations; SVI: Social Vulnerability Index. In psychiatry, facilities with more providers (larger health care facilities) tended to have longer appointment durations, whereas the opposite pattern was observed in bariatrics, and no clear trend was evident in behavioral health or sleep medicine. Race is particularly important in sleep medicine, where facilities with a higher proportion of visits by White patients (red points) are associated with shorter durations.
Overall, visit type, rurality, and year consistently emerge as the most influential factors for appointment durations. Telehealth visits generally lead to shorter appointments, more recent years are associated with reduced durations, and rurality shows mixed effects across specialties.
Clustering ModelOverviewOur clustering model selected key features using SHAP analysis across all 4 health care specialties. To ensure diversity, we included 1 feature from each major category: 1 SVI metric, 1 demographic variable, 1 representing facility size, and 1 related to insurance. The final set of features consisted of number of providers, telehealth use, RUCA code, SVI socioeconomic, and public insurance. We applied the isolation forest algorithm with a contamination rate of 0.15 (excluding 15% of data points as anomalies). summarizes the number of health care facilities in each specialty before and after data cleaning.
Table 6. Number of health care facilities within the clustering model.SpecialtyBefore cleaning, n (%)After cleaning, n (%)Clusters createdPsychiatry134 (24.9)114 (24.9)6Behavioral health143 (26.5)121 (26.5)6Bariatrics117 (21.7)99 (21.7)6Sleep medicine145 (26.9)123 (26.9)6After evaluating clustering metrics, we selected 6 clusters for each specialty. Please see Figure S1A-S1D and Table S1 in and for details on the evaluation of clustering metrics and the importance of factors in the clustering model. Each cluster represents a distinct group of health care facilities. Figure S1A-S1D in displays the average telehealth use, rurality, SVI socioeconomic, SVI minority, and number of providers across clusters for each specialty. Figure S2A-S2D in illustrates the average patient-to-provider ratio, appointment durations, and facility counts per cluster. Additional feature averages for each cluster are detailed in Tables S1-S4 in .
Clusters of Health Care FacilitiesEach specialty’s clusters display unique resource use and patient care characteristics, emphasizing the diverse nature of health care delivery. This analysis highlights opportunities for optimizing health care services for various patient populations and settings.
In , we summarize the clustering results across specialties, focusing on average feature values, resource use, and variations observed during and after the pandemic.
Textbox 2. Highlights by specialty and clusters.Psychiatry
Telehealth appointments were shorter than office visits, with cluster 6 having the highest telehealth use.
Cluster 1: Younger patients (average age 40.68 years).Cluster 2: Few providers, 27.0% (5647/20,915) telehealth use, high rurality, and low telehealth patient-to-provider ratio during the pandemic, which increased after the pandemic with short telehealth appointments.Cluster 3: High public insurance coverage due to older patients (average age 49.8 years).Cluster 4: Many providers, 40.0% (48,585/121,463) telehealth use, low rurality, long appointments, and low patient-to-provider ratio.Cluster 5: High patient-to-provider ratio, 25.0% (7691/30,764) telehealth use, and vulnerable population based on the following factors: SVI household, housing, minority, and socioeconomic.Cluster 6: Few providers, 88.0% (10,607/12,053) telehealth use, low rurality, high self-paid visits, short appointments, and the lowest patient-to-provider ratio.Behavioral health
After the pandemic, office visits saw higher patient-to-provider ratios and shorter durations, while telehealth ratios were lower.
Cluster 1: High public insurance coverage.Cluster 2: Lowest telehealth use, 9.0% (1813/20,144), low telehealth, and high office patient-to-provider ratios.Cluster 3: Few providers, 83.0% (12,870/15,506) telehealth use, long appointments, and the highest patient-to-provider ratio.Cluster 4: Most providers and visits, low telehealth patient-to-provider ratio, and moderate telehealth use, 23.0% (39,890/173,435).Cluster 5: Moderate providers, 20.0% (18,533/92,665) telehealth use, high rurality, the highest office patient-to-provider ratio, and short telehealth appointments.Cluster 6: Few providers, 55.0% (3544/6444) telehealth use, and high self-paid visits.Bariatrics
Telehealth appointments were shorter than office visits, with 80.0% (119,241/149,051) of visits from female patients.
Cluster 1: Few providers, 7.0% (2047/29,243) telehealth use, and high rurality.Cluster 2: Moderate providers, 42.0% (9888/23,543) telehealth use, and the highest postpandemic telehealth patient-to-provider ratio.Cluster 3: Older patients (average age 50.91 years), least vulnerable, 15.0% (3895/25,967) telehealth use, and short appointments.Cluster 4: Few providers, 13.0% (1057/8131) telehealth use, the youngest patients, and high public insurance coverage.Cluster 5: Most providers, stable telehealth ratios, and increased office patient-to-provider ratio after the pandemic.Cluster 6: Few providers, 5.0% (793/15,860) telehealth use, lowest telehealth, and highest office patient-to-provider ratio.Sleep medicine
After the pandemic, office visits had higher patient-to-provider ratios, telehealth ratios were lower, and sleep medicine had an older patient population with 50.0% (98,312/196,624) female patients.
Cluster 1: Younger, vulnerable patients, low public insurance, 19.0% (2024/10,651), low telehealth use, 5.0% (1482/29,640), and long appointments.Cluster 2: Vulnerable patients and low telehealth use, 7.0% (1747/24,957).Cluster 3: High public insurance coverage.Cluster 4: Moderate providers, 15.0% (3737/24,913) telehealth use, high rurality, short appointments, and high patient-to-provider ratio.Cluster 5: Most providers and visits, low telehealth 9.0% (9171/101,900), and a high female patient proportion.Cluster 6: Moderate providers, 47.0% (13,282/28,260) telehealth use, short appointments, high telehealth, and low office patient-to-provider ratios.Comparative AnalysisOverviewThis section examines differences in patient-to-provider ratios and appointment durations between telehealth and office visits during and after the pandemic across specialty-specific clusters. Tables S1 and S1 in provide detailed data on these metrics, with Figures S1A-S1D and S2A-S2D in visualizing these comparisons. A significance threshold of α=.05 was used to assess differences in resource use over time.
Differences Between ClustersAll P values obtained are less than .05, indicating that the observed differences in patient-to-provider ratios (or appointment durations) are statistically significant. In addition, we present Dunn test results with corresponding P values in Tables S1-S8 in . For instance, Table S1 in shows significant differences in telehealth patient-to-provider ratios during the pandemic between cluster 6 and clusters 1-5. After the pandemic, additional differences emerged, such as between cluster 1 and clusters 2 and 3, indicating varying resource allocation and patient needs.
Resource Use During and After the Pandemicsummarizes these results. In psychiatry, telehealth appointment durations significantly changed (P=.03), while other metrics remained stable. Behavioral health showed significant differences in office visit metrics (P=.01) and appointment durations for both telehealth (P=.02) and office visits (P=.03). Bariatrics exhibited significant changes in telehealth visit patient-to-provider ratios (P<.001), and office appointment durations (P=.02). Sleep medicine showed marginal significance in telehealth patient-to-provider ratios (P=.08) and significant changes in office visit patient-to-provider ratios (P<.001). These results underscore the pandemic’s varied impact on resource use across specialties and visit types, particularly in behavioral health.
Table 7. Hypothesis testing results for patient-to-provider ratios and appointment durations during and after the pandemic for telehealth and office visita.SpecialtyDuring vs postPatient-to-provider ratioAppointment durationsHa: μd,t,ptp≠μp,t,ptp, P valueHa: μd,o,ptp≠μp,o,ptp, P valueHa: μd,t,app≠μp,t,app, P valueHa: μd,o,app≠μp,o,app, P valuePsychiatry.96.57.03.91Behavioral health.25.01.02.03Bariatrics.92<.001.02.86Sleep medicine.08<.001.10.95aμd,t,ptp, μp,t,ptp: mean patient-to-provider ratio for telehealth visits during the pandemic and after the pandemic; μd,o,ptp, μp,o,ptp mean patient-to-provider ratio for office visits during the pandemic and after the pandemic; μd,t,app, μp,t,app: mean appointment duration for telehealth visits during the pandemic and after the pandemic; and μd,o,app, μp,o,app: mean appointment duration for office visits during the pandemic and after the pandemic.
This study analyzed telehealth adoption using a nationwide dataset, focusing on psychiatry, behavioral health, bariatrics, and sleep medicine—specialties with high telehealth use. Our analysis shows that telehealth use peaked in 2020 with the COVID-19 pandemic and, while it has declined since, it remains above prepandemic levels. Telehealth adoption was especially high in both the most urban and most rural areas, underscoring telehealth’s role in addressing both access and convenience. Importantly, telehealth continues to be in use, highlighting its lasting role in health care delivery.
In terms of resource use, telehealth visits were typically associated with shorter appointment durations, while patient-to-provider ratios varied across specialties and over time. These findings support our first hypothesis and highlight telehealth’s role in improving time efficiency, though its effects on patient load vary across specialties. By combining descriptive, predictive, and clustering analyses, our study offers an integrated understanding of how facility characteristics, patient demographics and socioeconomic factors, and telehealth adoption jointly shape resource use patterns.
Identifying drivers of resource is essential for informing policy and management decisions aimed at optimizing resources, staffing, reimbursement, and equitable access to care. To examine variation in resource use among health care facilities, we clustered similar health care facilities and then compared these clusters. This approach allowed us to generate insights at the group level rather than for individual facilities, where substantial variability might otherwise obscure broader patterns.
Our XGBoost regression model provided a robust analytical framework for examining resource use, capturing both facility- and patient-level factors as well as specialty-specific dynamics. Consistent with our second hypothesis, the model results demonstrate that both patient and facility characteristics significantly impact how health care resources are distributed and used.
Socioeconomic and demographic characteristics distinctly influenced resource use, pointing to the need for tailored strategies across facilities, supporting our second hypothesis. For example, higher SVI household values were associated with increased patient loads in psychiatry, behavioral health, and sleep medicine, while no consistent trend was observed in bariatrics. This suggests that facilities serving populations with greater household vulnerability, such as those with higher proportions of older adults, children, single-parent households, or persons with disabilities, may face greater demand pressures. To respond effectively, such facilities could consider targeted staffing adjustments, optimized resource allocation strategies, and supportive services to reduce provider strain and maintain equitable access to care. Additional steps could include improving coordination among providers, expanding telehealth to reduce in-person delays, training providers to better serve vulnerable populations, and advocating for policies or reimbursement models that reflect the higher resource needs of these facilities.
Race played a complex role in resource use, with effects varying across specialties. Facilities serving a higher proportion of White patients experienced heavier patient loads in psychiatry but lighter loads in behavioral health and sleep medicine. These contrasting patterns may reflect differences in access, care-seeking behaviors, or service delivery models across patient populations. Recognizing these disparities is important for designing culturally responsive interventions, expanding outreach in underserved groups, and ensuring that telehealth integration supports equitable care delivery.
Beyond sociodemographic influences, the level of telehealth adoption was itself an important determinant of resource use. Facilities with higher telehealth use generally had fewer patients per provider. This finding fu
Comments (0)