1Department of Life Science, Sogang University, Seoul, Republic of Korea.
2Research Institute for Life Science, GW Vitek, Inc. Seoul, Republic of Korea.
Department of Life Science, Sogang University, Seoul 04107, Republic of Korea.
Tel: 82-2-705-8615;
Email: hdshin@sogang.ac.kr
Received : December 29, 2022,
Accepted : January 23, 2023
Published : January 31, 2023,
Archived : www.jclinmedcasereports.com
Fasting Blood Glucose (FBG) level is prevalent trait to predict several diseases. In the present study, we identified markers for prediction of blood glucose level. The genotype data of a total of 5,013 samples (2,373 men and 2,640 women) were obtained from the Korea Association Resource (KARE) project. We collected markers from a genome-wide association study (GWAS) catalog which included additional markers from nearby regions of GWAS catalog markers. In order to establish a FBG prediction model, we selected significant single nucleotide polymorphisms (SNPs) using a 10-fold cross-validation. In addition, we validated our prediction model using the final validation set. We selected a total of 7 SNP comprised of 2 SNPs (rs7754840 and rs12699673) for the men and 5 SNPs (rs102275, rs1574285, rs2908289, rs6494307, and rs917793) for the women from the 10-fold cross-validation process. The results of the 10-fold across-validation process in the men and the women indicated upward trends of FBG levels. Also, we validated our prediction model using the final validation set. In the final validation set, increased trends were observed across all of the sets. Our prediction model for FBG may be helpful to further FBG related studies.
Keywords: Genome-wide association study; SNP; Fasting blood glucose; weighted genetic risk score; SNP marker.
Copy right Statement: Content published in the journal follows Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0). © Shin HD (2022)
Journal: Open Journal of Clinical and Medical Case Reports is an international, open access, peer reviewed Journal mainly focused exclusively on the medical and clinical case reports.
Diabetes Mellitus (DM) is one of the most serious health problems in the world [1,2]. Of the various diagnosis factors for DM, Fasting Blood Glucose (FBG) level is an important factor for diagnosis of prediabetes as it has a high association with the development rate of diabetes [3]. In addition, more recent studies have observed that prediabetes status is associated with risk of coronary heart disease in both diabetic and nondiabetic subjects [4,5]. The comparison between subjects with prediabetes and normal subjects has also found that prediabetic status increases the risk of cardiovascular disease and other related symptoms such as stroke and coronary heart disease [6].
Due to the importance of FBG, large-scale genome-wide association studies have been conducted and have found numerous markers significantly associated with FBG level, such as G6PC2, FOXA2, MTNR1B, and LEPR [7-10]. Following replication studies also have identified the associations between these genes and FBG level. For example, one previous study reported significant association between the rs10830963 in MTNR1B and FBG level in Bosnian and Herzegovinian population [11]. The rs560887 in G6PC2 also showed significant associations with reduced FBG levels in normal subject of Hispanic populations [12].
Several function studies have also provided evidences of a relationship between the significant SNPs and FBG level. The previous study demonstrated that two SNPs (rs560887 and rs2232316) in G6PC2 are responsible for increasing of FBG level by enhancing G6PC2 pre-mRNA splicing and regulating transcription factor binding affinity [13]. Another study showed that rs1137100 in the LEPR gene is associated with glucose homeostasis [14].
Based on these previous observations, it is clear that there is a significant association between FBG levels and SNPs. And also, according to the relation of FBG with disease occurrence, we anticipate that the prediction of FBG level using SNP may suggest possible role in public health.
In the present study, we aimed to establish a prediction model of FBG for Korean men and women using the weighted genetic risk score (wGRS) method. We used the genotype data obtained from the Korea Association Resource (KARE) project. Only the SNPs which were reported in previous studies were used to construct prediction model. Also, we applied 10-fold cross-validation to our analysis procedure to improve the validity of our analysis.
Study subjects
A total 8,840 subjects (4,182 men and 4,658 women) initially used for this study. The genotype data was derived from KARE project [15] that approved by Public Institutional Bioethics Committee designated by the Ministry of Health and Ware (P01-201502-31-002). For quality of data, we removed inadequate samples and SNPs that have the data of call rate with under 0.98. The data of SNPs were also deleted with minor allele frequency (MAF) under 0.05. Finally, the 5,013 subjects (2,373 men and 2,640 women) were selected for further analyses. Among the 5,013 samples, we used 90% (2,135 men and 2,376 women) of samples as a SNP selection set for 10-fold cross validation process. The rest of samples (238 men and 264 women) were used as a final validation set.
The SNP were collected from GWAS catalog markers which were associated with blood glucose studies. In order to obtain genotype data, we used data of the collected markers of GWAS and other data derived from KARE data (2,581 SNPs) of nearby regions which are ±50 kb from the GWAS markers. Among the SNPs, we performed calculating the coefficients of Linkage Disequilibrium (LD) to avoid high LD values using the Haploview software [16]. Consequently, we obtained 207 SNP markers, which have shown related blood glucose studies previously. The regression analysis was conducted for the p-value that was used for identification SNPs in the training sets. We conducted the regression analysis using the GoldenHelix-SVS8 software (Bozeman, MT, USA).
SNP selection for fasting blood glucose
In order to prediction model, men subjects (2,135 samples) and women subjects (2,376 samples) were used for testing SNP selection sets separately. Using the training sets (1,921 men and 2,138 women) and the test sets (214 men and 238 women) of genotype data, 10-fold cross validation was conducted for identify SNPs. The SNP markers were selected for less than 0.05 of p-value across all sets from the top of 20 SNPs that were identified LD. Each p-value of 20 SNP markers is shown in Table 2 and Table 3. These SNPs were used for the wGRS. The wGRS was calculated by using the following formula: Weighted genetic risk score (wGRS) = where is weight that means the regression slope of selected SNPs, is the number of allele of FBG [17]. We conducted analysis separately by gender, using these scores to divide into 4 blocks after ascending sort of score. Then we calculated average of FBG values of each blocks and obtained the graphs. For validation these SNPs, the final validation sets (238 in men subjects and 264 in women subjects) were performed statistical analysis as same manner, resulting in trend line.
Clinical profile of the study subjects
In the present study, we used the genotype data obtained from a total of 5,013 individuals comprised of 2,373 men and 2,640 women subjects. Of the total samples, 90% (n=4,511) were used in the SNP selection set to obtain the significant SNPs for constructing the prediction model. The remaining 10% (n=502) were used in the Final validation set. The average age of men population was slightly lower than that of the women population in all analysis groups (51.57 vs. 52.62; 51.61 vs. 52.57; and 51.26 vs. 53.06 in the total subjects, SNP selection set, and final validation set, respectively). The FBG level was generally higher in men than women (90.46 vs. 85.74; 90.32 vs. 85.53; 91.70 vs. 86.44 in total subjects, SNP selection set, and final validation set). Detailed clinical profile of the samples is described in Table 1.
SNP selection to construct prediction model
To select SNPs for FBG prediction model, we performed 10-fold cross validation based on regression analyses using a total of 207 SNPs which collected from a GWAS catalog and nearby SNPs. After performing the 10-fold cross-validation process, we found that two SNPs (rs7754840 in CDKAL1 and rs12699673 located near DGKB) had constantly significant association with FBG level for men subject in all 10 training sets (Table 2). Similarly, we selected five SNPs (rs102275 in TMEM258, rs1574285 in GLIS3, rs2908289 in GCK, rs6494307 located near C2CD4B and rs917793 in YKT6) which had consistent significant association with FBG level for women subject (Table 3). Detailed information of selected SNPs was listed in Table 4 with their allele information, location, and genotype counts.
Results of prediction model for Fasting blood glucose level
Using the selected SNPs, we constructed a prediction model for FBG using wGRS method. We observed upward trends of blood glucose levels with increasing of wGRS in both men and women population (Figure 1). Finally, we applied wGRS of both men and women population in the final validation set. As we expected, fasting blood glucose levels increased with wGRS (Figure 2).
As diabetes mellitus type 2 (T2DM) is important to public health problem, numerous studies have been focused on identifying significant markers for T2DM. However, several previous reports have indicated that FBG might play a role in the prevention and treatment for various glucose-related diseases. One previous study showed that an increase in FBG level was significantly associated with upward diagnosis factors such as BMI and HDL-c [18]. Another recent study supported the correlation between FBG level and various clinical factors such as HbA1c, insulin, C-peptide and triglycerides [19]. These observations indicated that a prediction model for FBG level might contribute to public health not only with respect to diseases related to blood glucose but various factors of diseases.
In the present study, We used a total of 5,013 individuals to establish a prediction model for FBG level. Due to the low average FBG level of women compared to men, we decided to construct gender-specific FBG models rather than a single comprehensive model using total subjects. Based on our results, we selected two SNPs (rs7754840 and rs12699673) for men and five SNPs (rs6494307, rs1574285, rs102275, rs2908289, and rs917793) for women population. Numerous previous studies have demonstrated the relationship between the SNPs and glucose-related phenotypes.
Of the two polymorphisms for men population, rs7754840 (linked to rs9356744, r2 = 0.902) which is located in CDKAL1 was significantly associated with the Gestational Diabetes Mellitus (GDM). And it was used in the prediction model for T2DM in a Japanese population [20,21]. This significance was also found in other populations such as Iranian population and in large-scale meta-analysis using various populations [22,23]. Another study suggested an association between DPP-4 inhibitors which was used for T2DM treatment with rs7754840 [24]. The other selected SNP, rs2191349 (linked to rs12699673, r2 = 0.893) located near DGKB was responsible for various phenotypes of glucose metabolism. One previous study using Korean population proved an effect of rs2191349 in beta-cell function [25]. Other several studies have also support the association between the reduction of insulin and fasting glucose concentration [26,27].
As with the men population, numerous evidences has been found for the polymorphisms in the women subjects, with the exception two SNPs (rs6494307 located near C2CD4B and rs1574285 in GLIS3). The two SNPs, rs174550 (correlated with rs102275 in TMEM258, r2 = 1.000) and rs4607517 (correlated with rs917793 in YKT6, r2 = 0.886), have been found to affect beta-cell function, and were associated with GDM or prediabetes [25,28,29]. In addition, rs730497 (absolute LD with rs2908289 in GCK, r2=1.000) showed an effect to HbA1c level which was an important diagnosis factor for chronic glycemic and hyperglycemia [30,31]. However, there were no evidences of rs6494307 and rs1574285 impact on fasting glucose level or glucose related diseases. Further studies may be needed to identify the role of the SNPs in glucose metabolism.
The lack of confirmation using other independent cohorts was also one of limitation of the present study. Due to the problem, we selected only significant SNPs which had been reported on in previously studies. In this study, we set aside 10% of the total samples as a final validation set to overcome this limitation. Future studies might consider our limitation to build a more precise prediction model for FBG level using population-specific markers.
In summary, we constructed gender-specific FBG prediction models using genotype data from a Korean population. For the models, we selected two SNPs for men and five SNPs for women. Both of our models showed constantly upward trends of FBG level with increasing of wGRS. Prediction models of the present study might be useful for further glucose concentration studies and glucose-related diseases.