Skip to main content

Reveal key genes and factors affecting athletes performance in endurance sports using bioinformatic technologies

Abstract

Medium-intensity activities comprise the major proportion of many sorts of sports. The energy consumption of athletes has been a research emphasis for the purpose of improving both training efficiency and competition performance. However, the evidence based on large-scale gene screen has been rarely performed. This is a bioinformatic study revealing the key factors contributed to the metabolic difference between subjects with different endurance activity capacities. A dataset comprised of high- (HCR) and low-capacity running (LCR) rats was used. Differentially expressed genes (DEGs) were identified and analysed. The Gene Ontology (GO) and Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway enrichment was obtained. The DEGs' protein–protein interaction (PPI) network was built, and the enriched terms of the PPI network were also analysed. Our findings showed that the GO terms were enriched in lipid metabolism-related terms. The KEGG signalling pathway analysis enriched in the ether lipid metabolism. Plb1, Acad1, Cd2bp2, and Pla2g7 were identified as the hub genes. This study provides a theoretical foundation showing lipid metabolism plays an important role in the performance of endurance activities. Plb1, Acad1, and Pla2g7 may be the key genes involved. The training plan and diet for athletes can be designed based on above results and expecting a better competitive performance.

Peer Review reports

Introduction

Competitive athletic specialities are highly involved in oxygen consumption and metabolism. The performance of athletes is relied on the metabolic capabilities. Different sports have noticeable difference in oxygen consumption patterns. Maximum Oxygen Consumption (VO2 Max, ml/min/kg) refers to the maximum oxygen consumption while an individual is under maximal exercise. It is an indicator reflecting cardiorespiratory fitness and endurance capacity in exercise performance. Mishra et al. indicated that the football players (67.6700) have the highest VO2 Max, followed by basketball (65.5550), volleyball (63.2667) and hockey players (62.3858) [1]. The tennis and badminton players have lower VO2 Max, which are 50.69 and 50.20 respectively [2]. Noticeably, although some moderate-intensity sports such as tennis and badminton show lower VO2 Max, they still reach the maximum fat oxidation ranging from 45 to 65% VO2 Max [3]. Additionally, a series of terms comprehensively evaluate the pulmonary function capacities, such as vital capacity (VC), forced vital capacity (FVC), forced expiratory volume in one second (FEV1), and maximum voluntary ventilation (MVV). VC and FVC are defined as the maximal inhalation/exhalation volume in either relaxed or forced manner, respectively. MVV is the maximal volume of air that an individual can inhale/exhale by voluntary effort in one minute. FEV1 represents the maximal amount of air can forcibly exhale during the first second after a maximal inhalation. They are important measurements of pulmonary function. Usually a higher value indicates a better pulmonary function, or from another perspective, a more intensive activity engagement. Mazic et al. [4] investigated the VC, FVC, FEV1, and MVV of 493 top athletes in 15 different sports disciplines. It was found that basketball players, water polo players, and rowers all exhibited better VC, FVC, FEV1 than healthy inactive controls. When compared to controls, football and volleyball players showed lower VC and FVC. Boxing, kayaking, rugby, handball, taekwondo, and tennis all had reduced peak expiratory flow. These results indicate that the performance of football, basketball, and volleyball could be the subjects that are influenced mostly by oxygen metabolic capability. Despite of not being included in these subjects, tennis was indicated in another research that the FVC, FEV1, and MVV ratios were higher in tennis players than in the sedentary control individuals [5]. Collectively, these evidence raise needs of understanding the metabolic mechanism of aerobic-relied sports, which could be beneficial to the training and competition. However, the key factors at gene level influencing athlete’s metabolism have been rarely studied. This bioinformatic study can help to reveal the key node of improving the metabolic capabilities and thus the performance. The results can be used to design a more scientific diet and training strategy.

High-capacity running (HCR) and low-capacity running (LCR) rats are an ideal animal model for studying the interaction between aerobic capacities and chronic disease. They are artificial selection for intrinsic aerobic endurance running capacity, which is described in detail in a previous report [6]. Briefly, the HCR lines have 171% longer maximal running distance until exhaustion than the LCR lines after six generations of selection.. In this bioinformatic study, HCR and LCR rats at generation 18 were adopted. We assume they have different gene expression levels which led to huge aerobic capacities between HCR and LCR rats.

Microarray technology provides a high throughput method that can analyse tens of thousands of gene expression patterns in one test [7]. It is a powerful tool for comparison of bulk gene expression between conditions. Open-source databases such as the Gene Expression Omnibus (GEO) database [8] can be used to upload and share microarray dataset. Kivelä et al. [9] were using HCR and LCR rat model to study connections between low aerobic exercise capacity and complex metabolic diseases. They briefly displayed a preliminary microarray screen for the genome expression profiles with aerobic endurance capacity and metabolic disease risk factors. This study was conducted based on selected Kivelä’s data on gene expression patterns affected by rat strains (HCR or LCR), and further analysed the in-depth understanding of key genes and signalling pathways involved in performance of aerobic exercise. Datasets were downloaded containing gene expression of HCR and LCR rats. Differentially expressed genes (DEGs) were identified, and the hub genes and critical terms involved were investigated using Gene Ontology (GO), Kyoto Encyclopaedia of Genes and Genomes (KEGG), and protein–protein interaction (PPI) network analyses. In summary, 31 DEGs and 4 hub genes were identified. The lipid metabolism may be a key factor in improving athlete’s performance in endurance exercise.

Materials and methods

Microarray data information

GSE17190 gene expression profiles were obtained using the platform Illumina ratRef-12 v1.0 expression beadchip (Illumina, San Diego, US) [9] from a public functional genomics data repository GEO database (https://www.ncbi.nlm.nih.gov/geo) [8]. To screen HCR and LCR rats, endurance running ability was measured on a treadmill, and the total distance covered throughout the test was utilised to calculate maximum aerobic exercise capacity. Rats with the maximum running capacity from each generation were bred together to create the HCR strain, whereas rats with the lowest running capacity were bred together to create the LCR strain. 24 female rats (12 HCR and 12 LCR) from generation 18 was used in the study. Detailed information about the animals can be found in the reference [9], and all webtools and software used in this study can be found in the Appendix.

Data preprocessing and identification of DEGs

The preprocessing consists of several steps. In the raw data, the commercial probe serial numbers were converted into official gene symbols. Then uploaded onto the NetworkAnalyst 3.0 (https://www.networkanalyst.ca) [10], which is a visual analytics platform for comprehensive gene expression profiling and meta-analysis, for further preprocessing. Genes with variance percentile rank lower than 15, abundance lower than 4, or unannotated were discarded. After that, log2 transformation on the gene expression levels was performed for standardisation. The Limma package was then utilised to identify the up-regulated and down-regulated DEGs between HCR and LCR rats. The Benjamini–Hochberg test was also used to adjust the p-value. Finally, the DEG cut-off criterion was set to log2 fold change |log2FC|> 0.5 with an adjusted P < 0.05.

GO and pathway enrichment analyses

The g:Profiler (http://biit.cs.ut.ee/gprofiler/) is a publicly accessible web service for analysing and altering gene lists derived from high-throughput genomic data [11]. The g:GOSt on g:Profiler was used to do GO enrichment and KEGG pathway enrichment analysis (based on KEGG Biological Pathway database [12]) of DEGs in this study. P < 0.05 and the custom-made g:SCS algorithm [11] were used as cut-off criterion. This combination gives a tighter threshold to significant results hence more reliable, as it considers the set structure underlying gene sets annotated to terms of each organism [11]. Biological Processes (BP), Cellular Component (CC), and Molecular Function (MF) are all included in the GO analysis.

PPI network construction

The STRING (https://string-db.org) database [13] was used to recover the predicted associations between proteins encoded by DEGs and other proteins in order to understand the molecule mechanism and study the interactions between dynamic compression and chondrogenesis, as well as between proteins encoded by DEGs and other proteins. Since the number of identified DEGs is relatively low, PPI significance is defined as a confidence score of > 0.4, which is a range of medium to high confidence providing a balance between false positive and missing information. The false positive results can be identified by the following wet lab experiments. To visualise a PPI network, the interaction data was uploaded into the Cytoscape programme (version 3.8.0). In the PPI network, nodes represent proteins, and edges represent interactions. Their interaction is constructed based on the identified biological facts stored in the STRING database. The isolated proteins were excluded and the main network was preserved. The degree distribution was calculated by calculating the number of connections between the network's distinct proteins.

PPI network function analysis

The g:Profiler was used to further analyse GO terms and pathway enrichment in the main PPI network found by the Cytoscape. This can provide a more accurate information about the function of key genes.

Results

Data preprocessing and identification of DEGs

After normalisation, the gene expression data is shown in Fig. 1. In HCR rats, a total of 31 DEGs were identified, with 17 (54.84%) up-regulated genes and 14 (45.16%) down-regulated genes compared to LCR rats. The volcano map (Fig. 2) shows the differential expression status of all discovered genes while highlighting DEGs that are greater than the chosen cut-off value. Figure 3 shows the cluster heatmap of DGEs. To cluster the genes and generate the dendrograms, Euclidean distance was used. There are significant changes in DEG expression between the HCR and LCR groups, indicating that the DEGs are trustworthy and appropriate for further investigation. Table 1 lists the top ten most substantially up-regulated and down-regulated genes.

Fig. 1
figure 1

This is a figure. Schemes follow the same formatting

Fig. 2
figure 2

All genes found in the microarray are shown as a volcano. A gene is represented by each dot. Down- (blue) and up-regulated genes (red) are distinguished by colours. The log2-base fold change is on the X-axis, while the log10-base corrected P-value is on the Y-axis

Fig. 3
figure 3

Cluster heatmap depicts DEG-based hierarchical clustering analysis findings. Each column is a sample, and each row represents a DEG. The colour represents the relative degree of gene expression (log2 transformed). Green denotes lower gene expression levels, whereas red suggests greater levels

Table 1 The top 10 most significantly up-regulated and down-regulated DEGs

GO and Pathway Enrichment Analyses

The biological function of DEGs was determined using GO enrichment and KEGG pathway enrichment studies. Lipid metabolic process, cellular lipid catabolic process, and cellular lipid metabolic process were the biological process with the greatest significant enrichment in GO terms. The cellular component with the highest significant enrichment was specific granule membrane, organelle outer membrane, and outer membrane, although there was no obvious statistical difference shown. The greatest significant enrichment in molecular function was calcium-independent phospholipase A2 activity and oxidoreductase activity, acting on the CH-CH group of donors. The ether lipid metabolism was shown to be associated to the difference of aerobic exercise capability in the KEGG pathway enrichment, without statistical significance. Figure 4 and Table 2 show the whole list of enhanced GO terms and KEGG pathways.

Fig. 4
figure 4

GO enrichment and KEGG pathway enrichment analysis. Each bubble represents a term. The height of the bubbles stands for the significance of enrichment. The horizontal distance stands for the similarity of term subtrees. Bubble size stands for term size (gene counts). The X-axis represents the group of functional terms and coloured by data sources, and the Y-axis lays out adjusted p-value in negative log10 scale

Table 2 Significantly enriched GO terms of DEGs

PPI network construction

The STRING database created a PPI network comprising all DEGs (Fig. 5) with 4 nodes and 3 edges. The majority of proteins were isolated and thus excluded from the network. The Plb1, Acad1, Cd2bp2, and Pla2g7 were screened as hub genes. The degree of Plb1 and Acad11 were 2 and Cd2bp2 and Pla2g7 were 1. Plb1, Acad1, Cd2bp2, and Pla2g7 play a significant role in duration of aerobatic exercise.

Fig. 5
figure 5

PPI network of all DEGs. In the main network, the degree of Plb1 and Acad11 were 2 and Cd2bp2 and Pla2g7 were 1

PPI network function analysis

The Fig. 6 and Table 3 summarised the GO terms and KEGG pathway enriched in the PPI main network. The biological process with the most significant enrichment in GO terms were cellular lipid catabolic process, phosphatidylcholine catabolic process, and lipid metabolic process. In cellular component, low-density lipoprotein particle, high-density lipoprotein particle, and plasma lipoprotein particle were identified without significant statistical difference. In terms of molecular function, the most significant enrichment was calcium-independent phospholipase A2 activity, phospholipase A2 activity, and phospholipase activity. The ether lipid metabolism was enriched in the KEGG pathway enrichment (Fig. 6 and Table 3).

Fig. 6
figure 6

GO enrichment and KEGG pathway enrichment analysis of PPI main network. Each bubble represents a term. The height of the bubbles stands for the significance of enrichment. The horizontal distance stands for the similarity of term subtrees. Bubble size stands for term size (gene counts). The X-axis represents the group of functional terms and coloured by data sources, and the Y-axis lays out adjusted p-value in negative log10 scale

Table 3 Significantly enriched GO terms of PPI main network

Discussion

Competitive athletic specialities are facing huge energy consumption during training and competition. Carbohydrate and fat contributions to energy generation vary depending on exercise duration and intensity, training state, gender, and so forth. Football, basketball, volleyball, tennis, badminton, and others has traditionally been thought of being an intermittent activity, with bursts of high intensity movement interspersed with extended periods of medium-intensity activities (e.g. walking, jogging, etc.). Thus, professional players must therefore improve both anaerobic and aerobic capacities in order to prepare for competition. However, the medium-intensity activities occupy the majority time in a competitive game, player’s frequent movement ensures a good location for performing the last critical high intensity skills. This leads to the requirement on understanding the key factors improving the capability of endurance exercise. The evidence at gene level can be fundamental to provide an insight to metabolism and guidance to a better training/diet strategy.

This bioinformatic study found that compared to the LCR rats, the HCR rats had highly enriched GO terms regarding lipid metabolism, including lipid metabolic process, cellular lipid catabolic process, cellular lipid metabolic process, and phosphatidylcholine catabolic process. The KEGG signalling pathway analysis also showed the ether lipid metabolism, which is a lipid metabolism-related pathway, enriched in the HCR rats. Lipid is an important energy source during exercise. They are utilised in forms of circulating free fatty acids (FFAs) bound to albumin, triglycerides stored in very-low-density lipoprotein, and intramuscular triglyceride stores [14]. Exercise intensity influences substrate choice. Lipid is the primary fuel at low exercise intensities (under 40% VO2 Max), but as intensity increases, the body shifts to carbohydrate (CHO). Between 45 and 65% VO2 Max, maximum fat oxidation occurs and over this point the fuel preference balance switches toward carbohydrate. Endurance activities increase the energy utilisation from fat while sparing carbohydrate sources [3, 14]. A study investigated influence on plasma lipid parameters by high quality physical training in 11 groups of athletic specialities, including football, basketball, volleyball, boxing, wrestling, judo, sailing, skiing (slalom), track (two groups), throwing, and jumping. The results demonstrated that endurance sports, such as football, basketball, volleyball, short- and long-distance running showed higher high-density lipoprotein (HDL) and lipoprotein ratio factor (RF) values, compared to those strength sports such as wrestling, boxing, skiing (slalom), and throwing-jumping [15]. Another research studied 71 male athletes (25 football players, 14 volleyball players and 32 healthy sedentary subjects). Results showed that both football and volleyball players had considerably less body fat than the control group. Football players had the lowest body fat percentage [2]. Interestingly, there is evidence showing the capability of using lipid as energy is different in genders. A mice model study indicated that females had considerably increased gene expression in genes involved in skeletal muscle fatty acid oxidation. Female mice have a stronger endurance exercise capacity and a larger ability to mobilise and use fatty acids for energy [16]. However, we are unable to compare the gene expression difference in genders in this bioinformatic study as all rat participants were female. Collectively, the performance of aerobic exercise and training are highly relevant to the lipid metabolic ability and efficiency.

Plb1 is a protein coding gene and related to phospholipases and lipid metabolism. As FFAs are an importance energy source during exercise, a sufficient FFAs pool is critical for energy supply. Currently most of the reports of Plb1 focus on microbial species [17, 18]. Wright et al. [18] used yeast for studying FFA production. The yeast had high FFA production but the deletion of phospholipase genes Plb1 and Plb2 resulted in a 46% decrease in FFA levels and 105% increase in phospholipid levels. The decrease of FFA levels and increase of phospholipid levels strongly suggested that Plb1 is pivotal in FFAs formation and the FFAs were mainly generated through phospholipid hydrolysis [19]. However, it also indicated that the over expression of Plb1 didn’t witness a significant increase in FFAs, which suggested that phospholipase activity may not be a major limiting step in FFA production. There is also evidence showing the phospholipases are an important part of the virulence of pathogenic fungi [20]. So far, there lacks studies on mammals. Further research would help us understand the function of Plb1 in human and other animals.

Acad1 is also known as medium-chain acyl-CoA dehydrogenase (Acadm), an enzyme that catalyses the first step of β-oxidation and responsible for the breakdown of medium-chain fatty acids in the mitochondria [21]. Leg blood flow increases dramatically at the beginning of and throughout exercise, from around 0.3 L/min at rest to 5–6 L/min at moderate exercise intensities and up to 9 –12 L/min at 100% VO2 Max, which brings more FFAs and glucose [22]. Fatty acids are a primary source of energy for heart and oxidative skeletal muscle. Once within the myocyte, fatty acids are directed to either lipid metabolite production or mitochondrial β-oxidation. Intramuscular lipids can accumulate when fatty acid intake exceeds the rate of β-oxidation. On a mouse model, Acadm knockdown remarkably enhanced lipid accumulation [21]. In a study using biopsy specimens of 74 patients with nonalcoholic fatty liver disease (NAFLD), the expression of all genes (including Acad1) involved in the metabolism of fatty acids and iron were significantly downregulated as the stage and grade of NAFLD progressed [23]. It is shown that the Phosphatidylinositol 3-kinase (PI3K) signaling pathway plays an important role in the cellular lipid metabolism regulation. The PI3KR3 is a subunit of PI3K, it was increased after fasting and was downregulated in a high-fat diet (HFD) mouse model of fatty liver, which suggests a strong relationship between PI3KR3 and lipid metabolism. The overexpression of PIK3R3 upregulated the protein and mRNA expression levels of Acad1 and resulted in enhanced fatty acid β-oxidation and reduced fatty liver in vivo. Conversely, knockdown of PIK3R3 downregulated protein and mRNA expression levels of Acad1 which led to impaired hepatic fatty acid β-oxidation in cell culture and in vivo [24]. In terms of genders, it is shown that women have a lower respiratory exchange ratio (RER) compared with men, indicating higher lipid oxidation and can oxidize more fat during moderate intensity endurance exercise [25]. In summary, the Acad1 and PI3KR3 both are important candidates in lipid metabolism. They function in the body synergistically. A higher expression of these two genes may contribute to a more efficient energy gain from the lipocatabolic process.

There are limited reports on the function of Cd2bp2 and Pla2g7 in lipid metabolism. It has been known that the Cd2bp2 (adaptor protein CD2-binding protein 2) is a gene involved in oxidative phosphorylation, gluconeogenesis, lipid metabolism, signal transduction [26] and T-cell activation and immunology [27]. However, based on current literature, the role of Cd2bp2 largely remains unclear. Pla2g7 (platelet-activating factor acetylhydrolase) is well recognised for degrading platelet-activating factor, a phospholipid mediator of platelet aggregation and a powerful inflammatory stimulator and also a participant in lipid metabolism. It functions at the intersection of metabolism and immunity [28]. In human, Pla2g7 expression is increased in activated macrophages and atherosclerotic plaque foam cells [29]. It has also been identified as a key gene involved in coronary heart disease due to dysregulation of phospholipid metabolism [30]. In mice, obesity and a high-fat diet increased Pla2g7 expression, but caloric restriction reduced the expression. Pla2g7 regulates pathways that contribute to local and systemic metabolic integrity, immunological modulation, and inflammation through immunometabolic control in diverse tissues [28]. Similarly, another study reported the Pla2g7 had lower expression in humans undergoing caloric restriction (CR). The CR-induced low expression of Pla2g7 can decrease glucose consumption, lead to better adipose tissue metabolism, lower inflammation, and reduced thymic lipoatrophy [31]. The Cd2bp2 and Pla2g7 may be two new interesting targets and worthy of further investigations.

Based on above results, lipid metabolism is a critical point to the athletes participating in endurance exercise. We raise suggestions for improving their performance. The fat, in the form of our bodily reserves, provides a virtually limitless supply of energy. Improving our capacity to transfer it into the muscle and oxidise it during activity is a vital adaptation to training. Even the most highly trained athletes have not reached their maximum fat oxidation capacity during exercise, which may be improved even more by eating a high-fat meal before to the activity. Although fat oxidation has limited capability as a fuel source for high-intensity activities, if fat supplements or other items could improve fat utilisation at more moderate exercise intensities, it might provide a mechanism to 'spare' muscle glycogen reserves for the high-intensity stages of sport. Research suggests that diets containing 32% to 55% fat can boost endurance capacity compared to diets containing 15% fat [32]. Another strategy is 'fat adaptation' diet, a diet in which well-trained endurance athletes take a high-fat, low-CHO diet for up to 2 weeks while continuing their regular training, followed by CHO restoration (consuming a high-CHO diet and tapering for 1 – 3 days before a big endurance event). This "dietary periodization" approach boosts whole-body and muscle fat oxidation while decreasing muscle glycogenolysis during submaximal exercise as compared to an isoenergetic CHO diet over the same intervention period [33]. Except from diet, scientific training methods are also indispensable. Besides, the difference of gene expression levels is non-negligible. It could potentially affect the capability of lipid metabolism of athletes. Those athletes who innately have higher expression level of Plb1 and Acad1 and lower level of Pla2g7 may have more efficient lipid energy usage ability and thus better performance in endurance exercise. The role of Cd2bp2 is worthy to be further investigated.

Although this study has identified some key factors and genes affecting the performance in endurance exercise through bioinformatic methods, there still are some limitations. Above findings are database and algorism-based, they have not been verified by lab-based verification experiment. The bioinformatic tools are largely reliable but few false positive or false negative results may be inevitable due to the chosen thresholds. We are planning a following verification experiment in the near future to further support our findings. Additionally, wild type control groups are missing, as they are not included in the original dataset. However, the bioinformatic databases are running in a manner of voluntary sharing, limited datasets match our criteria. For example, some animals were treated with drugs or underwent extra conditions. The dataset utilised in this manuscript is the most matching one, despite it is not the very latest one. Rats have genetic similarity to human beings, but they still could lead to different results in the same condition as human. Using rats instead of human for study is a consideration of ethics and expense. The results we have in this study may not fully represent those of human beings.

Conclusions

In this study we used a set of gene microarray data to identify the RNA expression difference between HCR and LCR rats. It was found that the major difference in biological process is the HCR rats has better lipid metabolic capabilities, which may result in better efficiency of energy gain than the LCR rats. The KEGG signalling pathway analysis enriched in the ether lipid metabolism. Four hub genes including Plb1, Acad1, Cd2bp2, and Pla2g7 were identified. We suggest a diet strategy contains sufficient fat intake can improve the performance in endurance exercise by maximising the usage of lipid energy. Innate difference of gene expression levels may also affect the performance of athletes.

Availability of data and materials

Dataset GSE17190 used in this study can be downloaded on the GEO database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE17190).

References

  1. Mishra MK, Pandey AK, Chaubey D. A comparative study of VO2 max among the basketball, football, volleyball and hockey male players. Int J Appl Res. 2015;1(11):245–7.

    Google Scholar 

  2. Masanovic BJSM. Comparative Study of Anthropometric Measurement and Body Composition between Junior Basketball and Volleyball Players from Serbian National League. Sport Mont. 2018;16(3):19–24.

    Article  Google Scholar 

  3. Ranallo RF, Rhodes EC. Lipid metabolism during exercise. Sports Med. 1998;26(1):29–42.

    Article  CAS  PubMed  Google Scholar 

  4. Mazic S, Lazovic B, Djelic M, Suzic-Lazic J, Djordjevic-Saranovic S, Durmic T, Soldatovic I, Zikic D, Gluvic Z, Zugic V. Respiratory parameters in elite athletes–does sport have an influence? Rev Port Pneumol (2006). 2015;21(4):192–7.

    CAS  PubMed  Google Scholar 

  5. Indira Y, Rani BBS. A Comparative Study of Pulmonary Function Tests between Normal Male Sedentary and Tennis Players, India. Int J Sci Stud. 2019;6(11):137–40.

    Google Scholar 

  6. Koch LG, Britton SL. Artificial selection for intrinsic aerobic endurance running capacity in rats. Physiol Genomics. 2001;5(1):45–52.

    Article  CAS  PubMed  Google Scholar 

  7. Lockhart DJ, Winzeler EA. Genomics, gene expression and DNA arrays. Nature. 2000;405(6788):827–36.

    Article  CAS  PubMed  Google Scholar 

  8. Clough E, Barrett T. The Gene Expression Omnibus Database. Methods Mol Biol. 2016;1418:93–110.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Kivela R, Silvennoinen M, Lehti M, Rinnankoski-Tuikka R, Purhonen T, Ketola T, Pullinen K, Vuento M, Mutanen N, Sartor MA, Reunanen H, Koch LG, Britton SL, Kainulainen H. Gene expression centroids that link with low intrinsic aerobic exercise capacity and complex disease risk. FASEB J. 2010;24(11):4565–74.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Zhou G, Soufan O, Ewald J, Hancock REW, Basu N, Xia J. NetworkAnalyst 3.0: a visual analytics platform for comprehensive gene expression profiling and meta-analysis. Nucleic Acids Res. 2019;47(W1):W234–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Raudvere U, Kolberg L, Kuzmin I, Arak T, Adler P, Peterson H, Vilo J. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 2019;47(W1):W191–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 2023;51(D1):587–92. https://doi.org/10.1093/nar/gkac963.

  13. Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, Bork P, von Mering C. STRING 8–a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. 2009;37(Database issue):D412-6.

    Article  CAS  PubMed  Google Scholar 

  14. Noland RC. Exercise and Regulation of Lipid Metabolism. Prog Mol Biol Transl Sci. 2015;135:39–74.

    Article  PubMed  Google Scholar 

  15. Tsopanakis C, Kotsarellis D, Tsopanakis AD. Lipoprotein and lipid profiles of elite athletes in Olympic sports. Int J Sports Med. 1986;7(6):316–21.

    Article  CAS  PubMed  Google Scholar 

  16. Holcomb LE, Rowe P, O’Neill CC, DeWitt EA, Kolwicz SC Jr. Sex differences in endurance exercise capacity and skeletal muscle lipid metabolism in mice. Physiol Rep. 2022;10(3):e15174.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Aloulou A, Rahier R, Arhab Y, Noiriel A, Abousalham A. Phospholipases: An Overview. Methods Mol Biol. 1835;2018:69–105.

    Google Scholar 

  18. Wright LC, Santangelo RM, Ganendren R, Payne J, Djordjevic JT, Sorrell TC. Cryptococcal lipid metabolism: phospholipase B1 is implicated in transcellular metabolism of macrophage-derived lipids. Eukaryot Cell. 2007;6(1):37–47.

    Article  CAS  PubMed  Google Scholar 

  19. Ferreira R, Teixeira PG, Siewers V, Nielsen J. Redirection of lipid flux toward phospholipids in yeast increases fatty acid turnover and secretion. Proc Natl Acad Sci U S A. 2018;115(6):1262–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Djordjevic JT, Del Poeta M, Sorrell TC, Turner KM, Wright LC. Secretion of cryptococcal phospholipase B1 (PLB1) is regulated by a glycosylphosphatidylinositol (GPI) anchor. Biochem J. 2005;389(Pt 3):803–12.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Ma APY, Yeung CLS, Tey SK, Mao X, Wong SWK, Ng TH, Ko FCF, Kwong EML, Tang AHN, Ng IO, Cai SH, Yun JP, Yam JWP. Suppression of ACADM-Mediated Fatty Acid Oxidation Promotes Hepatocellular Carcinoma via Aberrant CAV1/SREBP1 Signaling. Cancer Res. 2021;81(13):3679–92.

    Article  CAS  PubMed  Google Scholar 

  22. Lundsgaard AM, Fritzen AM, Kiens B. Molecular Regulation of Fatty Acid Oxidation in Skeletal Muscle during Aerobic Exercise. Trends Endocrinol Metab. 2018;29(1):18–30.

    Article  CAS  PubMed  Google Scholar 

  23. Mitsuyoshi H, Yasui K, Harano Y, Endo M, Tsuji K, Minami M, Itoh Y, Okanoue T, Yoshikawa T. Analysis of hepatic genes involved in the metabolism of fatty acids and iron in nonalcoholic fatty liver disease. Hepatol Res. 2009;39(4):366–73.

    Article  CAS  PubMed  Google Scholar 

  24. Yang X, Fu Y, Hu F, Luo X, Hu J, Wang G. PIK3R3 regulates PPARalpha expression to stimulate fatty acid beta-oxidation and decrease hepatosteatosis. Exp Mol Med. 2018;50(1):e431.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Maher AC, Akhtar M, Vockley J, Tarnopolsky MA. Women have higher protein content of beta-oxidation enzymes in skeletal muscle than men. PLoS ONE. 2010;5(8):e12025.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Li BL, Zhao XS, Sun XM, Li J, Chen J, Luo R, Zhang Q. Effect of serum from sub-healthy subjects with fatigue on the gene expression profile of skeletal muscle cells. Mol Med Rep. 2013;7(2):454–60.

    Article  CAS  PubMed  Google Scholar 

  27. Heinze M, Kofler M, Freund C. Investigating the functional role of CD2BP2 in T cells. Int Immunol. 2007;19(11):1313–8.

    Article  CAS  PubMed  Google Scholar 

  28. Rhoads TW, Anderson RM. Caloric restriction has a new player. Science. 2022;375(6581):620–1.

    Article  CAS  PubMed  Google Scholar 

  29. Ferguson JF, Hinkle CC, Mehta NN, Bagheri R, Derohannessian SL, Shah R, Mucksavage MI, Bradfield JP, Hakonarson H, Wang X, Master SR, Rader DJ, Li M, Reilly MP. Translational studies of lipoprotein-associated phospholipase A(2) in inflammation and atherosclerosis. J Am Coll Cardiol. 2012;59(8):764–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Chen W, Xia X, Song N, Wang Y, Zhu H, Deng W, Kong Q, Pan X, Qin C. Cross-Species Analysis of Gene Expression and Function in Prefrontal Cortex, Hippocampus and Striatum. PLoS ONE. 2016;11(10):e0164295.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Spadaro O, Youm Y, Shchukina I, Ryu S, Sidorov S, Ravussin A, Nguyen K, Aladyeva E, Predeus AN, Smith SR, Ravussin E, Galban C, Artyomov MN, Dixit VD. Caloric restriction in humans reveals immunometabolic regulators of health span. Science. 2022;375(6581):671–7.

    Article  CAS  PubMed  Google Scholar 

  32. Pendergast DR, Leddy JJ, Venkatraman JT. A perspective on fat intake in athletes. J Am Coll Nutr. 2000;19(3):345–50.

    Article  CAS  PubMed  Google Scholar 

  33. Yeo WK, Carey AL, Burke L, Spriet LL, Hawley JA. Fat adaptation in well-trained athletes: effects on cell metabolism. Appl Physiol Nutr Metab. 2011;36(1):12–22.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Data source declaration

The authors declare that they have no involvement in the rats experiment but only used the genomic data published on the public database.

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualisation, methodology, software, formal analysis; writing—original draft preparation, J.Y.; writing—review and editing, supervision, J.B. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jie Bai.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file1:

Figure 1. This is a figure. Schemes follow the same formatting. Figure 2. All genes found in the microarray are shown as a volcano. A gene is represented by each dot. Down- (blue) and up-regulated genes (red) are distinguished by colours. The log2-base fold change is on the X-axis, while the log10-base corrected P-value is on the Y-axis. Figure 3. Cluster heatmap depicts DEG-based hierarchical clustering analysis findings. Each column is a sample, and each row represents a DEG. The colour represents the relative degree of gene expression (log2 transformed). Green denotes lower gene expression levels, whereas red suggests greater levels. Figure 4. GO enrichment and KEGG pathway enrichment analysis. Each bubble represents a term. The height of the bubbles stands for the significance of enrichment. The horizontal distance stands for the similarity of term subtrees. Bubble size stands for term size (gene counts). The X-axis represents the group of functional terms and coloured by data sources, and the Y-axis lays out adjusted p-value in negative log10 scale. Figure 5. PPI network of all DEGs. In the main network, the degree of Plb1 and Acad11 were 2 and Cd2bp2 and Pla2g7 were 1. Figure 6. GO enrichment and KEGG pathway enrichment analysis of PPI main network. Each bubble represents a term. The height of the bubbles stands for the significance of enrichment. The horizontal distance stands for the similarity of term subtrees. Bubble size stands for term size (gene counts). The X-axis represents the group of functional terms and coloured by data sources, and the Y-axis lays out adjusted p-value in negative log10 scale.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yan, J., Bai, J. Reveal key genes and factors affecting athletes performance in endurance sports using bioinformatic technologies. BMC Genom Data 24, 10 (2023). https://doi.org/10.1186/s12863-023-01106-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12863-023-01106-9

Keywords