1. Introduction
Medicinal plants have often been selected for the phytochemicals extraction [
1]; however, the residues of cultivated plants have been also investigated for the production of raw antifungal extracts in a view of circular economy and agricultural waste valorization. Examples of non-edible plant parts used as source of phytochemicals for anti-pathogenic applications were reported for pepper and eggplant leaves [
2,
3]. Potato is an important food and economic crop worldwide, ranking first in both yield and planting areas. In recent years, research on potatoes has increased significantly; however, handling the leaf waste generated after planting remains a challenging issue. Although some of this waste can be used as silage, a significant portion is still discarded, leading to the waste of valuable plant resources and causing severe environmental harm. Therefore, research and development on the resource utilization of potato leaves has great practical importance. Studies have shown that the aerial parts of potatoes, namely, their leaves, contain various chemical components, primarily protein compounds, alkaloids, phenolic compounds, and flavonoids [
4,
5,
6,
7]. Plants produce various metabolites with bactericidal, fungicidal, insecticidal, and nematocidal properties to protect themselves against phytophagous organisms and diseases [
8]. Several studies have demonstrated the potential of these natural substances to be integrated with traditional crop management practices, addressing the urgent need for sustainable agriculture, safe food production, and organic farming [
9,
10,
11].
Steroidal glycoalkaloids (SGAs) categorized as alkaloids, are a class of secondary metabolites with multiple biological activities in potatoes [
12,
13]. In potato, these compounds are found mainly in the leaves, stems, buds, and tuber skin. Notably, their content can significantly increase when tubers are mechanically damaged, exposed to light, or sprout [
14]. In potato tubers, α-chaconine and α-solanine constitute more than 90% of the total SGA content [
15]. However, more than 50 different SGAs, including solanidine, leptines, demissine, and tomatine, have been identified in various wild potato species and commercial cultivars [
16,
17]. However, the main components of steroidal alkaloids in potato leaves have rarely been reported. As natural defense substances against pests and diseases, SGAs are potential raw materials for the development of plant-based pesticides [
18]. In addition to SGAs, terpenoids also play crucial roles in potato leaves. Although numerous terpenoids have been identified in plants [
19], the specific composition of the compounds present in potato leaves remains unclear.
Thanks to the development of advanced analytical technologies, it is now possible to better characterize the complex composition of many plants’ raw materials, with a focus on understanding their biologically active compounds. SGA biosynthesis is a complex enzymatic process that shares common synthetic pathways, substrates, and intermediates with terpenes, saponins, and phytosterols [
20]. SGAs exhibit weak basicity. When exposed to acids and heat, their glycosidic bonds are cleaved, releasing solanine and sugars. Under alkaline conditions, they form precipitates that can be separated. Owing to these properties, weakly acidic solutions and organic solvents are commonly used to extract SGAs [
21]. Building on existing SGA extraction techniques, we employed an optimized method to extract SGAs from potato leaves, obtaining a crude extract [
22]. A C18 purification column was used to isolate purified products. To date, research on the factors affecting SGA levels has focused primarily on α-solanine and α-chaconine, neglecting other naturally occurring SGAs. Therefore, the work aims to explore scientific knowledge on the content of the main metabolites, focusing on the group of main compounds present in different extracts from potato leaves. We analyzed the crude extracts and purified extracts from potato leaves through LC-MS/MS technology, systematically characterized the phytochemical properties of both crude extracts and purified extracts. The main components of the extracts were identified and screened by untargeted metabolomics, providing foundational data for future investigations into the biological activities of these metabolites.
2. Materials and Methods
2.1. Plant Materials
The potato variety used in the experiment was 'Jizhangshu No. 12', which was collected from the Potato Experimental Base of the Agricultural and Forestry Science Research Institute in Ulanqab, Inner Mongolia. During the late growth stage, potato leaves were harvested. The fresh samples were weighed, dried, crushed, sieved, and then stored in sealed bags for later use.
2.2. Preparation of Steroidal Alkaloid Extracts
The potato leaf powder (5 g) was weighed and mixed with 1.5% cellulase in a sealed container. Next, 40 mL of acidic aqueous solution (adjusted to pH 5 with 0.02 mol/L hydrochloric acid in distilled water) was added. The mixture was vortexed and extracted via ultrasonic treatment in a water bath at 45°C for 30 min. Subsequently, the enzymatic extraction was performed in a water bath for 1.5 hours. After enzymatic hydrolysis, the mixture was heated to 95°C to deactivate cellulase. Anhydrous ethanol was added until the total volume reached 70%, and the mixture was left to stand for 18 h. The mixed solution was then filtered under reduced pressure, and the filtrate was transferred to a rotary evaporator for concentration. The remaining residue was dissolved in 30 mL of 5% H2SO4 and filtered, and the pH of the filtrate was adjusted to 10.0-10.5 with concentrated ammonia water. The solution was stored overnight in a refrigerator at 4°C. The sample was subsequently centrifuged at 4000 r/min for 20 min, after which the supernatant was discarded. The precipitate was washed several times with 1% aqueous ammonia until the washing solution became clear. The resulting precipitation was used as the crude extract. This crude extract was dissolved in 5 mL of methanol and decolorized using activated carbon, yielding a 5 mL methanol extract (steroidal alkaloid extract), referred to as TQW, for LC‒MS/MS detection.
2.3. Purification of Steroidal Alkaloid Crude Extracts
The purification was performed via a C18 solid-phase extraction column. First, the column was activated by adding 5 mL of methanol, followed by vacuum application to remove the eluent. The column was then reactivated by adding 5 mL of distilled water, and the eluent was discarded. Next, 6 mL of the crude extract was loaded onto the column and vacuumed, and the waste liquid was discarded. The column was washed with 5 mL distilled water, and the elusion was discarded under vacuum. A collection tube was placed under the column, and 5 mL of methanol was added to the column. The eluent was vacuum-collected, yielding purified steroidal alkaloids, referred to as CHW, for LC‒MS/MS detection.
2.4. Detection Methods
2.4.1. Liquid Samples
The sample was removed from the -80°C refrigerator, thawed until there was no ice in the sample, vortexed for 10 s, and mixed thoroughly. One hundred microlitres of each sample was added to a corresponding 1.5 mL centrifuge tube, and 100 μL of 70% methanol was added to the internal standard extraction mixture (less than 100 μL of extraction mixture was added at a ratio of 1:1 (v/v)). The mixture was vortexed for 15 min at 12000 r/min at 4°C and centrifuged for 3 min. The supernatant was removed, filtered through a microporous membrane (0.22 μm pore size), and stored in an injection vial for LC‒ MS/MS detection.
2.4.2. HPLC Conditions
All the samples were subjected to two LC‒MS methods. One aliquot was analyzed under positive ion conditions and eluted from a T3 column (Waters ACQUITY Premier HSS T3 Column 1.8 µm, 2.1 mm * 100 mm) using 0.1% formic acid in water as solvent A and 0.1% formic acid in acetonitrile as solvent B in the following gradient: 5 to 20% in 2 min, increased to 60% in the following 3 min, increased to 99% in 1 min and held for 1.5 min, then returned to 5% mobile phase B within 0.1 min, held for 2.4 min. The analytical conditions were as follows: column temperature, 40°C; flow rate, 0.4 mL/min; injection volume, 4 μL; and another aliquot, which used negative ion conditions and was the same as the elution gradient of positive mode.
2.4.3. MS Conditions (AB)
The data acquisition was performed in information-dependent acquisition (IDA) mode via Analyst TF 1.7.1 software (Sciex, Concord, ON, Canada). The source parameters were set as follows: ion source gas 1 (GAS1), 50 psi; ion source gas 2 (GAS2), 50 psi; curtain gas (CUR), 25 psi; temperature (TEM), 550°C; declustering potential (DP), 60 V, or −60 V in positive or negative mode, respectively; and ion spray voltage floating (ISVF), 5000 V or −4000 V in positive or negative mode, respectively. The TOF MS scan parameters were set as follows: mass range, 50–1000 Da; accumulation time, 200 ms; and dynamic background subtraction, on. The product ion scan parameters were set as follows: mass range, 25-1000 Da; accumulation time, 40 ms; collision energy, 30 or −30 V in positive or negative mode, respectively; collision energy spread, 15; resolution, UNIT; charge state, 1--1; intensity, 100 cps; excluding isotopes within 4 Da; mass tolerance, 50 ppm; and maximum number of candidate ions to monitor per cycle, 18.
2.4.4. UPLC Conditions
The sample extracts were analyzed via a UPLC‒ESI‒MS‒MS‒MS system (UPLC, ExionLC™ AD,
https://sciex.com.cn/) and a tandem mass spectrometry system (
https://sciex.com.cn/). The analytical conditions were as follows: UPLC: column, Agilent SB-C18 (1.8 µm, 2.1 mm * 100 mm). The mobile phase consisted of solvent A, pure water with 0.1% formic acid, and solvent B, acetonitrile with 0.1% formic acid. Sample measurements were performed with a gradient program that employed the starting conditions of 95% A, 5% B. Within 9 min, a linear gradient to 5% A, 95% B was programmed, and a composition of 5% A, 95% B was maintained for 1 min. Subsequently, a composition of 95% A, 5.0% B was adjusted within 1.1 min and maintained for 2.9 min. The flow velocity was set as 0.35 mL per minute. The column oven was set to 40°C, and the injection volume was 2 μL. The effluent was alternatively connected to an ESI-triple quadrupole-linear ion trap (QTRAP)-MS.
2.5. Analysis Methods
2.5.1. PCA
Unsupervised PCA (principal component analysis) was performed via the statistics function prcomp within R (
www.r-project.org). The data was subjected to unit variance scaling before unsupervised PCA.
2.5.2. Hierarchical Cluster Analysis
The HCA (hierarchical cluster analysis) results of the samples and metabolites are presented as heatmaps with dendrograms, whereas the Pearson correlation coefficients (PCCs) between samples were calculated via the cor function in R and are presented as heatmaps. Both HCA and PCC were carried out via the R package Complex Heatmap. For HCA, normalized signal intensities of metabolites (unit variance scaling) are visualized as a color spectrum.
2.5.3. Differentially Abundant Metabolites Selected
For two-group analysis, differentially abundant metabolites were determined by VIP (VIP > 1) and absolute Log2FC (|Log2FC| ≥ 1.0). VIP values were extracted from the OPLS-DA results, which also contain score plots and permutation plots, generated via the R package MetaboAnalystR. The data were log transformed (log2) and mean centered before OPLS-DA. To avoid overfitting, a permutation test (200 permutations) was performed.
2.5.4. Metabolite Annotation
The qualitative analysis of primary and secondary mass spectrometry data, as well as the quantification of widely targeted metabolites, was performed by MetWare Biotechnology (Wuhan, China) with the custom-built MWDB database and publicly available metabolite databases. The analysis software used for substance identification was developed by MetWare Biotechnology. According to the priority of data analysis, the first is the search algorithm of the local high-resolution database built by the company (MS2-search-local), followed by the public online libraries Metlin (
http://metlin.scripps.edu/index.php), HMDB (
https://hmdb.ca/), Mona (
https://mona.fiehnlab.ucdavis. edu/), MassBank (
http://www.massbank.jp/), etc. (MS2-search-DB), and finally the prediction library based on machine learning (MS2-In silico). In addition, for substances that have not been identified at the second level, a first-level close result is given as a reference on the basis of the Q1 error (MS1).
4. Discussion
Plant extracts are a blend of bioactive molecules, which can work synergistically in counteracting harmful microorganisms [
29,
30]. Potato leaves contain a variety of primary and secondary metabolites, including sugars, proteins, alkaloids, terpenoids, phenolic acids, flavonoids and organic acids [
31]. The main components of the extract and other unknown metabolites can be clearly identified by specifically extracting the main component from leaves. LC-MS/MS analysis conducted here putatively identified a set of possible antifungal compounds into the extracts, namely solanidine, α-chaconine, solanine, isopimaric acid, chlorin e6 diphosphate, α-hederin, capsicoside C, lsonuatigenin and dioscin. etc. Moreover, solanidine, α-chaconine and solanine are steroidal alkaloids with a presumable role in potato protection from phytopathogens. The main glycoalkaloids in potato, α-solanine, and α-chaconine showed potent antimicrobial activity to a wide range of organisms; for this reason, they are considered as defensive allelochemicals [
32]. The experimental results revealed that the chemical composition of the extract not only contained steroidal alkaloids, but also included other alkaloids, terpenoids, phenolic acids, flavonoids and organic acids, steroids, lipids, etc., and a total of 1,761 metabolites. The conducted research was the first comparative analysis providing phytochemical profiling of different extracts obtained from raw potato leaf materials. steroidal alkaloids (solanidine, α-chaconine and solanine) were identified as the major compound responsible for antimicrobial activity, especially, solanidine was found to content with a relatively high level. Therefore, untargeted metabolomics can comprehensively reveal the types and relative contents of these metabolites, providing a scientific basis for the development of anti-inflammatory, anticancer, and antibacterial functional components.
Through untargeted detection and comparison of the CHW_vs_TQW groups for all the obtained metabolites, 800 significantly different metabolites were screened, whereas 960 metabolites were not significantly different. The metabolites with significant differences were mainly amino acids and derivatives, organic acids, benzene and substituted derivatives and lipids. The metabolites with no significant difference and relatively higher contents were mainly lipids, steroidal alkaloids (solanidine, α-chaconine and solanine). The purification process showed no significant effect on the content of the main component. The purified extract can be fully applied to agricultural, medical, and other industrial products.
Potatoes are rich in nutrients and contain a variety of active ingredients, making them an important source of various active components in the human diet [
33]. If the various effective components of potatoes can be extracted, separated, and utilized, they can have greater health benefits. Current research on the extraction of bioactive components from potato tubers has focused mainly on polysaccharides, proteins, and polyphenols, which have relatively high contents and mature extraction methods. However, research on the active components of potato leaves was limited. The results of this study show that steroidal alkaloids, especially solanidine, α-chaconine, and solanine are the primary bioactive compounds in the leaves. These secondary metabolites have antibacterial, antitumor, cholesterol-lowering, antimalarial, anti-inflammatory, cardiotonic, anti-inflammatory, and analgesic effects [
34,
35]. Researchers have demonstrated that α-solanine and α-chaconine have inhibitory effects on the proliferation of HCT-8 tumor cells and that the 6-OH of the glycoside alkaloid sugar chain was crucial for its antitumor effect [
36]. The application of potato glycoalkaloids to treat phlebitis caused by tumor drugs has shown that these compounds have anti-inflammatory effects [
37]. Terpenoids also have anti-inflammatory, antibacterial, and anticancer effects [
38,
39]. Terpenoids in potatoes, especially sesquiterpenes, have good cytotoxic and antitumor effects [
40]. Song
et al. [
41] evaluated the
in vitro antiviral activity of pentacyclic triterpene glycosides in potatoes on the basis that terpenoids can inhibit the recognition and binding of hemagglutinin (HA) protein to sialic acid receptors on the surface of host cells and reported that the antiviral activity of ursane-type pentacyclic triterpene glycosides were the strongest. In this study, high contents of terpenoids, including isopimaric acid, chlorin e6 iphosphate and α-hederin, were detected in the extracts of potato leaves. Recently, a non-targeted metabolite profiling of potato leaves carried out using HPLC-ESI-QTOF-MS, tentatively identified 109 compounds, including organic acids, amino acids and derivatives, phenolic acids, flavonoids, iridoids, oxylipins and other polar and semi-polar compounds, demonstrating that they are an important source of bioactive molecules putatively responsible for the observed antifungale ects [
7,
42]. Researchers believe that steroidal alkaloids and terpenoids coexist in potatoes [
43], and this study also confirms this conclusion. At present, there are no systematic studies on terpenoids in potatoes, which is a future direction for the study of the active components in potatoes.
Steroidal saponins could be found in many species of the solanum, and they are an important group of natural products exhbiting a number of potent beneficent properties, such as anti-tumor, antibacterial, antifungal, anti-inflammatory, antiviral, antioxidant, hypoglycemic, and antihyperlipidemic effects [
44,
45]. In the study, A total of 21 steroids were detected in the extracts from potato leaves. The steroids of higher content included lanatoside C, chikusetsusaponin III, carnosifloside III, dioscin and koryoginsenoside R1 in the CHW sample, and capsicoside C, isonuatigenin, dioscin, hellebrigenin glucoside and asclepin in the TQW. The current isolation process obtained a wide variety of steroidal compounds, demonstrating that potato leaves contain abundant amounts of steroidal substances. Therefore, we conclude that steroidal compounds are also contributors to the antimicrobial bioactive substances in potato leaves.
Author Contributions
Conceptualization, Y.C. and Y.F.; methodology, Y.H.; software, Y.H.; validation, Y.C. and Y.F.; formal analysis, Y.C.; investigation, Z.Z.; resources, R.X.; data curation, J.F.; writing—original draft preparation, Y.C.; writing—review and editing, Y.F.; visualization, Y.C. and Y.F.; supervision, R.W. and X.X.; project administration, L.N.; funding acquisition, L.N. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Total ion chromatogram (TIC) overlay plot of QC sample(a,b) in mass spectrometry analysis by LC-MS/MS detection. a. maps- positive. b, maps- negative.
Figure 1.
Total ion chromatogram (TIC) overlay plot of QC sample(a,b) in mass spectrometry analysis by LC-MS/MS detection. a. maps- positive. b, maps- negative.
Figure 2.
Extracted ion chromatogram (EIC) of blank samples(a,b) in mass spectrometry analysis by LC-MS/MS detection. a. maps- positive. b, maps- negative.
Figure 2.
Extracted ion chromatogram (EIC) of blank samples(a,b) in mass spectrometry analysis by LC-MS/MS detection. a. maps- positive. b, maps- negative.
Figure 3.
Correlation analysis plot of quality control (QC) samples for the two samples with three biological replicates by LC-MS/MS. The chart shows the correlation matrix of QC samples (QC01, QC02, QC03). The lower left square of the diagonal line is the correlation scatter plot of the corresponding QC samples, the horizontal and vertical coordinates are the metabolite contents (with Log treatment), and each point in the plot represents one metabolite; the upper right square of the diagonal line is the Pearson correlation coefficient of the corresponding QC samples (e.g., the correlation coefficient between QC01 and QC02 is 0.9989).
Figure 3.
Correlation analysis plot of quality control (QC) samples for the two samples with three biological replicates by LC-MS/MS. The chart shows the correlation matrix of QC samples (QC01, QC02, QC03). The lower left square of the diagonal line is the correlation scatter plot of the corresponding QC samples, the horizontal and vertical coordinates are the metabolite contents (with Log treatment), and each point in the plot represents one metabolite; the upper right square of the diagonal line is the Pearson correlation coefficient of the corresponding QC samples (e.g., the correlation coefficient between QC01 and QC02 is 0.9989).
Figure 4.
PCA analysis of metabolites (a) and CV distribution diagram of samples (b) for the two samples. a. PC1 denotes the first principal component, PC2 denotes the second principal component, and the percentage denotes the rate of explanation of the data set by this principal component. b. The horizontal coordinate represents the CV value, and the vertical coordinate indicates the proportion of the number of substances smaller than the corresponding CV value to the total number of substances, with different colors representing different grouped samples, and QC is the quality control sample, in which the two reference lines perpendicular to the X-axis correspond to CV values of 0.3 and 0.5, and the two reference lines parallel to the X-axis correspond to the number of substances accounting for 75% and 85% of the total number of substances.
Figure 4.
PCA analysis of metabolites (a) and CV distribution diagram of samples (b) for the two samples. a. PC1 denotes the first principal component, PC2 denotes the second principal component, and the percentage denotes the rate of explanation of the data set by this principal component. b. The horizontal coordinate represents the CV value, and the vertical coordinate indicates the proportion of the number of substances smaller than the corresponding CV value to the total number of substances, with different colors representing different grouped samples, and QC is the quality control sample, in which the two reference lines perpendicular to the X-axis correspond to CV values of 0.3 and 0.5, and the two reference lines parallel to the X-axis correspond to the number of substances accounting for 75% and 85% of the total number of substances.
Figure 5.
An overview of the principal component analysis results for the CHW and TQW samples. a. PCA of metabolites for two samples with three biological replicates. The horizontal axis represents the predictive principal components, indicating intergroup differences in this direction. The vertical axis represents the orthogonal principal components, reflecting intragroup variations in this direction. The percentages indicate the contribution of each component to the dataset. Different colors represent different samples. b. Evaluation parameters of OPLS-DA models. R2X and R2Y represent the model's explained variance, and Q2 represents the model's predictive ability. c. Orthogonal partial least squares-discriminant analysis validation plot. X-axis: Values of the model's goodness-of-fit (R²Y) and predictive ability (Q²). Y-axis: Frequency of occurrence for classification performance metrics across 200 random permutation tests. Orange, Distribution of R²Y values from permuted (randomly shuffled) models (goodness-of-fit). Purple, Distribution of Q² values from permuted models (predictive ability). Black arrow, Actual values of the original (non-permuted) model (R²X, R²Y, and Q²).
Figure 5.
An overview of the principal component analysis results for the CHW and TQW samples. a. PCA of metabolites for two samples with three biological replicates. The horizontal axis represents the predictive principal components, indicating intergroup differences in this direction. The vertical axis represents the orthogonal principal components, reflecting intragroup variations in this direction. The percentages indicate the contribution of each component to the dataset. Different colors represent different samples. b. Evaluation parameters of OPLS-DA models. R2X and R2Y represent the model's explained variance, and Q2 represents the model's predictive ability. c. Orthogonal partial least squares-discriminant analysis validation plot. X-axis: Values of the model's goodness-of-fit (R²Y) and predictive ability (Q²). Y-axis: Frequency of occurrence for classification performance metrics across 200 random permutation tests. Orange, Distribution of R²Y values from permuted (randomly shuffled) models (goodness-of-fit). Purple, Distribution of Q² values from permuted models (predictive ability). Black arrow, Actual values of the original (non-permuted) model (R²X, R²Y, and Q²).

Figure 6.
Metabolites analysis among CHW and TQW samples. a. Composition analysis of metabolite classes. The colored sections in the outer ring represent the relative proportions of each class, whereas the legend on the right provides the class labels and their percentages. b. OPLS-DA S-plot. X-axis represents the covariance between principal components and metabolites; y-axis indicates their correlation coefficients. Metabolites positioned closer to the upper-right or lower-left corners demonstrate more significant differences. Red dots denote metabolites with the variable importance in projection (VIP) values >1, green dots represent those with VIP values ≤1. OPLS-DA S-plot. c. Volcano plot of DAMs in the CHW_vs_TQW compare group. The plot shows the distribution of DAMs, with the x-axis representing Log₂ (fold change) and the y-axis representing the VIP value. Red dots indicate upregulated metabolites (Up, 162), green dots represent downregulated metabolites (Down, 638), and gray dots denote metabolites with no significant change (Insignificant, 960). The vertical and horizontal dashed lines indicate thresholds for significance and fold change, respectively.
Figure 6.
Metabolites analysis among CHW and TQW samples. a. Composition analysis of metabolite classes. The colored sections in the outer ring represent the relative proportions of each class, whereas the legend on the right provides the class labels and their percentages. b. OPLS-DA S-plot. X-axis represents the covariance between principal components and metabolites; y-axis indicates their correlation coefficients. Metabolites positioned closer to the upper-right or lower-left corners demonstrate more significant differences. Red dots denote metabolites with the variable importance in projection (VIP) values >1, green dots represent those with VIP values ≤1. OPLS-DA S-plot. c. Volcano plot of DAMs in the CHW_vs_TQW compare group. The plot shows the distribution of DAMs, with the x-axis representing Log₂ (fold change) and the y-axis representing the VIP value. Red dots indicate upregulated metabolites (Up, 162), green dots represent downregulated metabolites (Down, 638), and gray dots denote metabolites with no significant change (Insignificant, 960). The vertical and horizontal dashed lines indicate thresholds for significance and fold change, respectively.

Figure 7.
Heatmap of hierarchical clustering of the DAMs. Different colors represent values obtained after standardizing the relative content, with red indicating high content and green indicating low content.
Figure 7.
Heatmap of hierarchical clustering of the DAMs. Different colors represent values obtained after standardizing the relative content, with red indicating high content and green indicating low content.
Figure 8.
Comparison of the mean relative contents of metabolites in both the TQW and CHW samples. a. steroidal alkaloids, b. Terpenes metabolites, c. Steroids metabolites.
Figure 8.
Comparison of the mean relative contents of metabolites in both the TQW and CHW samples. a. steroidal alkaloids, b. Terpenes metabolites, c. Steroids metabolites.