Next Article in Journal
High-Mass Loading Hierarchically Porous Activated Carbon Electrode for Pouch-Type Supercapacitors with Propylene Carbonate-Based Electrolyte
Next Article in Special Issue
Probing the Intracellular Bio-Nano Interface in Different Cell Lines with Gold Nanostars
Previous Article in Journal
Ultra-Narrow Metallic Nano-Trenches Realized by Wet Etching and Critical Point Drying
Previous Article in Special Issue
Magnetic Mesoporous Carbon/β-Cyclodextrin–Chitosan Nanocomposite for Extraction and Preconcentration of Multi-Class Emerging Contaminant Residues in Environmental Samples
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development of Single-Molecule Electrical Identification Method for Cyclic Adenosine Monophosphate Signaling Pathway

1
Institute of Science and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
2
Artificial Intelligence Research Center, The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
*
Author to whom correspondence should be addressed.
Nanomaterials 2021, 11(3), 784; https://doi.org/10.3390/nano11030784
Submission received: 12 February 2021 / Revised: 11 March 2021 / Accepted: 16 March 2021 / Published: 19 March 2021
(This article belongs to the Special Issue Development of Nanomaterials for Applications in Trace Analysis)

Abstract

:
Cyclic adenosine monophosphate (cAMP) is an important research target because it activates protein kinases, and its signaling pathway regulates the passage of ions and molecules inside a cell. To detect the chemical reactions related to the cAMP intracellular signaling pathway, cAMP, adenosine triphosphate (ATP), adenosine monophosphate (AMP), and adenosine diphosphate (ADP) should be selectively detected. This study utilized single-molecule quantum measurements of these adenosine family molecules to detect their individual electrical conductance using nanogap devices. As a result, cAMP was electrically detected at the single molecular level, and its signal was successfully discriminated from those of ATP, AMP, and ADP using the developed machine learning method. The discrimination accuracies of a single cAMP signal from AMP, ADP, and ATP were found to be 0.82, 0.70, and 0.72, respectively. These values indicated a 99.9% accuracy when detecting more than ten signals. Based on an analysis of the feature values used for the machine learning analysis, it is suggested that this discrimination was due to the structural difference between the ribose of the phosphate site of cAMP and those of ATP, ADP, and AMP. This method will be of assistance in detecting and understanding the intercellular signaling pathways for small molecular second messengers.

1. Introduction

Second messengers are important for the signal transduction of intracellular signals because they are always released by the cell in response to exposure to extracellular signaling molecules and trigger physiological changes at the cellular level, such as proliferation, differentiation, migration, survival, apoptosis, and depolarization [1,2,3]. Therefore, measuring the molecular behavior of second messengers is important to understand the signal transduction events that occur inside cells. Among these second messengers, cyclic adenosine monophosphate (cAMP) is an important research target because it activates protein kinases, which are used to regulate the passage of ions and small molecules such as Ca2+ through ion channels [4,5,6]. This cAMP is produced from adenosine triphosphate (ATP) by a g-protein enzyme and is converted to ATP by an adenylate cyclase enzyme. As a result that the chemical properties and structures of nucleic acid molecules (i.e., ATP, adenosine monophosphate (AMP), and adenosine diphosphate (ADP)) are very similar to those of cAMP, the cAMP should be selectively detected in order to detect the chemical reactions related to its intracellular signaling pathway.
Fluorescence optical imaging using optically synthesized molecular or protein probes has been as a selective detection method for cAMP. These probes are composed of two functional parts that selectively bind to cAMP and have the ability to detect a fluorescent or luminescent signal inside cells [7,8,9,10]. However, in order to simultaneously detect multiple targets consisting of cAMP-related molecules such as ATP, cAMP, and ADP to understand the signal transduction of the cAMP pathway, each target requires a separate probe. In addition, because these probes are bound to the individual target molecules in the recognition, the detection conditions are somewhat different from the physiological conditions, and it is appropriate to directly measure each of these targets. Recently, single-molecule detection nanotechnologies, such as nanopore and nanogap devices, have developed so that these methods become promising approaches, which provide their potential applications within the life science fields [11,12,13,14,15,16]. Among them, one of the approaches is single-molecule quantum detection, which is based on the following detection principle. When sample molecules pass between nanogap electrodes that are used as sensors, a tunnel phenomenon induces an electron transfer through the molecules, resulting in the detection of the individual electrical conductivity due to the electronic state of each of the molecules [14,15,16,17]. To date, we have reported the identification of various biomolecules such as DNA, RNA, and amino acids using single-molecule quantum detection methods [18,19,20,21,22].
The current study investigated cAMP and its related molecules using the single-molecule quantum detection method (Figure 1a), and succeeded in electrically detecting cAMP at the single-molecule level. The developed machine learning method was also successfully used to discriminate the cAMP signals from those of ATP, AMP, and ADP (Figure 1b), which are molecules similar to cAMP and coexist under physiological conditions. The single cAMP signal discrimination accuracies from AMP, ADP, and ATP were found to be 0.82, 0.70, and 0.72, respectively. These values indicated a 99.9% accuracy when detecting more than 10 signals. Furthermore, it was found that the features that significantly contributed to the discrimination by the developed machine learning method were affected by the interaction of the electrodes and target molecules. This result indicated that the structural difference from the ribose of the phosphate site of cAMP enabled it to be distinguished from ATP, ADP, and AMP. Thus, it can be said that the single-molecule quantum measurement method has shown the potential for identifying one molecule of cAMP and visualizing the information transmission system in the body containing the cAMP.

2. Materials and Methods

2.1. Single-Molecule Electrical Measurement of Sample Nucleotide Solution

5′-Adenylic acid (AMP), 3′,5′-cyclic adenosine monophosphate (cAMP), adenosine 5′-trihydrogen diphosphate (ADP), and adenosine 5′-tetrahydrogen triphosphate (ATP) were purchased from Sigma-Aldrich (St. Louis, MO, USA) and used without further purification. Their chemical structures are shown in Figure 1b. These were used to prepare 10 µM aqueous solutions without further purification. The prepared sample solutions were dropped (20 µL for each measurement) at the center of a sensor plate that contained the fabricated nanogap electrodes.
The nanogap electrodes were constructed using the mechanically controllable break junction (MCBJ) nanofabrication method (Figure 1c). The procedures for fabricating MCBJs are detailed elsewhere [23,24,25]. In this study, a nanogap device was fabricated as follows. First, a silicon substrate was electrically insulated by applying a thin polyimide layer. A nano-gold junction was fabricated using electron-beam lithography on the substrate. Next, a SiO2 layer was deposited using chemical vapor deposition. Here, a nanochannel pattern was superimposed on the nano-gold junctions by electron beam lithography. Finally, this pattern was developed, and dry etching was performed to form nanochannels. A cover made of polydimethylsiloxane (PDMS) was attached to the silicon substrate. The PDMS cover had a microchannel that connected the hole for introducing the sample solution and the nanochannel of the sensor in advance. The PDMS was purchased from Toray Dow Corning. Finally, the PDMS cover and silicon substrate were treated with ozone plasma and then bonded.
The nanogap electrode devices were used for single-molecule measurements of the sample nucleotide solutions. The gap size was set to 0.65 nm and finely tuned by the piezoelectric element during all the measurements. During the measurement, the gap distance was maintained by feedback control of the piezo actuators. The gap distance was estimated using the following equation for the direct tunneling current:
I = c o n s t exp ( 4 π h 2 m w l )
where h, m, w, and l represent the Planck constant, electron mass, work function of the gold electrode, and gap distance, respectively. An electron mass of 9.1 × 10−31 kg was used for m, and the work function of Au(111) (5.1 eV) was used for w [19].
After every 1 h period of current–time (It) measurements, the MCBJ sample was replaced with a new one. The current across the electrodes was amplified by a custom-built logarithmic current amplifier and recorded at 10 kHz and 100 kHz using an NI PXIe-4081 digital multimeter (National Instruments, Austin, TX, USA) and NI PXI-5922 (National Instruments, Austin, TX, USA) under a DC bias voltage of 100 mV. More than three sets of measurements were performed using different gold-gap sensors.

2.2. Single-Pickup and Machine Learning Method for Single Analysis

The signal-rise time was judged to be the start of the signal from the estimated baseline with a value exceeding six times the “noise level” as the threshold value. At this time, the signal exceeded the noise level. Next, the signal was determined to have ended when a data point below the noise level appeared from the estimated baseline after the signal started. The baseline was defined as the mode of the histogram of the last 2000 data points at each time. The noise level was defined as the mode of the standard deviation of 200 data points obtained by dividing the latest 10,000 data points at each time into 500 equal parts.
A machine learning analysis was performed using Python 3.6. First, positive and unlabeled data classification (PUC) was performed to remove “blank signals“, which were also observed in blank solutions because of the migration of gold atoms from the electrode and contamination and reported in previous studies [26,27,28,29,30]. The PUC was an appropriate algorithm for blank signal removal. The PUC algorithm is based on the method of Elkan and Noto [31]. The inner classifier was the Gaussian naïve Bayes from the scikit-learn library, version 0.21.3 [32]. After blank signal removal, the signals were trained and classified using supervised machine learning with the random forest classifier from the scikit-learn library, version 0.21.3 [32]. A 10-fold cross-validation was performed, where all the data of the single-molecule signals were partitioned into 10 sub-datasets, and 10 classifications were performed for each of the sub-datasets used as test data, with other sub-datasets used as training data. The average ratios of the 10 classifications are shown in the confusion matrices.

3. Results

3.1. Single-Molecule Electrical Detection of cAMP, ATP, AMP, and ADP

First, the cAMP solution was investigated using the nanogap electrodes (Figure 1a: schematics). The current–time (I–t) profiles for the blank and cAMP solutions are shown in Figure 2a,b, respectively. Further current-time traces of cAMP measurements are shown in Figure S1 in Supplementary Materials. The cAMP molecules were detected as characteristic pulse current signals because increases in the tunnel current were induced by the cAMP molecules translocating through the nanogap electrodes. A characteristic current signal was detected, as shown in Figure 2c. The signal’s peak value was defined as Ip, and the signal duration time was defined as td. Compared to the cAMP signals in Figure 2b, the average Ip for the characteristic current signals decreased from 33 pA (Figure 2b) to 18 pA (Figure 2a). In addition, the signal frequency decreased significantly (Figure 2d). The duration-time was found to be 0.3 milliseconds. In the previous study, it was found that the signal intensity was around 10 pA, and the duration time was around 0.2 milliseconds. Such statistical signal values such as signal intensities and duration-time were very similar to those of the blank signals obtained in buffer aqueous solutions in previous studies [19]. These results suggested that the obtained current signals were typical molecular signals for cAMP molecules.
Next, single-molecule measurements were made of the ATP, AMP, and ADP. To evaluate the differences among these four types of molecular signals, the peak current (Ip) values were statistically compared. Figure 3a–d show the current signal histograms. The average Ip values of cAMP, AMP, ADP, and ATP at a bias of 100 mV were 32 ± 18, 27 ± 13, 41 ± 21, and 36 ± 15 pA, respectively. These results demonstrated that the differences in the conductance values were small. Similarly, the duration (td) values were statistically compared. Figure 3e–h show the signal duration time histograms. The average signal duration-time were 4.6, 4.6, 4.1, and 5.3 ms for cAMP, AMP, ADP, and ATP, respectively. These duration time values (td) were also similar, although the ATP signals had slightly longer averaged td values. Thus, statistical analyses of these signals showed that a single feature parameter (i.e., Ip or td) would not be sufficient for their discrimination. However, multiple feature parameter analysis has potential for improving the discrimination accuracy of cAMP, ATP, AMP, and ADP. Figure 3i–l show two-dimensional plots of Ip and td, and the dependency of the sample species appears in these plots. The histograms of cAMP and AMP are similar. But the difference was observed as shown in Figure S2 in Supplymentary Material. These results indicated that information about the entire waveforms of the molecular signals or several characteristic feature parameters extracted from the signals are needed for accurate discrimination.

3.2. Discrimination of cAMP ATP, AMP, and ADP Using Machine Learning Method

To improve the discrimination of cAMP, ATP, AMP, and ADP using a multiple feature parameter analysis, a machine learning-based classification method was applied, with multidimensional feature parameters prepared using each of the individual signals. The scheme of this machine learning method is shown in Figure 4a,b. In the first step, each signal was obtained by the single-molecule electrical measurement. In the second step, the single-molecule signal was extracted based on the criteria described in the materials and methods section. This study used the following 13 feature parameters: the peak value of the signal current (Ip), average signal value (Iave), duration time (td), and ten-dimensional shape factors (Sn = In/Ip (n = 1, 2, …, 10)). The values of Sn were defined as follows. The current–time (I–t) profile of a signal region was separated into ten time regions, and then each of the average current values (blue circles) for each of the time regions (I1, I2, I3, I4, I5, I6, I7, I8, I9, and I10) was calculated (Figure 4c). Each of these current values was normalized by the maximum current value of the signal (Ip), and the values in each region were defined as the shape factors, i.e., S1, S2, S3, S4, S5, S6, S7, S8, S9, and S10. In the third step, each of the extracted signals was converted into feature parameter data (Ip, Iave, td, Sn (n = 1, 2, …, 10)). The dataset of the feature parameter data was split into two datasets: the training data and test data. In the fourth step, a machine learning classifier learned the feature parameters from the training data, as shown in Figure 4b. In the fifth step, the classifier was utilized to identify each of the chemical species in the test data signals for cAMP, ATP, AMP, and ADP, and the discrimination accuracies were evaluated as F-measure values. In the final step, the accuracy was validated by performing a 10-fold cross-validation, as described in the materials and methods section.
These feature parameters (Ip, Iave, td, and Sn) were used to perform classifications with the supervised machine learning method for all the different discrimination combinations of cAMP, ADP, AMP, and ATP. The evaluation results are shown in Figure 4d. The discrimination accuracy value of 0.28 for cAMP was poor, while those for the ADP, AMP, and ATP signals showed high accuracy.
In the single-molecule detection, the obtained signals include target sample signals and also include signals which were generated from the fluctuation of the molecule or the electrode as shown in Figure 2a,d. We called these signals blank signals. The blank signals are not Johnson noise, (thermal noise), and/or shot noise of measuring instruments generated by electrical measurement, but the signal. In the previous reports, this blank signal adversely affected the discrimination of molecules [26,27,29] so that the reduction of blank signal are a key step for the discrimination with high accuracy. To address the issues of the blank signals, the signal of the sample target, i.e., cAMP, ADP, AMP, and ATP, had to be extracted from the obtained signals.
In order to remove the blank signals, the PUC method was applied to this signal discrimination. In the PUC method, the signals obtained from the blank solution (control experiments) were used as “positive” signals. On the other hand, the signal data of the sample solution were regarded as “unlabeled” data because the signals obtained from the sample solution contained both blank signals and the sample molecule signals. The scheme is shown in Figure 5a,b. In the first step, single-molecule measurements were performed for the solvent (blank). The signal data of the blank were regarded as positive data. In the second step, each signal was converted into a feature parameter value (Ip, Iave, td, and Sn). In the third step, the machine-learning classifier learned the feature parameters of the blank signals. In the fourth step, single-molecule measurements were performed on the sample solutions, and these signal data were regarded as unlabeled data. In the fifth step, the sample molecule signals were discriminated from the blank signals using the machine learning classifier of blank signals learned in the third step. In the final step, the extracted sample data (cAMP, ADP, AMP, and ATP) were used for the subsequent classification by the supervised machine learning method.
Figure 5c shows the evaluation results for the cAMP, ADP, AMP, and ATP discrimination accuracies using the PUC method. Compared to the discrimination accuracy value of 0.25 for cAMP without the PUC method (Figure 4d), the discrimination accuracy of cAMP was significantly improved to 0.44. Together with the other discrimination accuracies of 0.50 for AMP, 0.49 for ADP, and 0.59 for ATP, the average discrimination accuracy was also improved to 0.50. For instance, when more than ten signals with a discrimination accuracy of 0.50 were obtained, a 99% accuracy is obtained (Figure 5d). In the case of identification with only two types, the discrimination accuracy values were found to be 0.70 for AMP/cAMP, 0.69 for ADP/cAMP, 0.75 for ADP/cAMP, 0.69 for AMP/ATP, 0.74 for ATP/cAMP, and 0.69 for ADP/ATP (Figure 5e–j). The obtained accuracy represents that the specific molecular species were identified with a certain accuracy for each signal. Therefore, in order to determine the ratio of sample molecules in the mixed solution, the discrimination error of each signal can be determined from the obtained confusion matrix such as Figure 5c. In the previous report, we discriminated ethylated guanosine from guanosine and estimated the mixture ratio [27], and also identified each sample in a mixed solution containing dopamine, serotonin, and noradrenaline and estimated the mixture ratio, which is comparable to that of the prepared solution [26]. Therefore, this method will be applicable for understanding cAMP-related reactions such as enzymatic reactions at the single-molecular level. However, we also found the determined mixture ratio has some difference which depends on the sample species. Since the discrimination errors were due to the difference between the concentration of bulk solution and that of near the sensor gap, further development of the active mass transport function in solution such as electrophoresis [30] would be needed.

4. Discussion

This molecular recognition method using machine learning (Figure 5a,b) had at least two advantages compared to the previous discrimination method that only analyzed a single parameter such as Ip or td. The first advantage was the reduction of the blank signal using the PUC method before signal discrimination. The discrimination accuracy was significantly improved after PUC (Figure 5c) compared to that before PUC (Figure 4d). In addition, after PUC, the average Ip values of the cAMP, AMP, ADP, and ATP signals were 120 ± 18 pA, 100 ± 24 pA, 104 ± 13 pA, and 148 ± 22 pA, respectively. As a result that the intensity of blank signals is known to be small, the difference in the currents with and without PUC was due to the blank signal reduction when using the PUC method. Therefore, the PUC method allowed the target signals to be extracted from the obtained signals, which made it possible to better detect slight differences in the chemical structures compared to conventional methods.
The second advantage was that by using multiple parameters (Ip, Iave, td, Sn (n = 1, 2, …, 10)) for each of the signals, this analysis significantly improved the discrimination accuracy and understanding of the differences in the physical behaviors of each molecule. In this analysis, the maximum current value (Ip), average value (Iave), signal duration (td), and signal shape factor of each region (Sn) were used for supervised machine learning. A comparison of several classification algorithms, including support vector machine (SVM) and extreme gradient boosting (XGBoost), showed that the random forest algorithm provided the best discrimination accuracy (Figure 6a). In an ensemble learning method such as random forest, some feature parameters are randomly selected for classification trees, and then classification is performed using the ensemble. If only a specific feature parameter is important for discrimination, while other parameters are not important, the identification accuracy in a random forest becomes low. Therefore, the good discrimination accuracy using the random forest method indicated that all the prepared features used here were involved in the discrimination.
Next, to evaluate each feature parameter’s contribution to the discrimination, the discrimination accuracy was determined after changing the combination of feature parameters used. Among the four sets of parameter combinations, the best combination was found to be the data sets of the maximum current value (Ip), average current value (Iave), td, and shape factors (Sn). This confirmed that all of the prepared features used here were useful for the discrimination. Importantly, among the three types of features, (Ip, Iave), (Ip, Iave, td), and the shape factors (Sn), the shape factors were found to have the highest discrimination accuracy of 0.44, compared to 0.40 for the maximum and average values of the current and the signal duration (Ip, Iave, td), and 0.39 for the average and maximum values of the current (Ip, Iave) (Figure 6b). It is suggested that the shape factors were important because the translocation behaviors inside the nanogap electrodes differed depending on the size and polarization of the molecule, resulting in characteristic signal behaviors.
Finally, the importance of each feature parameter was estimated to quantitatively investigate which feature was more effective for discrimination. The random forest classifier could evaluate the importance of each feature [33], which was an index of how much the discrimination accuracy decreased when that feature parameter was omitted. Figure 6c shows the importance index for each feature parameter. Among these indexes, the two most important feature parameters were found to be the maximum current value (Ip) and first shape factor (S1).
For the maximum current value (Ip), an increase in the signal current was induced by electron transmission via a single molecule at the nanogap electrodes. It is well known that the current intensity is proportional to the transmission of a single-molecule junction, τ, as represented in the following equation [34,35,36].
τ = 4 Γ L Γ R ε 2 + ( Γ L + Γ R ) 2
Here, ε is the conductive molecular orbital alignment from the gold Fermi level, and ΓL and ΓR denote the coupling values between the molecule and the left and right electrodes, respectively. A change in the molecular orbital position causes a change in the single-molecule conductance. It has been previously reported that the difference in conductivity is closely related to the highest occupied molecular orbital (HOMO) level [15,16,19,26,37,38]. In this study, all of the molecules investigated (cAMP, ATP, ADP, and AMP) have an adenine structure, which is the main local electron density site and related to the HOMO level of the molecule. The electron density of the adenine structure is influenced by the position of the phosphate group, which is an electron-withdrawing functional group. For instance, the phosphate group of cAMP approaches the vicinity of adenine because of two covalent bonds between the phosphate and ribose. Thus, differences in the sizes of the molecules reflect those at each of the HOMO levels, resulting in signal intensity differences.
The electronic structure of molecules, such as HOMO levels, is one of the important parameters for each of the signal characters. In addition to the electronic structure of molecules, it was also found that fluctuations of sample molecules inside electrodes, i.e., structural motion of the DNA and its environment, are closely related to the signal characters [15,16]. In this study, among various potential candidates, we utilized td, Sn (n = 1, 2, …, 10)) for this discrimination as feature parameters for this machine learning because the characteristic fluctuation, which should be influenced by the molecular orientation and interaction inside nanogap, would induce the characteristic signal shape and signal duration time. Among the shape factors of Sn (n = 1, 2, …, 10)), the first shape factor (S1) was the first region of the ten separated regions (Figure 4c), i.e., the initial rising current for each signal. The difference in S1 was due to the initial interaction between the molecule and gold electrodes. The cAMP molecules are smaller than the ATP, AMP, and ADP molecules because the phosphate group of cAMP is covalently bound to the hydroxyl group of ribose molecules. Therefore, there was considered to be a large difference in the initial state of attraction to the electrode. In summary, it was shown that using numerous features with machine learning made it possible to determine the slight differences in the molecules, which assisted in improving the identification accuracy of the single-molecule measurements.

5. Conclusions

A single-molecule quantum measurement method was used for cAMP and succeeded in detecting cAMP for the first time. Furthermore, this study succeeded in identifying signals from ATP, AMP, and ADP, which are molecules similar to cAMP. The single cAMP signal discrimination accuracy values for AMP, ADP, and ATP were found to be 0.82, 0.70, and 0.72, respectively. These values indicated a 99.9% accuracy when detecting more than ten signals. Furthermore, the features that contributed to the identification were identified. As a result that the features were affected by the interaction with the electrodes, the structural difference caused by the ribose of the phosphate site of cAMP was considered to be the reason for the differences compared to the ATP, ADP, and AMP signals. In this study, as a proof-of-concept, we demonstrated cAMP detection and its discrimination from other similar molecules. When multiple gap devices are developed and some kinds of cultured cell samples are mounted on the device, it is possible to visualize the cAMP related intercellular signaling pathway.

Supplementary Materials

The following are available online at https://www.mdpi.com/2079-4991/11/3/784/s1, Figure S1: Current–time profiles for signals obtained cAMP aqueous solution, Figure S2: Distribution of the first component for AMP and cAMP. The first component is 0.29Ip + 0.05Id + 0.09Iave + 0.23S1 + 0.30S2 + 0.31S3 + 0.31S4 + 0.30S5 + 0.30S6 + 0.31S7 + 0.30S8 + 0.29S9 + 0.33S10.

Author Contributions

Conceptualization, Y.K., T.O. and M.T.; methodology, T.O.; software, Y.K.; validation, Y.K., T.O. and M.T.; formal analysis, Y.K.; investigation, T.O. and Y.K.; resources, T.O.; data curation, Y.K.; writing—original draft preparation, T.O.; writing—review and editing, T.O. and Y.K.; visualization, Y.K.; supervision, T.O. and M.T.; project administration, T.O. and M.T.; funding acquisition, M.T., T.O. and Y.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by KAKENHI Grant No. 19H00852, No.18K14091, and JST-CREST Grant Number JPMJCR1666, and the New Energy and Industrial Technology Development Organization (NEDO), Grant project New Industry Creation New Technology leading research program (14004).

Data Availability Statement

The data presented in this study are available on request from the corresponding authors.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the study design; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Municio, A.M.; Miras-Portugal, M.T. Cell Signal Transduction, Second Messengers, and Protein Phosphorylation in Health and Disease; Springer: Boston, MA, USA, 1994. [Google Scholar]
  2. Pollard, T.D.; Earnshaw, W.C.; Lippincott-Schwartz, J.; Johnson, G. (Eds.) Second Messengers. In Cell Biology, 3rd ed.; Elsevier Inc.: Philadelphia, PA, USA, 2017. [Google Scholar]
  3. Wouters, F.S.; Verveer, P.J.; Bastiaens, P.I.H. Imaging biochemistry inside cells. Trends Cell Biol. 2001, 11, 203–211. [Google Scholar] [CrossRef]
  4. Chou, S.-H.; Guiliani, N.; Lee, V.; Römling, U. Microbial Cyclic Di-Nucleotide Signaling; Springer: Cham, Switzerland, 2020. [Google Scholar]
  5. Bender, A.T.; Beavo, J.A. Cyclic nucleotide phosphodiesterases: Molecular regulation to clinical use. Pharmacol. Rev. 2006, 58, 488–520. [Google Scholar] [CrossRef] [PubMed]
  6. Beavo, J.A.; Brunton, L.L. Cyclic nucleotide research—Still expanding after half a century. Nat. Rev. Mol. Cell Biol. 2002, 3, 710–718. [Google Scholar] [CrossRef]
  7. Greenwald, E.C.; Mehta, S.; Zhang, J. Genetically Encoded Fluorescent Biosensors Illuminate the Spatiotemporal Regulation of Signaling Networks. Chem. Rev. 2018, 118, 11707–11794. [Google Scholar] [CrossRef]
  8. Adams, S.R.; Harrotunian, A.T.R.; Buechlerm, Y.J.; Talyer, S.S.; Tsien, R.Y. Fluorescence ratio imaging of cyclic AMP in single cells. Nature 1991, 349, 694–697. [Google Scholar] [CrossRef]
  9. Zhang, J.; Campbell, R.E.; Ting, A.Y.; Tsien, R.Y. Creating new fluorescent probes for cell biology. Nat. Rev. Mol. Cell Biol. 2002, 3, 906–918. [Google Scholar] [CrossRef] [PubMed]
  10. Zaccolo, M.; De Giorgi, F.; Cho, C.Y.; Feng, L.X.; Knapp, T.; Negulescu, P.A.; Taylor, S.S.; Tsien, R.Y.; Pozzan, T. A genetically encoded, fluorescent indicator for cyclic AMP in living cells. Nat. Cell Biol. 2000, 2, 25–29. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Branton, D.; Deamer, D.W.; Marziali, A.; Bayley, H.; Benner, S.A.; Butler, T.; Di Ventra, M.; Garaj, S.; Hibbs, A.; Huang, X.H. The potential and challenges of nanopore sequencing. Nat. Biotechnol. 2008, 26, 1146–1153. [Google Scholar] [CrossRef]
  12. Shendure, J.; Balasubramanian, S.; Church, G.M.; Gilbert, W.; Rogers, J.; Schloss, J.A.; Waterston, R.H. DNA sequencing at 40: Past, present and future. Nature 2017, 550, 345–353. [Google Scholar] [CrossRef] [PubMed]
  13. Shi, W.Q.; Friedman, A.K.; Baker, L.A. Nanopore Sensing. Anal. Chem. 2017, 89, 157–188. [Google Scholar] [CrossRef] [Green Version]
  14. Tokeshi, M. Applications of Microfluidic Systems in Biology and Medicine; Springer: Singapore, 2019. [Google Scholar]
  15. Zwolak, M.; Di Ventra, M. Electronic signature of DNA nucleotides via transverse transport. Nano Lett. 2005, 5, 421–424. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Lagerqvist, J.; Zwolak, M.; Di Ventra, M. Fast DNA sequencing via transverse electronic transport. Nano Lett. 2006, 6, 779–782. [Google Scholar] [CrossRef] [Green Version]
  17. Di Ventra, M.; Taniguchi, M. Decoding DNA, RNA and peptides with quantum tunnelling. Nat. Nanotechnol. 2016, 11, 117. [Google Scholar] [CrossRef]
  18. Tsutsui, M.; Taniguchi, M.; Yokota, K.; Kawai, T. Identifying single nucleotides by tunnelling current. Nat. Nanotechnol. 2010, 5, 286–290. [Google Scholar] [CrossRef]
  19. Ohshiro, T.; Tsustui, M.; Matsubara, K.; Furuhashi, M.; Taniguchi, M.; Kawai, T. Single-Molecule Electrical Random Resequencing of DNA and RNA. Sci. Rep. 2012, 2, 501–507. [Google Scholar] [CrossRef]
  20. Ohshiro, T.; Tsutsui, M.; Yokota, K.; Taniguchi, M. Quantitative analysis of DNA with single-molecule sequencing. Sci. Rep. 2018, 8, 8517. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Ohshiro, T.; Tsutsui, T.; Yokota, K.; Furuhashi, M.; Taniguchi, M.; Kawai, T. Single-Molecule Proteomic Analysis of Post-translational Modification. Nat. Nanotechnol. 2014, 9, 835–840. [Google Scholar] [CrossRef]
  22. Ohshiro, T.; Komoto, Y.; Konno, M.; Koseki, J.; Asai, A.; Ishii, H.; Taniguchi, M. Direct Analysis of Incorporation of an Anticancer Drug into DNA at Single-Molecule Resolution. Sci. Rep. 2019, 9, 3886. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Tsutsui, M.; Shoji, K.; Taniguchi, M.; Kawai, T. Formation and self-breaking mechanism of stable atom-sized junctions. Nano Lett. 2008, 8, 345–349. [Google Scholar] [CrossRef]
  24. Agrait, N.; Yeyati, A.L.; van Ruitenbeek, J.M. Quantum properties of atomic sized conductors. Phys. Rep. 2003, 377, 81–279. [Google Scholar] [CrossRef] [Green Version]
  25. Michaelson, H.B. The work function of the elements and its periodicity. J. Appl. Phys. 1977, 48, 4729. [Google Scholar] [CrossRef] [Green Version]
  26. Komoto, Y.; Ohshiro, T.; Yoshida, T.; Tarusawa, E.; Yagi, T.; Washio, T.; Taniguchi, M. Time-resolved neurotransmitter detection in mouse brain tissue using an artificial intelligence-nanogap. Sci. Rep. 2020, 10, 11244. [Google Scholar] [CrossRef]
  27. Komoto, Y.; Ohshiro, T.; Taniguchi, M. Detection of alcohol-derived cancer marker by single-molecule quantum sequencing. Chem. Commun. 2020, 56, 14299–14302. [Google Scholar] [CrossRef] [PubMed]
  28. Taniguchi, M.; Ohshiro, T.; Komoto, Y.; Takaai, T.; Yoshida, T.; Washio, T.J. High-precision single-molecule identification based on single-molecule information within a noisy matrix. Phys. Chem. C 2019, 123, 15867. [Google Scholar] [CrossRef]
  29. Komoto, Y.; Ohshiro, T.; Taniguchi, M. Length discrimination of homo-oligomeric nucleic acids with single-molecule measurement. Anal. Sci. 2021, 37, 513–518. [Google Scholar] [CrossRef] [PubMed]
  30. Ohshiro, T.; Komoto, Y.; Taniguchi, M. Single-Molecule Counting of Nucleotide by Electrophoresis with Nanofluid integrated nano-gap devices. Micromachines 2020, 11, 982. [Google Scholar] [CrossRef]
  31. Elkan, C.; Noto, K. Learning classifiers from only positive and unlabeled data. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, Las Vegas, NV, USA, 24–27 August 2008; pp. 213–220. [Google Scholar]
  32. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.J. Scikit-learn: Machine learning in Python. Mach. Learn. Res. 2011, 12, 2825. [Google Scholar]
  33. Landauer, R. Electrical resistance of disordered one-dimensional lattices. Philos. Mag. 1970, 21, 863. [Google Scholar] [CrossRef]
  34. Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Routledge: Boca Raton, FL, USA, 1984. [Google Scholar]
  35. Kim, Y.; Pietsch, T.; Erbe, A.; Belzig, W.; Scheer, E. Benzenedithiol: A Broad-Range Single-Channel Molecular Conductor. Nano Lett. 2011, 11, 3734. [Google Scholar] [CrossRef] [Green Version]
  36. Komoto, Y.; Fujii, S.; Nakamura, H.; Tada, T.; Nishino, T.; Kiguchi, M. Resolving metal-molecule interfaces at single-molecule junctions. Sci. Rep. 2016, 6, 26606. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Frisenda, R.; Perrin, M.L.; Valkenier, H.; Hummelen, J.C.; van der Zant, H.S.J. Statistical analysis of single-molecule breaking traces. Phys. Status Solidi B 2013, 250, 2431. [Google Scholar] [CrossRef]
  38. Furuhata, T.; Ohshiro, T.; Akimoto, G.; Ueki, R.; Taniguchi, M.; Sando, S. Chemical Labeling-Assisted Detection of Nucleobase Modifications by Quantum Tunneling-Based Single Molecule Sensing. ACS Nano 2019, 13, 5028. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (a) Schematic image of single-molecule measurement of nucleotide-type second messenger, e.g., cyclic adenosine monophosphate (cAMP), using nanogap electrodes. (b) Chemical structures of four targeted molecules: cyclic adenosine monophosphate (cAMP), adenosine triphosphate (ATP), adenosine monophosphate (AMP), and adenosine diphosphate (ADP). (c) Device and measurement system. The mechanically controllable break-junction (MCBJ) system was used for the formation of nanogap electrodes.
Figure 1. (a) Schematic image of single-molecule measurement of nucleotide-type second messenger, e.g., cyclic adenosine monophosphate (cAMP), using nanogap electrodes. (b) Chemical structures of four targeted molecules: cyclic adenosine monophosphate (cAMP), adenosine triphosphate (ATP), adenosine monophosphate (AMP), and adenosine diphosphate (ADP). (c) Device and measurement system. The mechanically controllable break-junction (MCBJ) system was used for the formation of nanogap electrodes.
Nanomaterials 11 00784 g001
Figure 2. Current–time (I–t) profiles for signals obtained in water (a), and cAMP aqueous solution (b). (c) Enlargement of signal shown in (b) obtained in cAMP aqueous solution. (d) Comparison of signal frequency values for blank and cAMP, respectively. The signal frequencies are clearly different, with the signal frequency of the signals obtained in the cAMP aqueous solution (right: cAMP) significantly larger than that of the signals obtained in water (left: blank).
Figure 2. Current–time (I–t) profiles for signals obtained in water (a), and cAMP aqueous solution (b). (c) Enlargement of signal shown in (b) obtained in cAMP aqueous solution. (d) Comparison of signal frequency values for blank and cAMP, respectively. The signal frequencies are clearly different, with the signal frequency of the signals obtained in the cAMP aqueous solution (right: cAMP) significantly larger than that of the signals obtained in water (left: blank).
Nanomaterials 11 00784 g002
Figure 3. Statistical analysis results for obtained signals. In these analyses, at least 1000 signals were utilized. The occurrence in the histogram represents normalized signal numbers. (ad) Current histograms at bias voltage of 100 mV for cAMP (a), AMP (b), ADP (c), and ATP (d). (eh) Duration-time histograms of cAMP (e), AMP (f), ADP (g), and ATP (h). (il) Two-dimensional plots of Ip and td for cAMP (i), AMP (j), ADP (k), and ATP (l), and dependency of sample species.
Figure 3. Statistical analysis results for obtained signals. In these analyses, at least 1000 signals were utilized. The occurrence in the histogram represents normalized signal numbers. (ad) Current histograms at bias voltage of 100 mV for cAMP (a), AMP (b), ADP (c), and ATP (d). (eh) Duration-time histograms of cAMP (e), AMP (f), ADP (g), and ATP (h). (il) Two-dimensional plots of Ip and td for cAMP (i), AMP (j), ADP (k), and ATP (l), and dependency of sample species.
Nanomaterials 11 00784 g003
Figure 4. (a) Flow of data acquisition and analysis. (b) Scheme for machine learning analysis. (c) Definitions of 13 features parameter of signals used for machine learning analysis. These 13 feature parameters included the following: the peak value of the current signal (Ip), average signal value (Iave), duration-time (td), and ten-dimensional shape factors (Sn = In/Ip (n = 1, 2, …, 10)). The average current value (blue circle) for each of the time regions (I1, I2, I3, I4, I5, I6, I7, I8, I9, and I10) was calculated. Each of the current values was normalized by the maximum current value of the signal (Ip), and the values in each region were defined as shape factors (S1, S2, S3, S4, S5, S6, S7, S8, S9, and S10). (d) Evaluation results for discrimination accuracies of cAMP, ADP, AMP, and ATP. To calculate the confusion matrix for the signals, at least one thousand signals were obtained from the experiments and all the signals were used for the evaluation. The prediction ratios in the confusion matrix represent the numbers of signals predicted to be the molecules normalized by the total signals of the true molecules. The diagonal terms of the matrix indicate the discrimination accuracies of the molecules. The classifier was trained with 10,000 signals.
Figure 4. (a) Flow of data acquisition and analysis. (b) Scheme for machine learning analysis. (c) Definitions of 13 features parameter of signals used for machine learning analysis. These 13 feature parameters included the following: the peak value of the current signal (Ip), average signal value (Iave), duration-time (td), and ten-dimensional shape factors (Sn = In/Ip (n = 1, 2, …, 10)). The average current value (blue circle) for each of the time regions (I1, I2, I3, I4, I5, I6, I7, I8, I9, and I10) was calculated. Each of the current values was normalized by the maximum current value of the signal (Ip), and the values in each region were defined as shape factors (S1, S2, S3, S4, S5, S6, S7, S8, S9, and S10). (d) Evaluation results for discrimination accuracies of cAMP, ADP, AMP, and ATP. To calculate the confusion matrix for the signals, at least one thousand signals were obtained from the experiments and all the signals were used for the evaluation. The prediction ratios in the confusion matrix represent the numbers of signals predicted to be the molecules normalized by the total signals of the true molecules. The diagonal terms of the matrix indicate the discrimination accuracies of the molecules. The classifier was trained with 10,000 signals.
Nanomaterials 11 00784 g004
Figure 5. (a) Flow of data acquisition and analysis with positive and unlabeled data classification (PUC) method. In the PUC method, the signals obtained from the blank solution (control experiments) were used as positive signals. On the other hand, the signal data of the sample solution were regarded as unlabeled data because these signals were composed of both blank signals and the sample molecule signals. (b) Schematic of machine learning analysis using PUC method. (c) Evaluation results for discrimination accuracies of cAMP, ADP, AMP, and ATP using the PUC method. Compared to the discrimination accuracy value of 0.25 for cAMP without the PUC method, the average discrimination accuracy was also improved to 0.50. The classifier was trained with 1000 signals. (d) Relation between probability of accurate prediction by majority vote and number of signals. The accuracy was set to 0.5 for four classes. (ej) Evaluation of discrimination accuracy of cAMP from other molecules, i.e., AMP, ATP, and ADP. The two-target discrimination accuracy values were found to be 0.70 for AMP/cAMP (e), 0.69 for ADP/cAMP (f), 0.75 for ATP/cAMP (g), 0.69 for AMP/ATP (h), 0.74 for ATP/AMP (i), and 0.69 for ADP/ATP (j).
Figure 5. (a) Flow of data acquisition and analysis with positive and unlabeled data classification (PUC) method. In the PUC method, the signals obtained from the blank solution (control experiments) were used as positive signals. On the other hand, the signal data of the sample solution were regarded as unlabeled data because these signals were composed of both blank signals and the sample molecule signals. (b) Schematic of machine learning analysis using PUC method. (c) Evaluation results for discrimination accuracies of cAMP, ADP, AMP, and ATP using the PUC method. Compared to the discrimination accuracy value of 0.25 for cAMP without the PUC method, the average discrimination accuracy was also improved to 0.50. The classifier was trained with 1000 signals. (d) Relation between probability of accurate prediction by majority vote and number of signals. The accuracy was set to 0.5 for four classes. (ej) Evaluation of discrimination accuracy of cAMP from other molecules, i.e., AMP, ATP, and ADP. The two-target discrimination accuracy values were found to be 0.70 for AMP/cAMP (e), 0.69 for ADP/cAMP (f), 0.75 for ATP/cAMP (g), 0.69 for AMP/ATP (h), 0.74 for ATP/AMP (i), and 0.69 for ADP/ATP (j).
Nanomaterials 11 00784 g005
Figure 6. (a) Comparison of F-measure values for support vector machine (SVM), extreme gradient boosting (XGBoost), and random forest. The hyperparameters of SVM were determined by a grid search to have the best F-measure. All the algorithms succeeded in identifying with an F-measure of 0.42 or higher. Among several classification algorithms, including SVM and XGBoost, the random forest algorithm had the best discrimination accuracy. Its good discrimination accuracy indicated that all the prepared features (Ip, Iave, td, S1, S2, S3, S4, S5, S6, S7, S8, S9, and S10) used here were involved in the discrimination. An evaluation was made of the contributions to the discrimination of the different feature parameters. (b) Comparison of four sets of feature parameters, i.e., the maximum current value (Ip), average current value (Iave), td, and shape factors (Sn). Among the four sets of the parameter combinations, the best combination was found to be maximum current value, average current value, and shape factors (Ip, Iave, td, Sn). This confirmed that all the prepared features used here were important for the discrimination. In addition, among three sets of feature parameters, Ip, td, and the shape factors, the shape factors (Sn) were found to provide the highest discrimination accuracy. This suggested that the shape factors were important. (c) Estimation of the importance of each feature parameter. The random forest classifier could evaluate the importance of each feature, which was an index of how much the discrimination accuracy decreased when each of the feature parameter was omitted. Among these indexes, two feature parameters, i.e., the maximum current value (Ip) and first shape factor (S1), were found to be the most important.
Figure 6. (a) Comparison of F-measure values for support vector machine (SVM), extreme gradient boosting (XGBoost), and random forest. The hyperparameters of SVM were determined by a grid search to have the best F-measure. All the algorithms succeeded in identifying with an F-measure of 0.42 or higher. Among several classification algorithms, including SVM and XGBoost, the random forest algorithm had the best discrimination accuracy. Its good discrimination accuracy indicated that all the prepared features (Ip, Iave, td, S1, S2, S3, S4, S5, S6, S7, S8, S9, and S10) used here were involved in the discrimination. An evaluation was made of the contributions to the discrimination of the different feature parameters. (b) Comparison of four sets of feature parameters, i.e., the maximum current value (Ip), average current value (Iave), td, and shape factors (Sn). Among the four sets of the parameter combinations, the best combination was found to be maximum current value, average current value, and shape factors (Ip, Iave, td, Sn). This confirmed that all the prepared features used here were important for the discrimination. In addition, among three sets of feature parameters, Ip, td, and the shape factors, the shape factors (Sn) were found to provide the highest discrimination accuracy. This suggested that the shape factors were important. (c) Estimation of the importance of each feature parameter. The random forest classifier could evaluate the importance of each feature, which was an index of how much the discrimination accuracy decreased when each of the feature parameter was omitted. Among these indexes, two feature parameters, i.e., the maximum current value (Ip) and first shape factor (S1), were found to be the most important.
Nanomaterials 11 00784 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Komoto, Y.; Ohshiro, T.; Taniguchi, M. Development of Single-Molecule Electrical Identification Method for Cyclic Adenosine Monophosphate Signaling Pathway. Nanomaterials 2021, 11, 784. https://doi.org/10.3390/nano11030784

AMA Style

Komoto Y, Ohshiro T, Taniguchi M. Development of Single-Molecule Electrical Identification Method for Cyclic Adenosine Monophosphate Signaling Pathway. Nanomaterials. 2021; 11(3):784. https://doi.org/10.3390/nano11030784

Chicago/Turabian Style

Komoto, Yuki, Takahito Ohshiro, and Masateru Taniguchi. 2021. "Development of Single-Molecule Electrical Identification Method for Cyclic Adenosine Monophosphate Signaling Pathway" Nanomaterials 11, no. 3: 784. https://doi.org/10.3390/nano11030784

APA Style

Komoto, Y., Ohshiro, T., & Taniguchi, M. (2021). Development of Single-Molecule Electrical Identification Method for Cyclic Adenosine Monophosphate Signaling Pathway. Nanomaterials, 11(3), 784. https://doi.org/10.3390/nano11030784

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop