Search | VHL Regional Portal

1.

Bayesian Optimization for Cascade-Type Multistage Processes.

Kusakawa, Shunya; Takeno, Shion; Inatsu, Yu; Kutsukake, Kentaro; Iwazaki, Shogo; Nakano, Takashi; Ujihara, Toru; Karasuyama, Masayuki; Takeuchi, Ichiro.

Neural Comput ; 34(12): 2408-2431, 2022 Nov 08.

Article in English | MEDLINE | ID: mdl-36283050

ABSTRACT

Complex processes in science and engineering are often formulated as multistage decision-making problems. In this letter, we consider a cascade process, a type of multistage decision-making process. This is a multistage process in which the output of one stage is used as an input for the subsequent stage. When the cost of each stage is expensive, it is difficult to search for the optimal controllable parameters for each stage exhaustively. To address this problem, we formulate the optimization of the cascade process as an extension of the Bayesian optimization framework and propose two types of acquisition functions based on credible intervals and expected improvement. We investigate the theoretical properties of the proposed acquisition functions and demonstrate their effectiveness through numerical experiments. In addition, we consider suspension setting, an extension in which we are allowed to suspend the cascade process at the middle of the multistage decision-making process that often arises in practical problems. We apply the proposed method in a test problem involving a solar cell simulator, the motivation for this study.

2.

Bayesian Quadrature Optimization for Probability Threshold Robustness Measure.

Iwazaki, Shogo; Inatsu, Yu; Takeuchi, Ichiro.

Neural Comput ; 33(12): 3413-3466, 2021 Nov 12.

Article in English | MEDLINE | ID: mdl-34710906

ABSTRACT

In many product development problems, the performance of the product is governed by two types of parameters: design parameters and environmental parameters. While the former is fully controllable, the latter varies depending on the environment in which the product is used. The challenge of such a problem is to find the design parameter that maximizes the probability that the performance of the product will meet the desired requisite level given the variation of the environmental parameter. In this letter, we formulate this practical problem as active learning (AL) problems and propose efficient algorithms with theoretically guaranteed performance. Our basic idea is to use a gaussian process (GP) model as the surrogate model of the product development process and then to formulate our AL problems as Bayesian quadrature optimization problems for probabilistic threshold robustness (PTR) measure. We derive credible intervals for the PTR measure and propose AL algorithms for the optimization and level set estimation of the PTR measure. We clarify the theoretical properties of the proposed algorithms and demonstrate their efficiency in both synthetic and real-world product development problems.

3.

Author Correction: Exploration of natural red-shifted rhodopsins using a machine learning-based Bayesian experimental design.

Inoue, Keiichi; Karasuyama, Masayuki; Nakamura, Ryoko; Konno, Masae; Yamada, Daichi; Mannen, Kentaro; Nagata, Takashi; Inatsu, Yu; Yawo, Hiromu; Yura, Kei; Béjà, Oded; Kandori, Hideki; Takeuchi, Ichiro.

Commun Biol ; 4(1): 532, 2021 Apr 30.

Article in English | MEDLINE | ID: mdl-33931743

4.

Exploration of natural red-shifted rhodopsins using a machine learning-based Bayesian experimental design.

Inoue, Keiichi; Karasuyama, Masayuki; Nakamura, Ryoko; Konno, Masae; Yamada, Daichi; Mannen, Kentaro; Nagata, Takashi; Inatsu, Yu; Yawo, Hiromu; Yura, Kei; Béjà, Oded; Kandori, Hideki; Takeuchi, Ichiro.

Commun Biol ; 4(1): 362, 2021 03 19.

Article in English | MEDLINE | ID: mdl-33742139

ABSTRACT

Microbial rhodopsins are photoreceptive membrane proteins, which are used as molecular tools in optogenetics. Here, a machine learning (ML)-based experimental design method is introduced for screening rhodopsins that are likely to be red-shifted from representative rhodopsins in the same subfamily. Among 3,022 ion-pumping rhodopsins that were suggested by a protein BLAST search in several protein databases, the ML-based method selected 65 candidate rhodopsins. The wavelengths of 39 of them were able to be experimentally determined by expressing proteins with the Escherichia coli system, and 32 (82%, p = 7.025 × 10-5) actually showed red-shift gains. In addition, four showed red-shift gains >20 nm, and two were found to have desirable ion-transporting properties, indicating that they would be potentially useful in optogenetics. These findings suggest that data-driven ML-based approaches play effective roles in the experimental design of rhodopsin and other photobiological studies. (141/150 words).

Subject(s)

Ion Channels/metabolism , Machine Learning , Optogenetics , Rhodopsins, Microbial/metabolism , Amino Acid Sequence , Bayes Theorem , Color , Databases, Protein , Escherichia coli/genetics , Escherichia coli/metabolism , Hydrogen-Ion Concentration , Ion Channels/genetics , Ion Channels/radiation effects , Light , Proof of Concept Study , Protein Conformation, alpha-Helical , Rhodopsins, Microbial/genetics , Rhodopsins, Microbial/radiation effects , Sequence Analysis, Protein

5.

Active Learning for Level Set Estimation Under Input Uncertainty and Its Extensions.

Inatsu, Yu; Karasuyama, Masayuki; Inoue, Keiichi; Takeuchi, Ichiro.

Neural Comput ; 32(12): 2486-2531, 2020 Dec.

Article in English | MEDLINE | ID: mdl-33080163

ABSTRACT

Testing under what conditions a product satisfies the desired properties is a fundamental problem in manufacturing industry. If the condition and the property are respectively regarded as the input and the output of a black-box function, this task can be interpreted as the problem called level set estimation (LSE): the problem of identifying input regions such that the function value is above (or below) a threshold. Although various methods for LSE problems have been developed, many issues remain to be solved for their practical use. As one of such issues, we consider the case where the input conditions cannot be controlled precisely-LSE problems under input uncertainty. We introduce a basic framework for handling input uncertainty in LSE problems and then propose efficient methods with proper theoretical guarantees. The proposed methods and theories can be generally applied to a variety of challenges related to LSE under input uncertainty such as cost-dependent input uncertainties and unknown input uncertainties. We apply the proposed methods to artificial and real data to demonstrate their applicability and effectiveness.

6.

Active Learning for Enumerating Local Minima Based on Gaussian Process Derivatives.

Inatsu, Yu; Sugita, Daisuke; Toyoura, Kazuaki; Takeuchi, Ichiro.

Neural Comput ; 32(10): 2032-2068, 2020 10.

Article in English | MEDLINE | ID: mdl-32795230

ABSTRACT

We study active learning (AL) based on gaussian processes (GPs) for efficiently enumerating all of the local minimum solutions of a black-box function. This problem is challenging because local solutions are characterized by their zero gradient and positive-definite Hessian properties, but those derivatives cannot be directly observed. We propose a new AL method in which the input points are sequentially selected such that the confidence intervals of the GP derivatives are effectively updated for enumerating local minimum solutions. We theoretically analyze the proposed method and demonstrate its usefulness through numerical experiments.

7.

Active Learning of Bayesian Linear Models with High-Dimensional Binary Features by Parameter Confidence-Region Estimation.

Inatsu, Yu; Karasuyama, Masayuki; Inoue, Keiichi; Kandori, Hideki; Takeuchi, Ichiro.

Neural Comput ; 32(10): 1998-2031, 2020 10.

Article in English | MEDLINE | ID: mdl-32795233

ABSTRACT

In this letter, we study an active learning problem for maximizing an unknown linear function with high-dimensional binary features. This problem is notoriously complex but arises in many important contexts. When the sampling budget, that is, the number of possible function evaluations, is smaller than the number of dimensions, it tends to be impossible to identify all of the optimal binary features. Therefore, in practice, only a small number of such features are considered, with the majority kept fixed at certain default values, which we call the working set heuristic. The main contribution of this letter is to formally study the working set heuristic and present a suite of theoretically robust algorithms for more efficient use of the sampling budget. Technically, we introduce a novel method for estimating the confidence regions of model parameters that is tailored to active learning with high-dimensional binary features. We provide a rigorous theoretical analysis of these algorithms and prove that a commonly used working set heuristic can identify optimal binary features with favorable sample complexity. We explore the performance of the proposed approach through numerical simulations and an application to a functional protein design problem.

Subject(s)

Linear Models , Supervised Machine Learning , Bayes Theorem , Confidence Intervals , Humans

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL