Búsqueda | BVS Bolivia

1.

Nanowatt all-optical 3D perception for mobile robotics.

Yan, Tao; Zhou, Tiankuang; Guo, Yanchen; Zhao, Yun; Shao, Guocheng; Wu, Jiamin; Huang, Ruqi; Dai, Qionghai; Fang, Lu.

Sci Adv ; 10(27): eadn2031, 2024 Jul 05.

Artículo en Inglés | MEDLINE | ID: mdl-38968351

RESUMEN

Three-dimensional (3D) perception is vital to drive mobile robotics' progress toward intelligence. However, state-of-the-art 3D perception solutions require complicated postprocessing or point-by-point scanning, suffering computational burden, latency of tens of milliseconds, and additional power consumption. Here, we propose a parallel all-optical computational chipset 3D perception architecture (Aop3D) with nanowatt power and light speed. The 3D perception is executed during the light propagation over the passive chipset, and the captured light intensity distribution provides a direct reflection of the depth map, eliminating the need for extensive postprocessing. The prototype system of Aop3D is tested in various scenarios and deployed to a mobile robot, demonstrating unprecedented performance in distance detection and obstacle avoidance. Moreover, Aop3D works at a frame rate of 600 hertz and a power consumption of 33.3 nanowatts per meta-pixel experimentally. Our work is promising toward next-generation direct 3D perception techniques with light speed and high energy efficiency.

2.

Large-scale photonic chiplet Taichi empowers 160-TOPS/W artificial general intelligence.

Xu, Zhihao; Zhou, Tiankuang; Ma, Muzhou; Deng, ChenChen; Dai, Qionghai; Fang, Lu.

Science ; 384(6692): 202-209, 2024 Apr 12.

Artículo en Inglés | MEDLINE | ID: mdl-38603505

RESUMEN

The pursuit of artificial general intelligence (AGI) continuously demands higher computing performance. Despite the superior processing speed and efficiency of integrated photonic circuits, their capacity and scalability are restricted by unavoidable errors, such that only simple tasks and shallow models are realized. To support modern AGIs, we designed Taichi-large-scale photonic chiplets based on an integrated diffractive-interference hybrid design and a general distributed computing architecture that has millions-of-neurons capability with 160-tera-operations per second per watt (TOPS/W) energy efficiency. Taichi experimentally achieved on-chip 1000-category-level classification (testing at 91.89% accuracy in the 1623-category Omniglot dataset) and high-fidelity artificial intelligence-generated content with up to two orders of magnitude of improvement in efficiency. Taichi paves the way for large-scale photonic computing and advanced tasks, further exploiting the flexibility and potential of photonics for modern AGI.

3.

Photonic neuromorphic architecture for tens-of-task lifelong learning.

Cheng, Yuan; Zhang, Jianing; Zhou, Tiankuang; Wang, Yuyan; Xu, Zhihao; Yuan, Xiaoyun; Fang, Lu.

Light Sci Appl ; 13(1): 56, 2024 Feb 26.

Artículo en Inglés | MEDLINE | ID: mdl-38403652

RESUMEN

Scalable, high-capacity, and low-power computing architecture is the primary assurance for increasingly manifold and large-scale machine learning tasks. Traditional electronic artificial agents by conventional power-hungry processors have faced the issues of energy and scaling walls, hindering them from the sustainable performance improvement and iterative multi-task learning. Referring to another modality of light, photonic computing has been progressively applied in high-efficient neuromorphic systems. Here, we innovate a reconfigurable lifelong-learning optical neural network (L2ONN), for highly-integrated tens-of-task machine intelligence with elaborated algorithm-hardware co-design. Benefiting from the inherent sparsity and parallelism in massive photonic connections, L2ONN learns each single task by adaptively activating sparse photonic neuron connections in the coherent light field, while incrementally acquiring expertise on various tasks by gradually enlarging the activation. The multi-task optical features are parallelly processed by multi-spectrum representations allocated with different wavelengths. Extensive evaluations on free-space and on-chip architectures confirm that for the first time, L2ONN avoided the catastrophic forgetting issue of photonic computing, owning versatile skills on challenging tens-of-tasks (vision classification, voice recognition, medical diagnosis, etc.) with a single model. Particularly, L2ONN achieves more than an order of magnitude higher efficiency than the representative electronic artificial neural networks, and 14× larger capacity than existing optical neural networks while maintaining competitive performance on each individual task. The proposed photonic neuromorphic architecture points out a new form of lifelong learning scheme, permitting terminal/edge AI systems with light-speed efficiency and unprecedented scalability.

4.

Training large-scale optoelectronic neural networks with dual-neuron optical-artificial learning.

Yuan, Xiaoyun; Wang, Yong; Xu, Zhihao; Zhou, Tiankuang; Fang, Lu.

Nat Commun ; 14(1): 7110, 2023 Nov 04.

Artículo en Inglés | MEDLINE | ID: mdl-37925451

RESUMEN

Optoelectronic neural networks (ONN) are a promising avenue in AI computing due to their potential for parallelization, power efficiency, and speed. Diffractive neural networks, which process information by propagating encoded light through trained optical elements, have garnered interest. However, training large-scale diffractive networks faces challenges due to the computational and memory costs of optical diffraction modeling. Here, we present DANTE, a dual-neuron optical-artificial learning architecture. Optical neurons model the optical diffraction, while artificial neurons approximate the intensive optical-diffraction computations with lightweight functions. DANTE also improves convergence by employing iterative global artificial-learning steps and local optical-learning steps. In simulation experiments, DANTE successfully trains large-scale ONNs with 150 million neurons on ImageNet, previously unattainable, and accelerates training speeds significantly on the CIFAR-10 benchmark compared to single-neuron learning. In physical experiments, we develop a two-layer ONN system based on DANTE, which can effectively extract features to improve the classification of natural images.

5.

All-analog photoelectronic chip for high-speed vision tasks.

Chen, Yitong; Nazhamaiti, Maimaiti; Xu, Han; Meng, Yao; Zhou, Tiankuang; Li, Guangpu; Fan, Jingtao; Wei, Qi; Wu, Jiamin; Qiao, Fei; Fang, Lu; Dai, Qionghai.

Nature ; 623(7985): 48-57, 2023 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-37880362

RESUMEN

Photonic computing enables faster and more energy-efficient processing of vision data1-5. However, experimental superiority of deployable systems remains a challenge because of complicated optical nonlinearities, considerable power consumption of analog-to-digital converters (ADCs) for downstream digital processing and vulnerability to noises and system errors1,6-8. Here we propose an all-analog chip combining electronic and light computing (ACCEL). It has a systemic energy efficiency of 74.8 peta-operations per second per watt and a computing speed of 4.6 peta-operations per second (more than 99% implemented by optics), corresponding to more than three and one order of magnitude higher than state-of-the-art computing processors, respectively. After applying diffractive optical computing as an optical encoder for feature extraction, the light-induced photocurrents are directly used for further calculation in an integrated analog computing chip without the requirement of analog-to-digital converters, leading to a low computing latency of 72 ns for each frame. With joint optimizations of optoelectronic computing and adaptive training, ACCEL achieves competitive classification accuracies of 85.5%, 82.0% and 92.6%, respectively, for Fashion-MNIST, 3-class ImageNet classification and time-lapse video recognition task experimentally, while showing superior system robustness in low-light conditions (0.14 fJ µm-2 each frame). ACCEL can be used across a broad range of applications such as wearable devices, autonomous driving and industrial inspections.

6.

Ultrafast dynamic machine vision with spatiotemporal photonic computing.

Zhou, Tiankuang; Wu, Wei; Zhang, Jinzhi; Yu, Shaoliang; Fang, Lu.

Sci Adv ; 9(23): eadg4391, 2023 Jun 09.

Artículo en Inglés | MEDLINE | ID: mdl-37285419

RESUMEN

Ultrafast dynamic machine vision in the optical domain can provide unprecedented perspectives for high-performance computing. However, owing to the limited degrees of freedom, existing photonic computing approaches rely on the memory's slow read/write operations to implement dynamic processing. Here, we propose a spatiotemporal photonic computing architecture to match the highly parallel spatial computing with high-speed temporal computing and achieve a three-dimensional spatiotemporal plane. A unified training framework is devised to optimize the physical system and the network model. The photonic processing speed of the benchmark video dataset is increased by 40-fold on a space-multiplexed system with 35-fold fewer parameters. A wavelength-multiplexed system realizes all-optical nonlinear computing of dynamic light field with a frame time of 3.57 nanoseconds. The proposed architecture paves the way for ultrafast advanced machine vision free from the limits of memory wall and will find applications in unmanned systems, autonomous driving, ultrafast science, etc.

7.

Photonic unsupervised learning variational autoencoder for high-throughput and low-latency image transmission.

Chen, Yitong; Zhou, Tiankuang; Wu, Jiamin; Qiao, Hui; Lin, Xing; Fang, Lu; Dai, Qionghai.

Sci Adv ; 9(7): eadf8437, 2023 Feb 15.

Artículo en Inglés | MEDLINE | ID: mdl-36791196

RESUMEN

Following the explosive growth of global data, there is an ever-increasing demand for high-throughput processing in image transmission systems. However, existing methods mainly rely on electronic circuits, which severely limits the transmission throughput. Here, we propose an end-to-end all-optical variational autoencoder, named photonic encoder-decoder (PED), which maps the physical system of image transmission into an optical generative neural network. By modeling the transmission noises as the variation in optical latent space, the PED establishes a large-scale high-throughput unsupervised optical computing framework that integrates main computations in image transmission, including compression, encryption, and error correction to the optical domain. It reduces the system latency of computation by more than four orders of magnitude compared with the state-of-the-art devices and transmission error ratio by 57% than on-off keying. Our work points to the direction for a wide range of artificial intelligence-based physical system designs and next-generation communications.

8.

A multichannel optical computing architecture for advanced machine vision.

Xu, Zhihao; Yuan, Xiaoyun; Zhou, Tiankuang; Fang, Lu.

Light Sci Appl ; 11(1): 255, 2022 Aug 18.

Artículo en Inglés | MEDLINE | ID: mdl-35977940

RESUMEN

Endowed with the superior computing speed and energy efficiency, optical neural networks (ONNs) have attracted ever-growing attention in recent years. Existing optical computing architectures are mainly single-channel due to the lack of advanced optical connection and interaction operators, solving simple tasks such as hand-written digit classification, saliency detection, etc. The limited computing capacity and scalability of single-channel ONNs restrict the optical implementation of advanced machine vision. Herein, we develop Monet: a multichannel optical neural network architecture for a universal multiple-input multiple-channel optical computing based on a novel projection-interference-prediction framework where the inter- and intra- channel connections are mapped to optical interference and diffraction. In our Monet, optical interference patterns are generated by projecting and interfering the multichannel inputs in a shared domain. These patterns encoding the correspondences together with feature embeddings are iteratively produced through the projection-interference process to predict the final output optically. For the first time, Monet validates that multichannel processing properties can be optically implemented with high-efficiency, enabling real-world intelligent multichannel-processing tasks solved via optical computing, including 3D/motion detections. Extensive experiments on different scenarios demonstrate the effectiveness of Monet in handling advanced machine vision tasks with comparative accuracy as the electronic counterparts yet achieving a ten-fold improvement in computing efficiency. For intelligent computing, the trends of dealing with real-world advanced tasks are irreversible. Breaking the capacity and scalability limitations of single-channel ONN and further exploring the multichannel processing potential of wave optics, we anticipate that the proposed technique will accelerate the development of more powerful optical AI as critical support for modern advanced machine vision.

9.

HiLo Based Line Scanning Temporal Focusing Microscopy for High-Speed, Deep Tissue Imaging.

Shi, Ruheng; Zhang, Yuanlong; Zhou, Tiankuang; Kong, Lingjie.

Membranes (Basel) ; 11(8)2021 Aug 17.

Artículo en Inglés | MEDLINE | ID: mdl-34436397

RESUMEN

High-speed, optical-sectioning imaging is highly desired in biomedical studies, as most bio-structures and bio-dynamics are in three-dimensions. Compared to point-scanning techniques, line scanning temporal focusing microscopy (LSTFM) is a promising method that can achieve high temporal resolution while maintaining a deep penetration depth. However, the contrast and axial confinement would still be deteriorated in scattering tissue imaging. Here, we propose a HiLo-based LSTFM, utilizing structured illumination to inhibit the fluorescence background and, thus, enhance the image contrast and axial confinement in deep imaging. We demonstrate the superiority of our method by performing volumetric imaging of neurons and dynamical imaging of microglia in mouse brains in vivo.

10.

Sequential Gating Ensemble Network for Noise Robust Multiscale Face Restoration.

Chen, Zhibo; Lin, Jianxin; Zhou, Tiankuang; Wu, Feng.

IEEE Trans Cybern ; 51(1): 451-461, 2021 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-30668493

RESUMEN

Face restoration from low resolution and noise is important for applications of face analysis recognition. However, most existing face restoration models omit the multiple scale issues in the face restoration problem, which is still not well solved in the research area. In this paper, we propose a sequential gating ensemble network (SGEN) for a multiscale noise robust face restoration issue. To endow the network with multiscale representation ability, we first employ the principle of ensemble learning for SGEN network architecture design. The SGEN aggregates multilevel base-encoders and base-decoders into the network, which enables the network to contain multiple scales of receptive field. Instead of combining these base-en/decoders directly with nonsequential operations, the SGEN takes base-en/decoders from different levels as sequential data. Specifically, it is visualized that SGEN learns to sequentially extract high-level information from base-encoders in a bottom-up manner and restore low-level information from base-decoders in a top-down manner. Besides, we propose realizing bottom-up and top-down information combination and selection with a sequential gating unit (SGU). The SGU sequentially takes information from two different levels as inputs and decides the output based on one active input. Experimental results on the benchmark dataset demonstrate that our SGEN is more effective at multiscale human face restoration with more image details and less noise than state-of-the-art image restoration models. Further utilizing an adversarial training scheme, SGEN also produces more visually preferred results than other models under subjective evaluation.

11.

Conformal convolutional neural network (CCNN) for single-shot sensorless wavefront sensing.

Zhang, Yuanlong; Zhou, Tiankuang; Fang, Lu; Kong, Lingjie; Xie, Hao; Dai, Qionghai.

Opt Express ; 28(13): 19218-19228, 2020 Jun 22.

Artículo en Inglés | MEDLINE | ID: mdl-32672203

RESUMEN

Wavefront sensing technique is essential in deep tissue imaging, which guides spatial light modulator to compensate wavefront distortion for better imaging quality. Recently, convolutional neural network (CNN) based sensorless wavefront sensing methods have achieved remarkable speed advantages via single-shot measurement methodology. However, the low efficiency of convolutional filters dealing with circular point-spread-function (PSF) features makes them less accurate. In this paper, we propose a conformal convolutional neural network (CCNN) that boosts the performance by pre-processing circular features into rectangular ones through conformal mapping. The proposed conformal mapping reduces the number of convolutional filters that need to describe a circular feature, thus enables the neural network to recognize PSF features more efficiently. We demonstrate our CCNN could improve the wavefront sensing accuracy over 15% compared to a traditional CNN through simulations and validate the accuracy improvement in experiments. The improved performances make the proposed method promising in high-speed deep tissue imaging.

12.

Overcoming tissue scattering in wide-field two-photon imaging by extended detection and computational reconstruction.

Zhang, Yuanlong; Zhou, Tiankuang; Hu, Xuemei; Li, Xinyang; Xie, Hao; Fang, Lu; Kong, Lingjie; Dai, Qionghai.

Opt Express ; 27(15): 20117-20132, 2019 Jul 22.

Artículo en Inglés | MEDLINE | ID: mdl-31510112

RESUMEN

Compared to point-scanning multiphoton microscopy, line-scanning temporal focusing microscopy (LTFM) is competitive in high imaging speed while maintaining tight axial confinement. However, considering its wide-field detection mode, LTFM suffers from shallow penetration depth as a result of the crosstalk induced by tissue scattering. In contrast to the spatial filtering based on confocal slit detection, here we propose the extended detection LTFM (ED-LTFM), the first wide-field two-photon imaging technique to extract signals from scattered photons and thus effectively extend the imaging depth. By recording a succession of line-shape excited signals in 2D and reconstructing signals under Hessian regularization, we can push the depth limitation of wide-field imaging in scattering tissues. We validate the concept with numerical simulations, and demonstrate the performance of enhanced imaging depth in in vivo imaging of mouse brains.

13.

Fourier-space Diffractive Deep Neural Network.

Yan, Tao; Wu, Jiamin; Zhou, Tiankuang; Xie, Hao; Xu, Feng; Fan, Jingtao; Fang, Lu; Lin, Xing; Dai, Qionghai.

Phys Rev Lett ; 123(2): 023901, 2019 Jul 12.

Artículo en Inglés | MEDLINE | ID: mdl-31386516

RESUMEN

In this Letter we propose the Fourier-space diffractive deep neural network (F-D^{2}NN) for all-optical image processing that performs advanced computer vision tasks at the speed of light. The F-D^{2}NN is achieved by placing the extremely compact diffractive modulation layers at the Fourier plane or both Fourier and imaging planes of an optical system, where the optical nonlinearity is introduced from ferroelectric thin films. We demonstrated that F-D^{2}NN can be trained with deep learning algorithms for all-optical saliency detection and high-accuracy object classification.

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA