RESUMO
Distributed nonconvex optimization underpins key functionalities of numerous distributed systems, ranging from power systems, smart buildings, cooperative robots, vehicle networks to sensor networks. Recently, it has also merged as a promising solution to handle the enormous growth in data and model sizes in deep learning. A fundamental problem in distributed nonconvex optimization is avoiding convergence to saddle points, which significantly degrade optimization accuracy. We find that the process of quantization, which is necessary for all digital communications, can be exploited to enable saddle-point avoidance. More specifically, we propose a stochastic quantization scheme and prove that it can effectively escape saddle points and ensure convergence to a second-order stationary point in distributed nonconvex optimization. With an easily adjustable quantization granularity, the approach allows a user to control the number of bits sent per iteration and, hence, to aggressively reduce the communication overhead. Numerical experimental results using distributed optimization and learning problems on benchmark datasets confirm the effectiveness of the approach.
RESUMO
The energy landscape of multiply connected superconducting structures is ruled by fluxoid quantization due to the implied single-valuedness of the complex wave function. The transitions and interaction between these energy states, each defined by a specific phase winding number, are governed by classical and/or quantum phase slips. Understanding these events requires the ability to probe, noninvasively, the state of the ring. Here, we employ a niobium resonator to examine the superconducting properties of an aluminum loop. By applying a magnetic field, adjusting temperature, and altering the loop's dimensions via focused ion beam milling, we correlate resonance frequency shifts with changes in the loop's kinetic inductance. This parameter is an indicator of the superconducting condensate's state, facilitating the detection of phase slips in nanodevices and providing insights into their dynamics. Our method presents a proof-of-principle spectroscopic technique with promising potential for investigating Cooper pair density in inductively coupled superconducting nanostructures.
RESUMO
Conductance quantization of 2D materials is significant for understanding the charge transport at the atomic scale, which provides a platform to manipulate the quantum states, showing promising applications for nanoelectronics and memristors. However, the conventional methods for investigating conductance quantization are only applicable to materials consisting of one element, such as metal and graphene. The experimental observation of conductance quantization in transition metal dichalcogenides (TMDCs) with complex compositions and structures remains a challenge. To address this issue, an approach is proposed to characterize the charge transport across a single atom in TMDCs by integrating in situ synthesized 1T'-WTe2 electrodes with scanning tunneling microscope break junction (STM-BJ) technique. The quantized conductance of 1T'-WTe2 is measured for the first time, and the quantum states can be modulated by stretching speed and solvent. Combined with theoretical calculations, the evolution of quantized and corresponding configurations during the break junction process is demonstrated. This work provides a facile and reliable avenue to characterize and modulate conductance quantization of 2D materials, intensively expanding the research scope of quantum effects in diverse materials.
RESUMO
As a common type of pollutants in industrial wastewater, cationic dyes have attracted great attentions. Using biodegradable N,N-di (carboxymethyl) glutamic acid (GLDA) as ligand and corn stalk (CS) as matrix, a novel and green biomass modified material GLDA-CS was successfully prepared. The multifunctional property of GLDA-CS for removing methylene blue (MB), malachite green (MG) and alkaline red 46 (R-46) from wastewater was evaluated. The dyes were removed by the electrostatic adsorption based on the cationic adsorption properties of GLDA-CS. The removal rates of MB, MG and R-46 can quickly reach 90.4%, 96.8% and 94.8% in short time. especially for MG and R-46 even only 20 min. The adsorption capacities of the dyes still remain more than 86.5% of the initial values after 5 cycles. In a heterogeneous system, the dyes were removed by Fenton-like degradation based on the metal chelating property of GLDA-CS. 100% degradation rates of the dyes can be achieved in 35 min under the acidic region. Even if at pH 7, degradation rates are 44.1%, 47.1% and 56.6% higher than those under the conventional homogeneous system, and the degradation rate remained at 83.7% after 5 cycles. Regardless of the adsorption or degradation, GLDA-CS shows strong anti-anion interference ability. The potential mechanisms of adsorption and degradation for the dyes by GLDA-CS were deduced by quantization calculation. It is concluded that the adsorption removal of the dyes by GLDA-CS follows MG > R-46 > MB, and mainly depends on the electrostatic interaction between -COOH in GLDA-CS and -N- in the dye molecules. Based on the degradation mechanism of Fenton-like reaction, the possible active sites of the dyes attacked by free radicals and their possible degradation intermediates were predicted by the calculations of Fukui function.
RESUMO
Deep-learning models play a significant role in modern software solutions, with the capabilities of handling complex tasks, improving accuracy, automating processes, and adapting to diverse domains, eventually contributing to advancements in various industries. This study provides a comparative study on deep-learning techniques that can also be deployed on resource-constrained edge devices. As a novel contribution, we analyze the performance of seven Convolutional Neural Network models in the context of data augmentation, feature extraction, and model compression using acoustic data. The results show that the best performers can achieve an optimal trade-off between model accuracy and size when compressed with weight and filter pruning followed by 8-bit quantization. In adherence to the study workflow utilizing the forest sound dataset, MobileNet-v3-small and ACDNet achieved accuracies of 87.95% and 85.64%, respectively, while maintaining compact sizes of 243 KB and 484 KB, respectively. Henceforth, this study concludes that CNNs can be optimized and compressed to be deployed in resource-constrained edge devices for classifying forest environment sounds.
RESUMO
To address the issue of suboptimal spectral purity in Direct Digital Frequency Synthesis (DDFS) within resource-constrained environments, this paper proposes an optimized DDFS technique based on cubic Hermite interpolation. Initially, a DDFS hardware architecture is implemented on a Field-Programmable Gate Array (FPGA); subsequently, essential interpolation parameters are extracted by combining the derivative relations of sine and cosine functions with a dual-port Read-Only Memory (ROM) structure using the cubic Hermite interpolation method to reconstruct high-fidelity target waveforms. This approach effectively mitigates spurious issues caused by amplitude quantization during the DDFS digitalization process while reducing data node storage units. Moreover, this paper introduces single-quadrant ROM compression technology to further diminish the required storage space. Experimental results indicate that, compared to traditional DDFS methods, the optimization scheme proposed in this work achieves a ROM resource compression ratio of 1792:1 and a 14-bit output Spurious-Free Dynamic Range (SFDR) of -88.134 dBc, effectively enhancing amplitude quantization precision and significantly lowering spurious levels. This significantly improves amplitude quantization precision and reduces spurious levels. The proposed scheme demonstrates notable advantages in both spectral performance and resource utilization efficiency, making it highly suitable for resource-constrained embedded systems and high-performance applications such as radar and communication systems.
RESUMO
Vector Quantization (VQ) is a technique with a wide range of applications. For example, it can be used for image compression. The codebook design for VQ has great significance in the quality of the quantized signals and can benefit from the use of swarm intelligence. Initialization of the Linde-Buzo-Gray (LBG) algorithm, which is the most popular VQ codebook design algorithm, is a step that directly influences VQ performance, as the convergence speed and codebook quality depend on the initial codebook. A widely used initialization alternative is random initialization, in which the initial set of codevectors is drawn randomly from the training set. Other initialization methods can lead to a better quality of the designed codebooks. The present work evaluates the impacts of initialization strategies on swarm intelligence algorithms for codebook design in terms of the quality of the designed codebooks, assessed by the quality of the reconstructed images, and in terms of the convergence speed, evaluated by the number of iterations. Initialization strategies consist of a combination of codebooks obtained by initialization algorithms from the literature with codebooks composed of vectors randomly selected from the training set. The possibility of combining different initialization techniques provides new perspectives in the search for the quality of the VQ codebooks. Nine initialization strategies are presented, which are compared with random initialization. Initialization strategies are evaluated on the following algorithms for codebook design based on swarm clustering: modified firefly algorithm-Linde-Buzo-Gray (M-FA-LBG), modified particle swarm optimization-Linde-Buzo-Gray (M-PSO-LBG), modified fish school search-Linde-Buzo-Gray (M-FSS-LBG) and their accelerated versions (M-FA-LBGa, M-PSO-LBGa and M-FSS-LBGa) which are obtained by replacing the LBG with the accelerated LBG algorithm. The simulation results point out to the benefits of the proposed initialization strategies. The results show gains up to 4.43 dB in terms of PSNR for image Clock with M-PSO-LBG codebooks of size 512 and codebook design time savings up to 67.05% for image Clock, with M-FF-LBGa codebooks with size N=512, by using initialization strategies in substitution to Random initialization.
RESUMO
In recent years, one-bit quantization has attracted widespread attention in the field of direction-of-arrival (DOA) estimation as a low-cost and low-power solution. Many researchers have proposed various estimation algorithms for one-bit DOA estimation, among which atomic norm minimization algorithms exhibit particularly attractive performance as gridless estimation algorithms. However, current one-bit DOA algorithms with atomic norm minimization typically rely on approximating the trace function, which is not the optimal approximation and introduces errors, along with resolution limitations. To date, there have been few studies on how to enhance resolution under the framework of one-bit DOA estimation. This paper aims to improve the resolution constraints of one-bit DOA estimation. The log-det heuristic is applied to approximate and solve the atomic norm minimization problem. In particular, a reweighted binary atomic norm minimization with noise assumption constraints is proposed to achieve high-resolution and robust one-bit DOA estimation. Finally, the alternating direction method of multipliers algorithm is employed to solve the established optimization problem. Simulations are conducted to demonstrate that the proposed algorithm can effectively enhance the resolution.
RESUMO
Due to the low-complexity implementation, direction-of-arrival (DOA) estimation-based one-bit quantized data are of interest, but also, signal processing struggles to obtain the demanded estimation accuracy. In this study, we injected a number of noise components into the receiving data before the uniform linear array (ULA) composed of one-bit quantizers. Then, based on this designed noise-boosted quantizer unit (NBQU), we propose an efficient one-bit multiple signal classification (MUSIC) method for estimating the DOA. Benefiting from the injected noise, the numerical results show that the proposed NBQU-based MUSIC method outperforms existing one-bit MUSIC methods in terms of estimation accuracy and resolution. Furthermore, with the optimal root mean square (RMS) of the injected noise, the estimation accuracy of the proposed method for estimating DOA can approach that of the MUSIC method based on the complete analog data.
RESUMO
Accurate 3D image recognition, critical for autonomous driving safety, is shifting from the LIDAR-based point cloud to camera-based depth estimation technologies driven by cost considerations and the point cloud's limitations in detecting distant small objects. This research aims to enhance MDE (Monocular Depth Estimation) using a single camera, offering extreme cost-effectiveness in acquiring 3D environmental data. In particular, this paper focuses on novel data augmentation methods designed to enhance the accuracy of MDE. Our research addresses the challenge of limited MDE data quantities by proposing the use of synthetic-based augmentation techniques: Mask, Mask-Scale, and CutFlip. The implementation of these synthetic-based data augmentation strategies has demonstrably enhanced the accuracy of MDE models by 4.0% compared to the original dataset. Furthermore, this study introduces the RMS (Real-time Monocular Depth Estimation configuration considering Resolution, Efficiency, and Latency) algorithm, designed for the optimization of neural networks to augment the performance of contemporary monocular depth estimation technologies through a three-step process. Initially, it selects a model based on minimum latency and REL criteria, followed by refining the model's accuracy using various data augmentation techniques and loss functions. Finally, the refined model is compressed using quantization and pruning techniques to minimize its size for efficient on-device real-time applications. Experimental results from implementing the RMS algorithm indicated that, within the required latency and size constraints, the IEBins model exhibited the most accurate REL (absolute RELative error) performance, achieving a 0.0480 REL. Furthermore, the data augmentation combination of the original dataset with Flip, Mask, and CutFlip, alongside the SigLoss loss function, displayed the best REL performance, with a score of 0.0461. The network compression technique using FP16 was analyzed as the most effective, reducing the model size by 83.4% compared to the original while maintaining the least impact on REL performance and latency. Finally, the performance of the RMS algorithm was validated on the on-device autonomous driving platform, NVIDIA Jetson AGX Orin, through which optimal deployment strategies were derived for various applications and scenarios requiring autonomous driving technologies.
RESUMO
Neural recording systems play a crucial role in comprehending the intricacies of the brain and advancing treatments for neurological disorders. Within these systems, the analog-to-digital converter (ADC) serves as a fundamental component, converting the electrical signals from the brain into digital data that can be further processed and analyzed by computing units. This research introduces a novel nonlinear ADC designed specifically for spike sorting in biomedical applications. Employing MOSFET varactors and voltage-controlled oscillators (VCOs), this ADC exploits the nonlinear capacitance properties of MOSFET varactors, achieving a parabolic quantization function that digitizes the noise with low resolution and the spikes with high resolution, effectively suppressing the background noise present in biomedical signals. This research aims to develop a reconfigurable, nonlinear voltage-controlled oscillator (VCO)-based ADC, specifically designed for implantable neural recording systems used in neuroprosthetics and brain-machine interfaces. The proposed design enhances the signal-to-noise ratio and reduces power consumption, making it more efficient for real-time neural data processing. By improving the performance and energy efficiency of these devices, the research contributes to the development of more reliable medical technologies for monitoring and treating neurological disorders. The quantization step of the ADC spans from 44.8 mV in the low-amplitude range to 1.4 mV in the high-amplitude range. The circuit was designed and simulated utilizing a 180 nm CMOS process; however, no physical prototype has been fabricated at this stage. Post-layout simulations confirm the expected performance. Occupying a silicon area is 0.09 mm2. Operating at a sampling frequency of 16 kS/s and a supply voltage of 1 volt, this ADC consumes 62.4 µW.
Assuntos
Conversão Análogo-Digital , Humanos , Razão Sinal-Ruído , Processamento de Sinais Assistido por Computador , Interfaces Cérebro-Computador , Encéfalo/fisiologia , Neurônios/fisiologia , Desenho de Equipamento , Dinâmica não LinearRESUMO
Image classification is increasingly being utilized on construction sites to automate project monitoring, driven by advancements in reality-capture technologies and artificial intelligence (AI). Deploying real-time applications remains a challenge due to the limited computing resources available on-site, particularly on remote construction sites that have limited telecommunication support or access due to high signal attenuation within a structure. To address this issue, this research proposes an efficient edge-computing-enabled image classification framework for support of real-time construction AI applications. A lightweight binary image classifier was developed using MobileNet transfer learning, followed by a quantization process to reduce model size while maintaining accuracy. A complete edge computing hardware module, including components like Raspberry Pi, Edge TPU, and battery, was assembled, and a multimodal software module (incorporating visual, textual, and audio data) was integrated into the edge computing environment to enable an intelligent image classification system. Two practical case studies involving material classification and safety detection were deployed to demonstrate the effectiveness of the proposed framework. The results demonstrated the developed prototype successfully synchronized multimodal mechanisms and achieved zero latency in differentiating materials and identifying hazardous nails without any internet connectivity. Construction managers can leverage the developed prototype to facilitate centralized management efforts without compromising accuracy or extra investment in computing resources. This research paves the way for edge "intelligence" to be enabled for future construction job sites and promote real-time human-technology interactions without the need for high-speed internet.
RESUMO
Heart Rate Variability (HRV) refers to the capability of the heart rhythm to vary at different times, typically reflecting the regulation of the heart by the autonomic nervous system. In recent years, with advancements in Electrocardiogram (ECG) signal processing technology, HRV features reflect various aspects of cardiac activity, such as variability in heart rate, cardiac health status, and responses. We extracted key features of HRV and used them to develop and evaluate an automatic recognition model for cardiac diseases. Consequently, we proposed the HRV Heart Disease Recognition (HHDR) method, employing the Spectral Magnitude Quantification (SMQ) technique for feature extraction. Firstly, the HRV signals are extracted through electrocardiogram signal processing. Then, by analyzing parts of the HRV signal within various frequency ranges, the SMQ method extracts rich features of partial information. Finally, the Random Forest (RF) classification computational method is employed to classify the extracted information, achieving efficient and accurate cardiac disease recognition. Experimental results indicate that this method surpasses current technologies in recognizing cardiac diseases, with an average accuracy rate of 95.1% for normal/diseased classification, and an average accuracy of 84.8% in classifying five different disease categories. Thus, the proposed HHDR method effectively utilizes the local information of HRV signals for efficient and accurate cardiac disease recognition, providing strong support for cardiac disease research in the medical field.
Assuntos
Algoritmos , Eletrocardiografia , Cardiopatias , Frequência Cardíaca , Processamento de Sinais Assistido por Computador , Humanos , Eletrocardiografia/métodos , Frequência Cardíaca/fisiologia , Cardiopatias/fisiopatologia , Cardiopatias/diagnósticoRESUMO
The accurate detection and quantification of defects is vital for the effectiveness of the eddy current nondestructive testing (ECNDT) of carbon fiber-reinforced plastic (CFRP) materials. This study investigates the identification and measurement of three common CFRP defects-cracks, delamination, and low-velocity impact damage-by employing the You Only Look Once (YOLO) model and an improved Eddy Current YOLO (EDC-YOLO) model. YOLO's limitations in detecting multi-scale features are addressed through the integration of Transformer-based self-attention mechanisms and deformable convolutional sub-modules, with additional global feature extraction via CBAM. By leveraging the Wise-IoU loss function, the model performance is further enhanced, leading to a 4.4% increase in the mAP50 for defect detection. EDC-YOLO proves to be effective for defect identification and quantification in industrial inspections, providing detailed insights, such as the correlation between the impact damage size and energy levels.
RESUMO
The advancement of unmanned aerial vehicles (UAVs) enables early detection of numerous disasters. Efforts have been made to automate the monitoring of data from UAVs, with machine learning methods recently attracting significant interest. These solutions often face challenges with high computational costs and energy usage. Conventionally, data from UAVs are processed using cloud computing, where they are sent to the cloud for analysis. However, this method might not meet the real-time needs of disaster relief scenarios. In contrast, edge computing provides real-time processing at the site but still struggles with computational and energy efficiency issues. To overcome these obstacles and enhance resource utilization, this paper presents a convolutional neural network (CNN) model with an early exit mechanism designed for fire detection in UAVs. This model is implemented using TSMC 40 nm CMOS technology, which aids in hardware acceleration. Notably, the neural network has a modest parameter count of 11.2 k. In the hardware computation part, the CNN circuit completes fire detection in approximately 230,000 cycles. Power-gating techniques are also used to turn off inactive memory, contributing to reduced power consumption. The experimental results show that this neural network reaches a maximum accuracy of 81.49% in the hardware implementation stage. After automatic layout and routing, the CNN hardware accelerator can operate at 300 MHz, consuming 117 mW of power.
RESUMO
The development of devices that exhibit both superconducting and semiconducting properties is an important endeavor for emerging quantum technologies. We investigate superconducting nanowires fabricated on a silicon-on-insulator (SOI) platform. Aluminum from deposited contact electrodes is found to interdiffuse with Si along the entire length of the nanowire, over micrometer length scales and at temperatures well below the Al-Si eutectic. The phase-transformed material is conformal with the predefined device patterns. The superconducting properties of a transformed mesoscopic ring formed on a SOI platform are investigated. Low-temperature magnetoresistance oscillations, quantized in units of the fluxoid, h/2e, are observed.
RESUMO
Disorder is the primary obstacle in the current Majorana nanowire experiments. Reducing disorder or achieving ballistic transport is thus of paramount importance. In clean and ballistic nanowire devices, quantized conductance is expected, with plateau quality serving as a benchmark for disorder assessment. Here, we introduce ballistic PbTe nanowire devices grown by using the selective-area-growth (SAG) technique. Quantized conductance plateaus in units of 2e2/h are observed at zero magnetic field. This observation represents an advancement in diminishing disorder within SAG nanowires as most of the previously studied SAG nanowires (InSb or InAs) have not exhibited zero-field ballistic transport. Notably, the plateau values indicate that the ubiquitous valley degeneracy in PbTe is lifted in nanowire devices. This degeneracy lifting addresses an additional concern in the pursuit of Majorana realization. Moreover, these ballistic PbTe nanowires may enable the search for clean signatures of the spin-orbit helical gap in future devices.
RESUMO
Landau quantization associated with the quantized cyclotron motion of electrons under magnetic field provides the effective way to investigate topologically protected quantum states with entangled degrees of freedom and multiple quantum numbers. Here we report the cascade of Landau quantization in a strained type-II Dirac semimetal NiTe2 with spectroscopic-imaging scanning tunneling microscopy. The uniform-height surfaces exhibit single-sequence Landau levels (LLs) at a magnetic field originating from the quantization of topological surface state (TSS) across the Fermi level. Strikingly, we reveal the multiple sequence of LLs in the strained surface regions where the rotation symmetry is broken. First-principles calculations demonstrate that the multiple LLs attest to the remarkable lifting of the valley degeneracy of TSS by the in-plane uniaxial or shear strains. Our findings pave a pathway to tune multiple degrees of freedom and quantum numbers of TMDs via strain engineering for practical applications such as high-frequency rectifiers, Josephson diode and valleytronics.
RESUMO
Quantization is the process of mapping an input signal from an infinite continuous set to a countable set with a finite number of elements. It is a non-linear irreversible process, which makes the traditional methods of system identification no longer applicable. In this work, we propose a method for parsimonious linear time invariant system identification when only quantized observations, discerned from noisy data, are available. More formally, given a priori information on the system, represented by a compact set containing the poles of the system, and quantized realizations, our algorithm aims at identifying the least order system that is compatible with the available information. The proposed approach takes also into account that the available data can be subject to fragmentation. Our proposed algorithm relies on an ADMM approach to solve a â p , 0 < p < 1 , quasi-norm objective problem. Numerical results highlight the performance of the proposed approach when compared to the â 1 minimization in terms of the sparsity of the induced solution.
RESUMO
Block compressed sensing (BCS) is a promising method for resource-constrained image/video coding applications. However, the quantization of BCS measurements has posed a challenge, leading to significant quantization errors and encoding redundancy. In this paper, we propose a quantization method for BCS measurements using convolutional neural networks (CNN). The quantization process maps measurements to quantized data that follow a uniform distribution based on the measurements' distribution, which aims to maximize the amount of information carried by the quantized data. The dequantization process restores the quantized data to data that conform to the measurements' distribution. The restored data are then modified by the correlation information of the measurements drawn from the quantized data, with the goal of minimizing the quantization errors. The proposed method uses CNNs to construct quantization and dequantization processes, and the networks are trained jointly. The distribution parameters of each block are used as side information, which is quantized with 1 bit by the same method. Extensive experiments on four public datasets showed that, compared with uniform quantization and entropy coding, the proposed method can improve the PSNR by an average of 0.48 dB without using entropy coding when the compression bit rate is 0.1 bpp.