Results 1 - 20 of 66
1.
J Acoust Soc Am ; 156(1): 548-559, 2024 Jul 01.
Article in English | MEDLINE | ID: mdl-39024384

ABSTRACT

Conventional near-field acoustic holography based on compressive sensing either does not fully exploit the underlying block-sparse structures of the signal or suffers from a mismatch between the actual and predefined block structure due to the lack of prior information about block partitions, resulting in poor accuracy in sound field reconstruction. In this paper, a pattern-coupled Bayesian compressive sensing method is proposed for sparse reconstruction of sound fields. The proposed method establishes a hierarchical Gaussian-Gamma probability model with a pattern-coupled prior based on the equivalent source method, transforming the sound field reconstruction problem into recovering the sparse coefficient vector of the equivalent source strengths within the compressive sensing framework. A set of hyperparameters is introduced to control the sparsity of each element in the sparse coefficient vector of the equivalent source strengths, where the sparsity of each element is determined by both its own hyperparameters and those of its immediate neighbors. This approach enables the promotion of block sparse solutions and achieves better performance in solving for the sparse coefficient vector of the equivalent source strengths without prior information of block partitions. The effectiveness and superiority of the proposed method in reconstructing sound fields are verified by simulations and experiments.
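
The abstract does not state the exact form of the prior. As a hedged illustration only, the pattern-coupled sparse Bayesian learning literature commonly uses a hierarchy like the one sketched below, in which the precision of each coefficient depends on its own hyperparameter and on those of its two immediate neighbors. The symbols here (q for the equivalent source strengths, α for the hyperparameters, β for the coupling weight, a and b for the Gamma shape and rate) are assumptions chosen for illustration, not notation taken from the paper.

```latex
% Illustrative pattern-coupled Gaussian-Gamma hierarchy (1-D neighborhood assumed).
% q_i      : i-th equivalent source strength coefficient
% \alpha_i : non-negative hyperparameter controlling the sparsity of q_i
% \beta    : coupling weight in [0, 1]; \beta = 0 recovers an uncoupled prior
\begin{align}
p(\mathbf{q} \mid \boldsymbol{\alpha})
  &= \prod_{i=1}^{N} \mathcal{N}\!\left(q_i \,\middle|\, 0,\;
     \left(\alpha_i + \beta\,\alpha_{i-1} + \beta\,\alpha_{i+1}\right)^{-1}\right),
  \qquad \alpha_0 = \alpha_{N+1} = 0, \\
p(\boldsymbol{\alpha})
  &= \prod_{i=1}^{N} \mathrm{Gamma}\!\left(\alpha_i \mid a, b\right).
\end{align}
```

Under this form, a large α_i drives q_i and its neighbors toward zero together, which is how block-sparse solutions can emerge without block partitions being specified in advance.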

2.
Article in English | MEDLINE | ID: mdl-39042535

ABSTRACT

Generative Adversarial Networks have achieved significant advancements in generating and editing high-resolution images. However, most methods suffer from either requiring extensive labeled datasets or strong prior knowledge. It is also challenging for them to disentangle correlated attributes with few-shot data. In this paper, we propose FEditNet++, a GAN-based approach to explore latent semantics. It aims to enable attribute editing with limited labeled data and disentangle the correlated attributes. We propose a layer-wise feature contrastive objective, which takes into consideration content consistency and facilitates the invariance of the unrelated attributes before and after editing. Furthermore, we harness the knowledge from the pretrained discriminative model to prevent overfitting. In particular, to solve the entanglement problem between the correlated attributes from data and semantic latent correlation, we extend our model to jointly optimize multiple attributes and propose a novel decoupling loss and cross-assessment loss to disentangle them from both latent and image space. We further propose a novel-attribute disentanglement strategy to enable editing of novel attributes with unknown entanglements. Finally, we extend our model to accurately edit the fine-grained attributes. Qualitative and quantitative assessments demonstrate that our method outperforms state-of-the-art approaches across various datasets, including CelebA-HQ, RaFD, Danbooru2018 and LSUN Church.

3.
bioRxiv ; 2024 Mar 20.
Article in English | MEDLINE | ID: mdl-38562742

ABSTRACT

Antibiotics have dose-dependent effects on exposed bacteria. The medicinal use of antibiotics relies on their growth-inhibitory activities at sufficient concentrations. At subinhibitory concentrations, exposure effects vary widely among different antibiotics and bacteria. Bacillus subtilis responds to bacteriostatic translation inhibitors by mobilizing a population of cells (MOB-Mobilized Bacillus) to spread across a surface. How B. subtilis regulates the antibiotic-induced mobilization is not known. In this study, we used chloramphenicol to identify regulatory functions that B. subtilis requires to coordinate cell mobilization following subinhibitory exposure. We measured changes in gene expression and metabolism and mapped the results to a network of regulatory proteins that direct the mobile response. Our data reveal that several transcriptional regulators coordinately control the reprogramming of metabolism to support mobilization. The network regulates changes in glycolysis, nucleotide metabolism, and amino acid metabolism that are signature features of the mobilized population. Among the hundreds of genes with changing expression, we identified two, pdhA and pucA, where the magnitudes of their changes in expression, and in the abundance of associated metabolites, reveal hallmark metabolic features of the mobilized population. Using reporters of pdhA and pucA expression, we visualized the separation of major branches of metabolism in different regions of the mobilized population. Our results reveal a regulated response to chloramphenicol exposure that enables a population of bacteria in different metabolic states to mount a coordinated mobile response.

4.
Article in English | MEDLINE | ID: mdl-38630565

ABSTRACT

Some robust point cloud registration approaches with controllable pose refinement magnitude, such as ICP and its variants, are commonly used to improve 6D pose estimation accuracy. However, the effectiveness of these methods gradually diminishes with the advancement of deep learning techniques and the enhancement of initial pose accuracy, primarily due to their lack of specific design for pose refinement. In this paper, we propose Point Cloud Completion and Keypoint Refinement with Fusion Data (PCKRF), a new pose refinement pipeline for 6D pose estimation. The pipeline consists of two steps. First, it completes the input point clouds via a novel pose-sensitive point completion network. The network uses both local and global features with pose information during point completion. Then, it registers the completed object point cloud with the corresponding target point cloud by our proposed Color supported Iterative KeyPoint (CIKP) method. The CIKP method introduces color information into registration and registers a point cloud around each keypoint to increase stability. The PCKRF pipeline can be integrated with existing popular 6D pose estimation methods, such as the full flow bidirectional fusion network, to further improve their pose estimation accuracy. Experiments demonstrate that our method exhibits superior stability compared to existing approaches when optimizing initial poses with relatively high precision. Notably, the results indicate that our method effectively complements most existing pose estimation techniques, leading to improved performance in most cases. Furthermore, our method achieves promising results even in challenging scenarios involving textureless and symmetrical objects. Our source code is available at https://github.com/zhanhz/KRF.

5.
J Hazard Mater ; 469: 133907, 2024 May 05.
Article in English | MEDLINE | ID: mdl-38471380

ABSTRACT

Pyrene is a high-molecular-weight polycyclic aromatic hydrocarbon (HMW-PAH). It is a ubiquitous, persistent, and carcinogenic environmental contaminant that has raised concern worldwide. This research explored synergistic bacterial communities for efficient pyrene degradation in seven typical Southern China mangroves. The bacterial communities of the seven mangroves were enriched with pyrene, and the enriched communities showed an excellent pyrene degradation capacity of > 95% (except for the HK and ZJ mangroves). In 16S rRNA gene sequencing and metagenomic analyses, Devosia, Hyphomicrobium, Flavobacterium, Marinobacter, Algoriphagus, and Youhaiella all showed significant positive correlations with pyrene (R > 0, p < 0.05), indicating that these genera play a vital role in pyrene metabolism. Meanwhile, functional genes involved in pyrene degradation, including nagAa, ndoR, and pcaG, among others, were enriched in the bacterial communities. Furthermore, analyses of functional genes and binned genomes demonstrated that some bacterial communities work as a team to cooperatively degrade pyrene. Interestingly, genes related to biogeochemical cycles, such as narG, soxA, and cyxJ, were also enriched, suggesting that the bacterial communities also help maintain the stability of the ecological environment. In addition, some novel species with pyrene-degradation potential were identified in the pyrene-degrading bacterial communities, which can enrich the resource pool of pyrene-degrading strains. Overall, this study will help develop further research strategies for pollutant removal.


Subject(s)
Microbiota , Polycyclic Aromatic Hydrocarbons , Pyrenes/metabolism , Polycyclic Aromatic Hydrocarbons/analysis , Bacteria/metabolism , Biodegradation, Environmental
6.
J Hazard Mater ; 469: 134036, 2024 May 05.
Article in English | MEDLINE | ID: mdl-38493623

ABSTRACT

1,2,5,6,9,10-Hexabromocyclododecanes (HBCDs) are a class of persistent organic pollutants (POPs). This research investigated the ability of 12 microbial communities enriched from sediments of four mangroves in China to transform HBCDs. Six microbial communities achieved high transformation rates (27.5-97.7%) after 12 generations of serial transfer. Bacteria, rather than fungi, were the main contributors to HBCD transformation. Analyses of the bacterial compositions and binned genomes showed that Alcanivorax (55.246-84.942%), which harbors the haloalkane dehalogenase genes dadAH and dadBH, dominated the microbial communities with high transformation rates. Moreover, expression of dadAH and dadBH in the microbial communities and in the Alcanivorax isolate could be induced by HBCDs. Further, the purified proteins DadAH and DadBH showed high conversion rates for HBCDs in 36 h (91.9 ± 7.4 and 101.0 ± 1.8%, respectively). Engineered Escherichia coli BL21 strains harboring the two genes converted 5.7 ± 0.4 and 35.1 ± 0.1% of HBCDs, respectively, less than their cell-free crude extracts (61.2 ± 5.2 and 56.5 ± 8.7%, respectively). The diastereoisomer-specific transformation trend for both the microbial communities and the enzymes was γ- > α- > β-HBCD, which differed from the α- > β- > γ-HBCD trend of the Alcanivorax isolate. The identified transformation products indicated that HBCDs were dehalogenated via HBr elimination (dehydrobromination) and via hydrolytic and reductive debromination pathways in the enriched cultures. The two enzymes converted HBCDs via hydrolytic debromination. The present research provides a theoretical basis for the biotransformation of HBCDs by microbial communities and for the bioremediation of HBCD contamination in the environment.


Subject(s)
Flame Retardants , Hydrocarbons, Brominated , Microbiota , Stereoisomerism , Hydrocarbons, Brominated/metabolism , Biotransformation , Bacteria/metabolism
8.
IEEE Trans Vis Comput Graph ; 30(1): 606-616, 2024 Jan.
Article in English | MEDLINE | ID: mdl-37871082

ABSTRACT

As communications are increasingly taking place virtually, the ability to present well online is becoming an indispensable skill. Online speakers are facing unique challenges in engaging with remote audiences. However, there has been a lack of evidence-based analytical systems for people to comprehensively evaluate online speeches and further discover possibilities for improvement. This paper introduces SpeechMirror, a visual analytics system facilitating reflection on a speech based on insights from a collection of online speeches. The system estimates the impact of different speech techniques on effectiveness and applies them to a speech to give users awareness of the performance of speech techniques. A similarity recommendation approach based on speech factors or script content supports guided exploration to expand knowledge of presentation evidence and accelerate the discovery of speech delivery possibilities. SpeechMirror provides intuitive visualizations and interactions for users to understand speech factors. Among them, SpeechTwin, a novel multimodal visual summary of speech, supports rapid understanding of critical speech factors and comparison of different speech samples, and SpeechPlayer augments the speech video by integrating visualization of the speaker's body language with interaction, for focused analysis. The system utilizes visualizations suited to the distinct nature of different speech factors for user comprehension. The proposed system and visualization techniques were evaluated with domain experts and amateurs, demonstrating usability for users with low visualization literacy and its efficacy in assisting users to develop insights for potential improvement.


Subject(s)
Computer Graphics , Speech , Humans , Communication
9.
Article in English | MEDLINE | ID: mdl-38145513

ABSTRACT

As a significant geometric feature of 3D point clouds, sharp features play an important role in shape analysis, 3D reconstruction, registration, localization, etc. Current sharp feature detection methods are still sensitive to the quality of the input point cloud, and the detection performance is affected by random noisy points and non-uniform densities. In this paper, using the prior knowledge of geometric features, we propose a Multi-scale Laplace Network (MSL-Net), a new deep-learning-based method based on an intrinsic neighbor shape descriptor, to detect sharp features from 3D point clouds. Firstly, we establish a discrete intrinsic neighborhood of the point cloud based on the Laplacian graph, which reduces the error of local implicit surface estimation. Then, we design a new intrinsic shape descriptor based on the intrinsic neighborhood, combined with enhanced normal extraction and cosine-based field estimation function. Finally, we present the backbone of MSL-Net based on the intrinsic shape descriptor. Benefiting from the intrinsic neighborhood and shape descriptor, our MSL-Net has simple architecture and is capable of establishing accurate feature prediction that satisfies the manifold distribution while avoiding complex intrinsic metric calculations. Extensive experimental results demonstrate that with the multi-scale structure, MSL-Net has a strong analytical ability for local perturbations of point clouds. Compared with state-of-the-art methods, our MSL-Net is more robust and accurate. The code is publicly available at.

10.
Animals (Basel) ; 13(13), 2023 Jul 07.
Article in English | MEDLINE | ID: mdl-37444031

ABSTRACT

We describe a new species of the genus Pareas from Baise City, Guangxi Zhuang Autonomous Region, China, based on morphological and molecular evidence. Pareas baiseensis sp. nov. is distinguished from its congeners by the combination of (1) yellowish-brown body colouration; (2) frontal subhexagonal to diamond-shaped, with its lateral sides converging posteriorly; (3) anterior pair of chin shields longer than broad; (4) loreal not in contact with the eye, prefrontal in contact with the eye, two or three suboculars; (5) dorsal scales in 15-15-15 rows, five rows of mid-dorsal scales keeled at the middle of the body, one vertebral scale row enlarged; (6) 187-191 ventrals, 89-97 subcaudals, all divided, cloacal plate single; (7) two postocular stripes, the nuchal area forming a dark black four-pointed fork collar with the middle tines shorter than the outside tines. The genetic divergence (uncorrected p-distance) between the new species and other representatives of Pareas ranged from 13.9% to 24.4% for Cytochrome b (Cyt b) and 12.1% to 25.5% for NADH dehydrogenase subunit 4 (ND4). Phylogenetic analyses of mitochondrial DNA gene data recovered the new species as the sister taxon to (P. boulengeri + P. chinensis) from China.
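
For readers unfamiliar with the divergence measure mentioned in this abstract, an uncorrected p-distance is simply the proportion of compared alignment sites at which two sequences differ. The sketch below is a minimal illustration under stated assumptions: the function name `p_distance`, the choice to skip gaps and ambiguous bases (pairwise deletion), and the toy sequences are all hypothetical and are not taken from the study.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected p-distance: share of compared sites that differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        # Assumption: sites with gaps or ambiguous bases are skipped
        # (pairwise deletion); other treatments are possible.
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical toy alignment, not data from the study.
toy_a = "ATGACCAACA"
toy_b = "ATGATCAA-A"
print(f"{p_distance(toy_a, toy_b):.3f}")
```

In the toy alignment, nine sites are comparable and one differs, so the value is 1/9. The percentages reported in the abstract would correspond to this ratio multiplied by 100, computed over the full Cyt b or ND4 alignment.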

11.
Zootaxa ; 5319(1): 76-90, 2023 Jul 24.
Article in English | MEDLINE | ID: mdl-37518249

ABSTRACT

A new species of the genus Hebius Thompson, 1913 is described from Youjiang District, Baise City, Guangxi Zhuang Autonomous Region, China, based on a single adult female specimen. It can be distinguished from its congeners by the following combination of characters: (1) dorsal scale rows 19-17-17, feebly keeled except the outermost row; (2) tail comparatively long, TAL/TL ratio 0.30 in females; (3) ventrals 160 (+ 3 preventrals); (4) subcaudals 112; (5) supralabials 9, the fourth to sixth in contact with the eye; (6) infralabials 10, the first 5 touching the first pair of chin shields; (7) preocular 1; (8) postoculars 2; (9) temporals 4, arranged in three rows (1+1+2); (10) maxillary teeth 30, the last 3 enlarged, without diastema; (11) postocular streak present; (12) dorsal background color brownish black, with a conspicuous, uniform, continuous beige stripe extending from behind the eye to the end of the tail; (13) anterior venter creamish-yellow, gradually fading toward the rear, with irregular black blotches in the middle and outer quarter of the ventrals, the posterior part almost completely black. The discovery of the new species increases the number of species in the genus Hebius to 51.


Subject(s)
Colubridae , Lizards , Female , Animals , China , Animal Distribution , Tail , Animal Structures , Phylogeny
12.
IEEE Trans Image Process ; 32: 3136-3149, 2023.
Article in English | MEDLINE | ID: mdl-37227918

ABSTRACT

Benefiting from the intuitiveness and naturalness of sketch interaction, sketch-based video retrieval (SBVR) has received considerable attention in the video retrieval research area. However, most existing SBVR research still lacks the capability of accurate video retrieval with fine-grained scene content. To address this problem, in this paper we investigate a new task, which focuses on retrieving the target video by utilizing a fine-grained storyboard sketch depicting the scene layout and the visual characteristics of the major foreground instances (e.g., appearance, size, and pose) of the video; we call such a task "fine-grained scene-level SBVR". The most challenging issue in this task is how to perform scene-level cross-modal alignment between sketch and video. Our solution consists of two parts. First, we construct a scene-level sketch-video dataset called SketchVideo, in which sketch-video pairs are provided and each pair contains a clip-level storyboard sketch and several keyframe sketches (corresponding to video frames). Second, we propose a novel deep learning architecture called Sketch Query Graph Convolutional Network (SQ-GCN). In SQ-GCN, we first adaptively sample the video frames to improve video encoding efficiency, and then construct appearance and category graphs to jointly model visual and semantic alignment between sketch and video. Experiments show that our fine-grained scene-level SBVR framework with the SQ-GCN architecture outperforms state-of-the-art fine-grained retrieval methods. The SketchVideo dataset and SQ-GCN code are available on the project webpage https://iscas-mmsketch.github.io/FG-SL-SBVR/.

13.
Article in English | MEDLINE | ID: mdl-37220037

ABSTRACT

3D dense captioning aims to semantically describe each object detected in a 3D scene, which plays a significant role in 3D scene understanding. Previous works lack a complete definition of 3D spatial relationships and directly integrate the visual and language modalities, thus ignoring the discrepancies between the two modalities. To address these issues, we propose a novel complete 3D relationship extraction modality alignment network, which consists of three steps: 3D object detection, complete 3D relationship extraction, and modality alignment captioning. To comprehensively capture the 3D spatial relationship features, we define a complete set of 3D spatial relationships, including the local spatial relationship between objects and the global spatial relationship between each object and the entire scene. To this end, we propose a complete 3D relationship extraction module based on message passing and self-attention to mine multi-scale spatial relationship features and inspect the transformation to obtain features in different views. In addition, we propose the modality alignment caption module to fuse multi-scale relationship features and generate descriptions that bridge the semantic gap from the visual space to the language space with the prior information in the word embedding, helping to generate improved descriptions for the 3D scene. Extensive experiments demonstrate that the proposed model outperforms the state-of-the-art methods on the ScanRefer and Nr3D datasets.

14.
Appl Microbiol Biotechnol ; 107(12): 3877-3886, 2023 Jun.
Article in English | MEDLINE | ID: mdl-37195422

ABSTRACT

Complete ammonia oxidizers (Comammox) are of great significance for studying nitrification and expanding the understanding of the nitrogen cycle. Moreover, Comammox bacteria are also crucial in natural and engineered environments due to their role in wastewater treatment and in maintaining the flux of greenhouse gases to the atmosphere. However, only a few studies have addressed Comammox bacteria and their role in ammonia and nitrite oxidation in the environment. This review mainly focuses on summarizing the genomes of Nitrospira in the NCBI database. The ecological distribution of Nitrospira is also reviewed, and the influence of environmental parameters on the genus Nitrospira in different environments is summarized. Furthermore, the roles of Nitrospira, especially comammox Nitrospira, in the carbon, nitrogen, and sulfur cycles are discussed. In addition, current research and development regarding comammox Nitrospira is summarized, along with the scope of future research. KEY POINTS: • Most Comammox Nitrospira are widely distributed in both aquatic and terrestrial ecosystems but have been studied less frequently in extreme environments. • Comammox Nitrospira can be involved in different nitrogen transformation processes but are rarely involved in nitrogen fixation. • Stable isotope and transcriptome techniques are important methods for studying the metabolic function of comammox Nitrospira.


Subject(s)
Ammonia , Ecosystem , Ammonia/metabolism , Oxidation-Reduction , Bacteria/metabolism , Nitrogen Cycle , Nitrification , Phylogeny , Archaea/metabolism
15.
Article in English | MEDLINE | ID: mdl-37021894

ABSTRACT

For 3D animators, choreography with artificial intelligence has attracted more attention recently. However, most existing deep learning methods mainly rely on music for dance generation and lack sufficient control over generated dance motions. To address this issue, we introduce the idea of keyframe interpolation for music-driven dance generation and present a novel transition generation technique for choreography. Specifically, this technique synthesizes visually diverse and plausible dance motions by using normalizing flows to learn the probability distribution of dance motions conditioned on a piece of music and a sparse set of key poses. Thus, the generated dance motions respect both the input musical beats and the key poses. To achieve a robust transition of varying lengths between the key poses, we introduce a time embedding at each timestep as an additional condition. Extensive experiments show that our model generates more realistic, diverse, and beat-matching dance motions than the compared state-of-the-art methods, both qualitatively and quantitatively. Our experimental results demonstrate the superiority of the keyframe-based control for improving the diversity of the generated dance motions.

16.
Article in English | MEDLINE | ID: mdl-36884369

ABSTRACT

The genus Tamlana from the Bacteroidota currently includes six validated species. Two strains designated PT2-4T and 62-3T were isolated from Sargassum abundant at the Pingtan island coast in the Fujian Province of China. 16S rRNA gene sequence analysis showed that the closest described relative of strains PT2-4T and 62-3T is Tamlana sedimentorum JCM 19808T with 98.40 and 97.98% sequence similarity, respectively. The 16S rRNA gene sequence similarity between strain PT2-4T and strain 62-3T was 98.68 %. Furthermore, the highest average nucleotide identity values were 87.34 and 88.97 % for strains PT2-4T and 62-3T, respectively. The highest DNA-DNA hybridization (DDH) value of strain PT2-4T was 35.2 % with strain 62-3T, while the DDH value of strain 62-3T was 37.7 % with T. sedimentorum JCM 19808T. Growth of strains PT2-4T and 62-3T occurs at 15-40 °C (optimum, 30 °C) with 0-4 % (w/v) NaCl (optimum 0-1 %). Strains PT2-4T and 62-3T can grow from pH 5.0 to 10.0 (optimum, pH 7.0). The major fatty acids of strains PT2-4T and 62-3T are iso-C15 : 0 and iso G-C15 : 1. MK-6 is the sole respiratory quinone. Genomic and physiological analyses of strains PT2-4T and 62-3T showed corresponding adaptive features. Significant adaptation to the growth environment of macroalgae includes the degradation of brown algae-derived diverse polysaccharides (alginate, laminarin and fucoidan). Notably, strain PT2-4T can utilize laminarin, fucoidan and alginate via specific carbohydrate-active enzymes encoded in polysaccharide utilization loci, rarely described for the genus Tamlana to date. Based on their distinct physiological characteristics and the traits of utilizing polysaccharides from Sargassum, strains PT2-4T and 62-3T are suggested to be classified into two novel species, Tamlana laminarinivorans sp. nov. and Tamlana sargassicola sp. nov. (type strain PT2-4T=MCCC 1K04427T=KCTC 92183T and type strain 62-3T=MCCC 1K04421T=KCTC 92182T).


Subject(s)
Fatty Acids , Sargassum , Fatty Acids/chemistry , Seawater , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA , Phylogeny , Bacterial Typing Techniques , Base Composition , DNA, Bacterial/genetics , Genomics , Adaptation, Physiological
17.
Chemosphere ; 325: 138412, 2023 Jun.
Article in English | MEDLINE | ID: mdl-36925001

ABSTRACT

The adaptation of microbial communities to long-term contamination by hexabromocyclododecanes (HBCDs) has not been well studied. Our previous study found that 8 months of HBCD contamination in microcosms constructed from sediments of two different mangrove forests resulted in serious acidification (pH 2-3). This study reanalyzed the previous sequencing data and compared them with data obtained after 20 months to investigate the adaptive properties of microbial communities under the stress of HBCDs and acidification. We hypothesized that the reassembly was based on the fitness of taxa. The results indicated that eukaryotes and fungi might have a better adaptive capacity to these deteriorated habitats. The eukaryotic taxa Eufallia and Syncystis and the fungal taxon Wickerhamomyces were detected only after 20 months of contamination. Moreover, the eukaryotic taxa Caloneis and Nitzschia and the fungal taxon Talaromyces were dominant in most of the microbial communities (14.467-95.941%). The functional compositions were sediment-dependent and more divergent than the community reassemblies. Network and co-occurrence analyses suggested that acidophiles such as Acidisoma and Acidiphilium gained more positive relations under the long-term stress. Acidophilic taxa and genes involved in resistance to acidification and HBCD toxicity were enriched, for example, the bacteria Acidisoma and Acidiphilium, the archaeon Thermogymnomonas, the eukaryote Nitzschia, and the genes kdpC, odc1, polA, gst, and sod-2. These genes are involved in the oxidative stress response, energy metabolism, DNA damage repair, potassium transport, and decarboxylation. This suggests that the microbial communities might cope with the stress from HBCDs and acidification via multiple pathways. The present research sheds light on the evolution of microbial communities under the long-term stress of HBCD contamination and acidification.


Subject(s)
Hydrocarbons, Brominated , Microbiota , Hydrocarbons, Brominated/analysis , Eukaryota/metabolism , Archaea/genetics , Archaea/metabolism
18.
Sci Rep ; 13(1): 2995, 2023 Feb 21.
Article in English | MEDLINE | ID: mdl-36810767

ABSTRACT

Positive human-agent relationships can effectively improve human experience and performance in human-machine systems or environments. The characteristics of agents that enhance this relationship have garnered attention in human-agent or human-robot interactions. In this study, based on the rule of the persona effect, we study the effect of an agent's social cues on human-agent relationships and human performance. We constructed a tedious task in an immersive virtual environment, designing virtual partners with varying levels of human likeness and responsiveness. Human likeness encompassed appearance, sound, and behavior, while responsiveness referred to the way agents responded to humans. Based on the constructed environment, we present two studies to explore the effects of an agent's human likeness and responsiveness to agents on participants' performance and perception of human-agent relationships during the task. The results indicate that when participants work with an agent, its responsiveness attracts attention and induces positive feelings. Agents with responsiveness and appropriate social response strategies have a significant positive effect on human-agent relationships. These results shed some light on how to design virtual agents to improve user experience and performance in human-agent interactions.


Subject(s)
Attention , Emotions , Humans , Man-Machine Systems
19.
IEEE Trans Vis Comput Graph ; 29(4): 2203-2210, 2023 Apr.
Article in English | MEDLINE | ID: mdl-34752397

ABSTRACT

Caricature is an artistic style of depicting human faces that attracts considerable attention in the entertainment industry. So far, only a few 3D caricature generation methods exist, and all of them require some caricature information (e.g., a caricature sketch or 2D caricature) as input. This kind of input, however, is difficult for non-professional users to provide. In this paper, we propose an end-to-end deep neural network model that generates high-quality 3D caricatures directly from a normal 2D face photo. The most challenging issue for our system is that the source domain of face photos (characterized by normal 2D faces) is significantly different from the target domain of 3D caricatures (characterized by 3D exaggerated face shapes and textures). To address this challenge, we: (1) build a large dataset of 5,343 3D caricature meshes and use it to establish a PCA model in the 3D caricature shape space; (2) reconstruct a normal full 3D head from the input face photo and use its PCA representation in the 3D caricature shape space to establish correspondences between the input photo and 3D caricature shape; and (3) propose a novel character loss and a novel caricature loss based on previous psychological studies on caricatures. Experiments including a novel two-level user study show that our system can generate high-quality 3D caricatures directly from normal face photos.

20.
IEEE Trans Vis Comput Graph ; 29(3): 1785-1798, 2023 Mar.
Article in English | MEDLINE | ID: mdl-34851826

ABSTRACT

3D reconstruction from single-view images is a long-standing research problem. There have been various methods based on point clouds and volumetric representations. Despite their success in 3D model generation, it is quite challenging for these approaches to deal with models with complex topology and fine geometric details. Thanks to recent advances in deep shape representations, learning the structure and detail representation using deep neural networks is a promising direction. In this article, we propose a novel approach named STD-Net to reconstruct 3D models utilizing a mesh representation that is well suited for characterizing complex structures and geometric details. Our method consists of (1) an auto-encoder network for recovering the structure of an object with bounding box representation from a single-view image; (2) a topology-adaptive GCN for updating vertex positions for meshes of complex topology; and (3) a unified mesh deformation block that deforms the structural boxes into structure-aware meshes. Evaluation on ShapeNet and PartNet shows that STD-Net has better performance than state-of-the-art methods in reconstructing complex structures and fine geometric details.
