Real-Time Inverse Kinematics for Generating Multi-Constrained Movements of Virtual Human Characters Paper • 2507.00792 • Published Jul 1 • 1
ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input Paper • 2510.17617 • Published Oct 20 • 1
Conveying Meaning through Gestures: An Investigation into Semantic Co-Speech Gesture Generation Paper • 2510.17599 • Published Oct 20 • 1
Integrating Representational Gestures into Automatically Generated Embodied Explanations and its Effects on Understanding and Interaction Quality Paper • 2406.12544 • Published Jun 18, 2024
Augmented Co-Speech Gesture Generation: Including Form and Meaning Features to Guide Learning-Based Gesture Synthesis Paper • 2307.09597 • Published Jul 13, 2023
AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis Paper • 2305.01241 • Published May 2, 2023
Addressing Data Scarcity in Multimodal User State Recognition by Combining Semi-Supervised and Supervised Learning Paper • 2202.03775 • Published Feb 8, 2022
UniFusion: Vision-Language Model as Unified Encoder in Image Generation Paper • 2510.12789 • Published Oct 14 • 18
Processing and acquisition traces in visual encoders: What does CLIP know about your camera? Paper • 2508.10637 • Published Aug 14 • 7
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions Paper • 2506.16679 • Published Jun 20 • 1
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations Paper • 2303.09289 • Published Mar 16, 2023 • 2
Distilling Adversarial Prompts from Safety Benchmarks: Report for the Adversarial Nibbler Challenge Paper • 2309.11575 • Published Sep 20, 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation Paper • 2305.15296 • Published May 24, 2023 • 1
Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness? Paper • 2305.18398 • Published May 28, 2023 • 2
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis Paper • 2209.08891 • Published Sep 19, 2022 • 2
The Stable Artist: Steering Semantics in Diffusion Latent Space Paper • 2212.06013 • Published Dec 12, 2022 • 1
LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment Paper • 2406.05113 • Published Jun 7, 2024 • 3