Johns Hopkins University

university

Verified

https://www.jhu.edu/

JohnsHopkins

Activity Feed Request to join this org

AI & ML interests

AI, Machine Learning, Interdisciplinary Collaboration

Recent Activity

toshi2k2 authored a paper 11 days ago

Name That Part: 3D Part Segmentation and Naming

toshi2k2 authored a paper 20 days ago

The Universal Weight Subspace Hypothesis

toshi2k2 authored a paper about 1 month ago

Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models

View all activity

Papers

Perceptual Taxonomy: Evaluating and Guiding Hierarchical Scene Reasoning in Vision-Language Models

View all Papers

RyanWW

authored 2 papers about 1 month ago

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 9

Perceptual Taxonomy: Evaluating and Guiding Hierarchical Scene Reasoning in Vision-Language Models

Paper • 2511.19526 • Published Nov 24, 2025 • 1

RyanWW

authored 3 papers 2 months ago

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Paper • 2406.00622 • Published Jun 2, 2024

3D-Aware Visual Question Answering about Parts, Poses and Occlusions

Paper • 2310.17914 • Published Oct 27, 2023

Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning

Paper • 2212.00259 • Published Dec 1, 2022

RyanWW

authored 4 papers 3 months ago

PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models

Paper • 2502.08636 • Published Feb 12, 2025

SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning

Paper • 2504.20024 • Published Apr 28, 2025

XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

Paper • 2510.15148 • Published Oct 16, 2025 • 2

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Paper • 2504.09656 • Published Apr 13, 2025

pratyushrt

authored a paper 3 months ago

Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets

Paper • 2508.14094 • Published Aug 15, 2025 • 1

ash56

authored 2 papers 5 months ago

Rapidly Adapting to New Voice Spoofing: Few-Shot Detection of Synthesized Speech Under Distribution Shifts

Paper • 2508.13320 • Published Aug 18, 2025 • 2

Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages

Paper • 2405.05376 • Published May 8, 2024

noandrews

authored 2 papers 7 months ago

GenVC: Self-Supervised Zero-Shot Voice Conversion

Paper • 2502.04519 • Published Feb 6, 2025

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Paper • 2506.11930 • Published Jun 13, 2025 • 53

carankt

authored 2 papers 7 months ago

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Paper • 2409.08425 • Published Sep 12, 2024 • 10

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3, 2025 • 8

westbrook

authored 4 papers 7 months ago

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3, 2025 • 8

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25, 2025 • 4

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Paper • 2505.14648 • Published May 20, 2025 • 9

Noise-robust Speech Separation with Fast Generative Correction

Paper • 2406.07461 • Published Jun 11, 2024