Published onDecember 18, 2024CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion ModelsSpatial-Constraints-Oriented-Pairingtraining-frameworktext-to-image-diffusion-modelsdata-ambiguitytext-encodersMMDiT-architecturespatial-relationshipsToken-ENcoding-ORderingstate-of-the-art-benchmarkshigh-quality-spatial-priorsUNet-architecture• State Key Lab of CAD&CG, Zhejiang University• vivo Mobile Communication Co. Ltd
Published onDecember 17, 2024Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation3D-geometric-informationimplicit-and-explicit-3D-representationsself-supervised-fine-tuningspatial-relationshipsspatial-geometry2D-foundation-modelspoint-cloud-datalarge-scale-pretrained-knowledgesimulation-benchmarksrobotic-manipulationtask-aware-masked-autoencoderLift3D-frameworkpositional-mappingreal-world-scenarios3D-features-extractionlarge-scale-robotic-3D-data• State Key Laboratory of Multimedia Information Processing, School of Computer Science, Peking University• Beijing Academy of Artificial Intelligence (BAAI)• CUHK
Published onDecember 16, 2024MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognitionmultimodal-Graph-Convolutional-Networksenergy-efficiencytemporal-dynamicsSpiking-Neural-Networksknowledge-distillationskeleton-based-action-recognitionspatial-relationshipsenergy-consumption-reduction• Beijing University of Posts and Telecommunications
Published onDecember 13, 2024Optimized 3D Point Labeling with Leaders Using the Beams Displacement Methodblock-processing-strategylabel-overlay-issues3D-point-labelingleader-linesposition-configurationspatial-relationshipstriangulated-graphmap-displacement-problemproximity-graphsvisibility-improvement
Published onDecember 11, 2024APS-LSTM: Exploiting Multi-Periodicity and Diverse Spatial Dependencies for Flood Forecastingadaptive-aggregationmulti-periodicityspatio-temporal-information-extractionLSTMdisaster-preventionspatial-relationshipshydrological-datanonlinear-temporal-patternsFast-Fourier-Transformflood-predictionself-attention-methodspatial-dependencies• Key Laboratory of Water Big Data Technology of Ministry of Water Resources, Hohai University, 211100 Nanjing, China.
Published onDecember 11, 20243D-Mem: 3D Scene Memory for Embodied Exploration and Reasoningreasoningmemory-retrievallifelong-autonomyFrontier-Snapshotsembodied-AIspatial-relationshipsembodied-explorationmemory-management3D-scene-representationsmulti-view-imagesactive-exploration• UMass Amherst
Published onDecember 10, 2024Driv3R: Learning Dense 4D Reconstruction for Autonomous Drivingdepth-estimationself-supervision4D-reconstructionnuScenes-datasettemporal-contextsmulti-modality-sensor-fusionspatial-relationshipstemporal-integration4D-flow-predictordynamic-scenesautonomous-drivingmulti-view-3D-consistencymoving-objectsoptimization-free-alignment• Tsinghua University• University of California, Berkeley
Published onDecember 10, 2024LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relationsobject-locationsScene-Graph-Generationstructured-graph-representationsvisual-scenesobject-relationstwo-stage-training-paradigmmultimodal-large-language-modeldata-construction-pipelineopen-vocabulary-contextsinstruction-tuning-datasetspatial-relationshipsdepth-information
Published onDecember 10, 2024A Riemannian Take on Distance Fields and Geodesic Flows in Roboticsnon-Euclidean-structuresRiemannian-eikonal-equationgeodesicshigh-dimensional-spacesRiemannian-metricminimal-energy-trajectorieslearning-techniquesgeodesic-distance-fieldpartial-differential-equationenergy-aware-motion-generationdistance-functionsoptimization-techniquesroboticsspatial-relationshipscontrol-techniquesgradient-flowphysics-informed-neural-networks• Sunrise Setting Ltd, UK• SAGE Publications Ltd, UK
Published onDecember 3, 2024Spatial-variant causal Bayesian inference for rapid seismic ground failures and impacts estimationspatial-relationshipsremote-sensing-technologiescausal-graph-based-Bayesian-networkbuilding-damagesatellite-imageryspatial-heterogeneityhazard-estimationseismic-activitypost-earthquake-ground-failuresbilateral-filter