An applied scientist @ Amazon AGI, building video generative models.
-
Amazon
- Santa Clara, CA
-
03:39
(UTC -08:00) - sy-zhang.github.io
- @zhangsongyang
- in/songyang-zhang
Pinned Loading
-
microsoft/VideoX
microsoft/VideoX PublicVideoX: a collection of video cross-modal models
-
TCMN-Release
TCMN-Release PublicCodes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
-
Geometric-Feature-Release
Geometric-Feature-Release PublicCodes for our WACV2017 paper: "On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks"
-
mugen-org/MUGEN_baseline
mugen-org/MUGEN_baseline Publicmultimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the training, evaluation and inference codes for these baselines.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.