Urmăriți
Jinfa Huang
Jinfa Huang
University of Rochester, Peking University
Adresă de e-mail confirmată pe ur.rochester.edu - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Moe-llava: Mixture of experts for large vision-language models
B Lin, Z Tang, Y Ye, J Cui, B Zhu, P Jin, J Huang, J Zhang, M Ning, ...
arXiv preprint arXiv:2401.15947, 2024
722024
A Survey of Large Language Models in Medicine: Principles, Applications, and Challenges
H Zhou, F Liu, B Gu, X Zou, J Huang, J Wu, Y Li, SS Chen, P Zhou, J Liu, ...
arXiv preprint arXiv:2311.05112, 2023
53*2023
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
P Jin, J Huang, P Xiong, S Tian, C Liu, X Ji, L Yuan, J Chen
CVPR 2023, Highlight, 2023
492023
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
P Jin, J Huang, F Liu, X Wu, S Ge, G Song, D Clifton, J Chen
NeurIPS 2022, Spotlight, 30291-30306, 2022
482022
Weakly-supervised 3d spatial reasoning for text-based visual question answering
H Li, J Huang, P Jin, G Song, Q Wu, J Chen
IEEE Transactions on Image Processing 32, 3367-3382, 2023
31*2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
P Jin, H Li, Z Cheng, J Huang, Z Wang, L Yuan, C Liu, J Chen
IJCAI 2023, 2023
242023
Gpt-4V (ision) as a Social Media Analysis Engine
H Lyu, J Huang, D Zhang, Y Yu, X Mou, J Pan, Z Yang, Z Wei, J Luo
arXiv preprint arXiv:2311.07547, 2023
182023
Guoym at SemEval-2020 task 8: Ensemble-based Classification of Visuo-lingual Metaphor in Memes
Y Guo, J Huang, Y Dong, M Xu
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 1120-1125, 2020
172020
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
S Yuan, J Huang, Y Shi, Y Xu, R Zhu, B Lin, X Cheng, L Yuan, J Luo
arXiv preprint arXiv:2404.05014, 2024
82024
LLMBind: A unified modality-task integration framework
B Zhu, P Jin, M Ning, B Lin, J Huang, Q Song, M Pan, L Yuan
arXiv preprint arXiv:2402.14891, 2024
52024
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
J Wang, C Zhang, J Huang, B Ren, Z Deng
ACMMM 2023, 2023
52023
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
S Zhang, J Huang, Q Zhou, Z Wang, F Wang, J Luo, J Yan
ICLR 2024, 2024
42024
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs
J Wang, J Huang, C Zhang, Z Deng
ICRA 2023, 2023
42023
Ldnn: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness Understanding
Y Wang, J Wu, J Huang, G Hattori, Y Takishima, S Wada, R Kimura, ...
Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020
42020
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Z Wan, Z Wu, C Liu, J Huang, Z Zhu, P Jin, L Wang, L Yuan
arXiv preprint arXiv:2406.18139, 2024
32024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, L Yuan, ...
ACL 2024 Findings, 2024
32024
Chronomagic-bench: A benchmark for metamorphic evaluation of text-to-time-lapse video generation
S Yuan, J Huang, Y Xu, Y Liu, S Zhang, Y Shi, R Zhu, X Cheng, J Luo, ...
arXiv preprint arXiv:2406.18522, 2024
22024
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
J Huang, J Pan, Z Wan, H Lyu, J Luo
arXiv preprint arXiv:2407.21004, 2024
12024
A Survey of Camouflaged Object Detection and Beyond
F Xiao, S Hu, Y Shen, C Fang, J Huang, C He, L Tang, Z Yang, X Li
arXiv preprint arXiv:2408.14562, 2024
2024
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
H Tang, M Cao, J Huang, R Liu, P Jin, G Li, X Liang
arXiv preprint arXiv:2408.10575, 2024
2024
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20