Sparseadapter: An easy approach for improving the parameter-efficiency of adapters S He, L Ding, D Dong, M Zhang, D Tao EMNLP 2022 Findings, 2022 | 49 | 2022 |
Vega-mt: The jd explore academy translation system for wmt22 C Zan, K Peng, L Ding, B Qiu, B Liu, S He, Q Lu, Z Zhang, C Liu, W Liu, ... Seventh Conference on Machine Translation (WMT22), 2022 | 46 | 2022 |
Reflection-tuning: Data recycling improves llm instruction-tuning M Li, L Chen, J Chen, S He, H Huang, J Gu, T Zhou arXiv preprint arXiv:2310.11716, 2023 | 7* | 2023 |
Mera: Merging pretrained adapters for few-shot learning S He, RZ Fan, L Ding, L Shen, T Zhou, D Tao arXiv preprint arXiv:2308.15982, 2023 | 7 | 2023 |
PAD-Net: An Efficient Framework for Dynamic Networks S He, L Ding, D Dong, B Liu, F Yu, D Tao ACL 2023, 2023 | 6* | 2023 |
Merging experts into one: Improving computational efficiency of mixture of experts S He, RZ Fan, L Ding, L Shen, T Zhou, D Tao EMNLP 2023 Oral, 2023 | 4 | 2023 |
Superfiltering: Weak-to-strong data filtering for fast instruction-tuning M Li, Y Zhang, S He, Z Li, H Zhao, J Wang, N Cheng, T Zhou arXiv preprint arXiv:2402.00530, 2024 | 3 | 2024 |
Reformatted Alignment RZ Fan, X Li, H Zou, J Li, S He, E Chern, J Hu, P Liu arXiv preprint arXiv:2402.12219, 2024 | 2 | 2024 |
Sd-conv: Towards the parameter-efficiency of dynamic convolution S He, C Jiang, D Dong, L Ding Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 2 | 2023 |
Selective reflection-tuning: Student-selected data recycling for llm instruction-tuning M Li, L Chen, J Chen, S He, J Gu, T Zhou arXiv preprint arXiv:2402.10110, 2024 | 1 | 2024 |
Accurate Prediction of Antibody Function and Structure Using Bio-Inspired Antibody Language Model H Jing, Z Gao, S Xu, T Shen, Z Peng, S He, T You, S Ye, W Lin, S Sun bioRxiv, 2023.08. 30.555473, 2023 | 1 | 2023 |
Multi-modal Attention Network for Stock Movements Prediction S He, S Gu The AAAI-22 Workshop on Knowledge Discovery from Unstructured Data in …, 2021 | 1 | 2021 |
RESSA: Repair Sparse Vision-Language Models via Sparse Cross-Modality Adaptation S He, T Chen arXiv preprint arXiv:2404.02424, 2024 | | 2024 |
NeuralSlice: Neural 3D triangle mesh reconstruction via slicing 4D tetrahedral meshes C Jiang, J Yang, S He, Y Lai, L Gao | | 2023 |