John AI Lab

News

Older posts…

Featured Research

MixEval-X

The first any-to-any evaluations from real-world data mixtures.

Beam Search for Reasoning

A framework of stepwise self-evaluation LLM reasoning. A concurrent work with Tree of Thoughts.

Noisy Student

The first method that uses extra unlabeled data to achieve state-of-the-art on ImageNet. Also employed in AlphaFold 2 (Nobel Prize 2024), Google Search and other state-of-the-art AI systems.

RACE Dataset

The first large-scale language understanding dataset collected from exams for human. Currently MMLU and MATH are also collected from exams.

Mission

Feel the AGI. Use 10k GPUs like rich kids in the industry. Get 100k citations. Get Nobel prizes and Turing awards.

Just kidding haha. We work on democratizing AI in aspects including but not limited to resource-efficient AI, vision language models and more exploratory topics.

People

Faculty

Michael Qizhe Shieh
Assistant Professor
Jinjie Ni
Postdoc

PhD Students

Hannah Brown
Co-advised with Kenji Kawaguchi
Esther Gan E
Esther Gan
Co-advised with Min-Yen Kan
Yiran Zhao
Affiliated Member

Masters Students

Fengtao He
Fengtao He
National University of Singapore
Leon Lin
Leon Lin
National University of Singapore

Undergraduate Students

Qilong Feng
Qilong Feng
National University of Singapore
Zhennan Shen
Zhennan Shen
Shanghai Jiao Tong University
Yang Zhang
Yang Zhang
Peking University

Publications

Download BibTeX.

*: equal contribution, †: equal advising

2024
September
PDF Accelerating greedy coordinate gradient via probe sampling
Yiran Zhao, Wenyue Zheng, Tianle Cai, Xuan Long Do, Kenji Kawaguchi, Anirudh Goyal, and Michael Qizhe Shieh
NeurIPS 2024
2024
September
PDF Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P Lillicrap, Kenji Kawaguchi, and Michael Shieh
NeurIPS 2024 Workshop
2024
September
PDF Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models
Hongfu Liu*, Yuxi Xie*, Ye Wang, and Michael Shieh
EMNLP 2024
2024
September
PDF Reasoning Robustness of LLMs to Adversarial Typographical Errors
Esther Gan*, Yiran Zhao*, Liying Cheng, Yancan Mao, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, and Michael Shieh
EMNLP 2024
2024
May
PDF Instructcoder: Instruction tuning large language models for code editing
Kaixin Li*, Qisheng Hu*, James Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Michael Shieh†, and Junxian He†
ACL 2024 Workshop
2024
May
PDF Prompt optimization via adversarial in-context learning
Xuan Long Do*, Yiran Zhao*, Hannah Brown*, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Shieh†, and Junxian He†
ACL 2024 (Oral)
2023
October
PDF Self-evaluation guided beam search for reasoning
Yuxi Xie, Kenji Kawaguchi, Yiran Zhao, James Xu Zhao, Min-Yen Kan†, Junxian He†, and Michael Xie†
NeurIPS 2023
2023
October
PDF Automatic model selection with large language models for reasoning
James Xu Zhao, Yuxi Xie, Kenji Kawaguchi, Junxian He, and Michael Qizhe Xie
EMNLP 2023 Findings
2023
May
PDF Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering
Hai Ye, Qizhe Xie, and Hwee Tou Ng
ACL 2023
2021
February
PDF Meta pseudo labels
Hieu Pham, Zihang Dai, Qizhe Xie, and Quoc V. Le
CVPR 2021
2020
October
PDF Unsupervised data augmentation for consistency training
Qizhe Xie, Zihang Dai, Eduard Hovy, Thang Luong, and Quoc V. Le
NeurIPS 2020
2020
February
PDF Self-training with noisy student improves imagenet classification
Qizhe Xie, Minh-Thang Luong, Eduard Hovy, and Quoc V. Le
CVPR 2020
2017
November
PDF RACE: Large-scale ReAding Comprehension Dataset From Examinations
Guokun Lai*, Qizhe Xie*, Hanxiao Liu, Yiming Yang, and Eduard Hovy
EMNLP 2017