[2023.11] Please checkout our S3Eval benchmark to see how a synthetic dataset can be used to systematically π analze & π¬ evaluate language models!
[2023.10] 3 papers got accepted by EMNLP 2023! If you'd like to hang out with me during the conference in πΈπ¬, feel free to DM me in twitter ! I will also be in the SSNLP 2023 event!