The datasets used in our experiments are available in the data/ directory of this repository, formatted in MTEB style (i.e. json lines). We provide code to generate the LIMIT style datasets, as well ...