Model-Written Evaluation Datasets
This repository includes datasets written by language models, used in our paper on "Discovering Language Model Behaviors with Model-Written Evaluations."
*.ipynb
> You can now configure and run Evals directly in the OpenAI Dashboard. [Get started →](https://platform.openai.com/docs/guides/evals)
Copyright (c) 2023 OpenAI
For a more in-depth look at our security policy, please check out our [Coordinated Vulnerability Disclosure Policy](https://openai.com/security/disclosure/#:~:text=Disclosure%20Policy,-Security%20is%20essential&text=OpenAI%27s%20coordinated%20vulnerability%20disclosure%20policy,expect%20from%20us%20
- repo: https://github.com/pre-commit/mirrors-mypy
evals.egg-info/
[DOI: 10.5281/zenodo.10256836](https://doi.org/10.5281/zenodo.10256836)
exclude: ^tests/testdata/
.DS_Store
Copyright (c) 2020 EleutherAI
* fix image upscale on cpu ([#16275](https://github.com/AUTOMATIC1111/stable-diffusion-webui/pull/16275))
*.ckpt
A web interface for Stable Diffusion, implemented using the Gradio library.
channels:
[MESSAGES CONTROL]
extensions-disabled
9c54b78d9dde5601e916f308d9a9d6953ec39430
<sup>Special thanks to:</sup>
/extensions
[Documentation](https://llm.mlc.ai/docs/)
repos:
disable=too-many-positional-arguments,duplicate-code