For reference, our API is detailed here.
We recommend using completely_generated_prob
to understand whether a document was completely generated by AI. On our validation dataset, here is how the results change when you set all documents with completely_generated_prob
under the threshold as human, and above as AI:
- At a threshold of 0.65, 85% of AI documents are classified as AI, and 99% of human documents are classified as human
- At a threshold of 0.16, 96% of AI documents are classified as AI, and 96% of human documents are classified as human
We recommend using a threshold of 0.65 or higher to minimize the number of false positives, as we think it is currently more harmful to falsely detect human writing as AI than vice versa.