GPT Reddit Comment Detection
Collection
Collection of datasets and models used for detecting LLM bots on reddit. • 6 items • Updated • 1
How to use trentmkelly/slop-detector with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="trentmkelly/slop-detector") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("trentmkelly/slop-detector")
model = AutoModelForSequenceClassification.from_pretrained("trentmkelly/slop-detector")Old model. Please use https://huggingface.co/trentmkelly/slop-detector-mini-2 instead. Faster and more accurate, trained on more data.
loss: 0.03548985347151756
f1: 0.9950522264980759
precision: 0.9945054945054945
recall: 0.9955995599559956
auc: 0.9997361672360855
accuracy: 0.995049504950495
I trained this on a bunch of top-level comments on reddit. Human class was the real responses to selfposts in various subs, and the LLM class was a response from one of several LLMs to the same post. I am tired of reading fucking GPT-slop comments on reddit.
Converted ONNX model is available for compatibility with transformers.js. Browser extension and mini version coming soon.
Base model
thenlper/gte-base