synthesizability-PN-prediction-balance-tpr-tnr
Collection
3 items
•
Updated
This repository contains the LoRA adapter for a Qwen3-14B model fine-tuned to classify chemical synthesizability (P = synthesizable, N = unsynthesizable). Training uses a P/N-only focal loss. Prompts follow this template:
You are a materials science assistant. Given a chemical composition, answer only with 'P' (synthesizable/positive) or 'N' (non-synthesizable/negative)." Correspondingly, each user query was formatted as: "Is the material {composition} likely synthesizable? Answer with P (positive) or N (negative).
The base checkpoint is Unsloth’s 4-bit MXFP4 build (unsloth/Qwen3-14B-unsloth-bnb-4bit). Attaching this adapter reproduces the best validation performance among evaluated epochs (Epoch 2).
train_llm_pn.jsonl) / validation (train_llm_pn.jsonl)| Metric | Value |
|---|---|
| TPR (P Recall) | 0.9595 |
| TNR (U Specificity) | 0.9622 |
The training and validation splits combine multiple public sources and internal curation:
chat_template.jinja is included in this checkpoint).Base model
Qwen/Qwen3-14B-Base