---
library_name: transformers
tags:
- From_Scratch
- Custom
- YSNRFD
- ysnrfd
- LLM
- ysnrfd
- Persian_LLM
- English_LLM
datasets:
- wikitext2-raw-v1
language:
- fa
- en
---


# REPORT ANY PROBLEMS IN MODEL LOADING AND INFERENCE

## Model Details   

**WARNINNGS:** This Model IS **Pre-Trained**, **in the future will be finetuned**.

- I changed the number of attention heads from **12** to **32**.  
- The **hidden size** remains unchanged.  
- The **tokenizer** must be updated.


### Model Description

The First Persian LLM By YSNRFD, This Model support Only English text Inputs, In The Future I Want Add Persian Language Support.

- **Developed by:** ysnrfd
- **Funded by:** ysnrfd
- **Shared by:** ysnrfd
- **Model type:** LLM
- **Language(s) (NLP):** English
- **License:** ysnrfd LICENSE


# INFERENCE
## The package below must be installed in this exact way. I'm sorry that I couldn’t upload it to PyPI.
```python
# GITHUB REPO
PUBLIC_WHL_URL="https://github.com/ysnrfd/ysnrfd_architecture_whl/releases/download/WHL/ysnrfd-1.1.0-py3-none-any.whl"

# PACKAGE INSTALL
!pip install $PUBLIC_WHL_URL
```
## CODE FOR START TEXT GENERATION
```python
import torch
from transformers import AutoTokenizer, PreTrainedTokenizer
import logging

# Logging settings
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

# ----------------------------------------------------
# 1. Importing architecture classes (from your locally installed package)
# ----------------------------------------------------
# This line loads Ysnrfd classes from your installed package (.whl), without needing the source code.
try:
    from ysnrfd import YsnrfdConfig, YsnrfdForCausalLM
    logger.info(" Ysnrfd classes were successfully loaded from the local package.")
except ImportError:
    logger.error(" Import error: Make sure the 'ysnrfd' package (.whl file) is installed.")
    # You may stop the program here
    exit()

# ----------------------------------------------------
# 2. Setting the model identifier (Hugging Face Hub)
# ----------------------------------------------------
# This is the trained model identifier and its weights.
HF_MODEL_ID = "ysn-rfd/ysnrfd-base-V2"

# Device selection (GPU or CPU)
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
logger.info(f"Using device: {device}")

# ----------------------------------------------------
# 3. Loading the tokenizer and configuration
# ----------------------------------------------------
try:
    # Load tokenizer using AutoTokenizer standard
    tokenizer: PreTrainedTokenizer = AutoTokenizer.from_pretrained(HF_MODEL_ID)
    
    # Load config.json from the Hub and inject it into the custom YsnrfdConfig class
    # This class comes from your package, but the values are read from the Hub's config.json.
    config: YsnrfdConfig = YsnrfdConfig.from_pretrained(HF_MODEL_ID)
    
    # Set pad_token_id if not defined (required for generation)
    if tokenizer.pad_token_id is None:
        tokenizer.pad_token_id = tokenizer.eos_token_id
    
    logger.info(f" Configuration and tokenizer loaded successfully. Config type: {type(config)}")

except Exception as e:
    logger.error(f" Error loading configuration/tokenizer from the Hub: {e}")
    exit()


# ----------------------------------------------------
# 4. Loading the model and weights
# ----------------------------------------------------
try:
    # Load model weights from the Hub into your custom architecture class
    model: YsnrfdForCausalLM = YsnrfdForCausalLM.from_pretrained(HF_MODEL_ID, config=config)
    model.to(device)
    model.eval()
    logger.info(" Model and weights loaded successfully and moved to device.")
except Exception as e:
    logger.error(f" Error loading model from the Hub: {e}")
    exit()

# ----------------------------------------------------
# 5. Running Inference (text generation)
# ----------------------------------------------------
prompt = "On a sunny day in Tehran,"
logger.info(f"\n--- Starting text generation ---")
logger.info(f"Prompt: {prompt}")

# Tokenize the input
input_ids = tokenizer.encode(prompt, return_tensors="pt").to(device)

# Run generation
with torch.no_grad():
    outputs = model.generate(
        input_ids,
        max_length=100,
        do_sample=True,
        top_p=0.9,
        temperature=0.7,
        pad_token_id=tokenizer.pad_token_id
    )

# Decode output
generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)

logger.info("\n--- Final Model Output ---")
print(generated_text)
logger.info("--- Done ---")


# ----------------------------------------------------
# 4. MODEL WEIGTH LOADING
# ----------------------------------------------------
try:
    # loading moedl weight
    model: YsnrfdForCausalLM = YsnrfdForCausalLM.from_pretrained(HF_MODEL_ID, config=config)
    model.to(device)
    model.eval()
    logger.info(" MODEL AND WEIGHT ARE SUCCESFULY LOADED")
except Exception as e:
    logger.error(f" MODEL LOADING ERROR OF Hub: {e}")
    exit()

# ----------------------------------------------------
# 5. INFERENCE / RUN
# ----------------------------------------------------
prompt = "IN A SUNNY DAY"
logger.info(f"\n--- START TEXT GENERATION ---")
logger.info(f"پرامپت: {prompt}")

# TOKENIZING INPUTS
input_ids = tokenizer.encode(prompt, return_tensors="pt").to(device)

# RUN GENERATION
with torch.no_grad():
    outputs = model.generate(
        input_ids,
        max_length=100,
        do_sample=True,
        top_p=0.9,
        temperature=0.7,
        pad_token_id=tokenizer.pad_token_id
    )

# decoding output
generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)

logger.info("\n--- FINAL MODEL OUTPUT ---")
print(generated_text)
logger.info("--- END ---")
```


### Training Data

wikitext2-raw-v1


#### Training Hyperparameters

- **Training regime:** float32


## Evaluation

Not Yet

### Testing Data, Factors & Metrics

ysnrfd en testing data

#### Testing Data

Not Yet


#### Summary

The Fisrt Persian LLM Trained From Scratch (Size Like SLM)


- **Hardware Type:** Nvidia Tesla T4
- **Hours used:** 3H
- **Cloud Provider:** Google Colab

### Model Architecture and Objective

YSNRFD Architecture

#### Hardware

Nvidia Tesla T4

#### Software

Python Code, From Scratch, Pytorch