Alexander Panfilov

kotekjedi
AI & ML interests

None yet

Recent Activity

updated a dataset 26 days ago
honeypot-redteam/strategic_lies
upvoted a paper 8 months ago
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
authored a paper 9 months ago
Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Organizations

ISTA Machine Learning and Computer Vision Lab
Charles's First Org
Honeypot Red-Team
Aletheia's Quest

authored 2 papers 9 months ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10, 2025 • 6

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs

Paper • 2509.18058 • Published Sep 22, 2025 • 12
authored a paper about 1 year ago

Capability-Based Scaling Laws for LLM Red-Teaming

Paper • 2505.20162 • Published May 26, 2025 • 4
authored a paper about 2 years ago

Provable Compositional Generalization for Object-Centric Learning

Paper • 2310.05327 • Published Oct 9, 2023