SecureAGI

Security is the core towards AGI

Cover Image for Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs

A comprehensive study analyzing over 1,400 adversarial prompts to assess the susceptibility of leading LLMs to prompt injection and jailbreak attacks, proposing layered defense strategies.

Yingjing Lu
Yingjing Lu

More Stories

Cover Image for A Systematic Evaluation of Jailbreak Risks in Large Language Models

A Systematic Evaluation of Jailbreak Risks in Large Language Models

This study introduces a comprehensive benchmark to assess the vulnerability of LLMs to jailbreak attacks, revealing their persistent weaknesses despite increasing safety efforts.

Yingjing Lu
Yingjing Lu
Cover Image for Guided by the Machine: A Framework for Mechanistic Interpretability in Language Models

Guided by the Machine: A Framework for Mechanistic Interpretability in Language Models

A novel framework for interpretability that leverages guiding signals to reverse-engineer how transformer models represent and compute high-level behaviors.

Yingjing Lu
Yingjing Lu
Cover Image for Privacy Auditing of Large Language Models: Advancements in Canary Design

Privacy Auditing of Large Language Models: Advancements in Canary Design

This article explores innovative methodologies for enhancing privacy audits in large language models through improved canary generation techniques.

Yingjing Lu
Yingjing Lu
Cover Image for LLM Security: A Comprehensive Survey of Vulnerabilities, Attacks, and Defenses

LLM Security: A Comprehensive Survey of Vulnerabilities, Attacks, and Defenses

An in-depth look at the evolving security landscape of Large Language Models, highlighting key vulnerabilities, attack vectors, and defense mechanisms.

Yingjing Lu
Yingjing Lu
Cover Image for DeepSeek V3 Deep Dive: Training Methodologies and Their Impact

DeepSeek V3 Deep Dive: Training Methodologies and Their Impact

In this article we are deep diving into Deep Seek V3's training methodologies that makes it efficient to train

Yingjing Lu
Yingjing Lu
Cover Image for DeepSeek V3 Intro: Ground breaking does not cost a lot of money as people would think

DeepSeek V3 Intro: Ground breaking does not cost a lot of money as people would think

Intro to Deep Seek V3, a new state of the art LLM that does not cost a lot of money to train. Let's dive into what it is for this article

Yingjing Lu
Yingjing Lu
Cover Image for LLM Security Issues - Misinformation and Social Engineering

LLM Security Issues - Misinformation and Social Engineering

Large Language Models (LLMs) have transformed the technological landscape, finding applications in everything from customer support and creative writing to research assistance and programming. However, their ubiquity also exposes them to significant security risks. Attackers can manipulate these models in subtle but impactful ways, undermining their reliability and potentially causing real-world harm. This article examines the primary security vulnerabilities in LLMs, provides concrete examples of attacks, and discusses mitigation strategies.

Yingjing Lu
Yingjing Lu
Cover Image for LLM Security Issues - Model Manipulation

LLM Security Issues - Model Manipulation

Large Language Models (LLMs) have transformed the technological landscape, finding applications in everything from customer support and creative writing to research assistance and programming. However, their ubiquity also exposes them to significant security risks. Attackers can manipulate these models in subtle but impactful ways, undermining their reliability and potentially causing real-world harm. This article examines the primary security vulnerabilities in LLMs, provides concrete examples of attacks, and discusses mitigation strategies.

Yingjing Lu
Yingjing Lu
Cover Image for LLM Security Issues - An Overview

LLM Security Issues - An Overview

Large Language Models (LLMs) like OpenAI’s GPT series, Google’s Bard, and Meta’s LLaMA have revolutionized the way humans interact with artificial intelligence (AI). However, as their capabilities grow, so do the potential security vulnerabilities they introduce. This article explores the primary security concerns associated with LLMs, organized into key categories.

Yingjing Lu
Yingjing Lu
Cover Image for Unlocking the Potential of Multi-Modal Large Language Models: A Comprehensive Guide to Training with Text, Images, and Voice

Unlocking the Potential of Multi-Modal Large Language Models: A Comprehensive Guide to Training with Text, Images, and Voice

Multi-modal large language models (LLMs) promise groundbreaking advancements by integrating text, image, and voice data into unified AI systems. This article explores the essential steps, techniques, and challenges involved in training such sophisticated models.

Yingjing Lu
Yingjing Lu
Cover Image for Demystifying Large Language Models: Exploring Different Types and Their Applications

Demystifying Large Language Models: Exploring Different Types and Their Applications

Large Language Models (LLMs) are revolutionizing the way we interact with technology, but their diversity can be overwhelming—this guide breaks down the different types of LLMs, their unique strengths, and practical applications.

Yingjing Lu
Yingjing Lu
Cover Image for The Dark Side of AI Language Models: Understanding the Security Risks

The Dark Side of AI Language Models: Understanding the Security Risks

As AI language models become more advanced and widely used, it's crucial to understand the potential security risks they pose. From personal information leaks to copyright infringement within their training data, these models can have unintended consequences that you should be aware of.

Yingjing Lu
Yingjing Lu
Cover Image for Enhancing Your Daily Life and Work Leveraging Potential of Generative AI

Enhancing Your Daily Life and Work Leveraging Potential of Generative AI

By embracing the power of generative AI, you can unlock new possibilities, enhance your skills, and achieve greater success in your personal and professional endeavors. This article list out some ways you can leverage to improve your life and work.

Yingjing Lu
Yingjing Lu
Cover Image for What is: Large Language Models and Generative AI

What is: Large Language Models and Generative AI

Large language models and generative AI are being used in everything from chatbots and virtual assistants to content creation and language translation. Soon, you might find yourself having a heart-to-heart with your smartphone, getting writing tips from your computer, or even watching a movie script written entirely by AI!

Yingjing Lu
Yingjing Lu