Skip to content
@TrustAIRLab

TrustAIRLab

GitHub Org's stars

TrustAIRLab (Trustworthy AI Research Lab) is a research lab dedicated to the trustworthy machine learning, with a focus on safety, privacy, and security. It aims to

  • offer high-quality libraries to reduce the difficulties in algorithm reproduction

  • benchmark existing attacks and defenses on machine learning models

  • build a solid foundation for Trustworthy AI research and development

Popular repositories Loading

  1. JailbreakRadar JailbreakRadar Public

    Python 74 5

  2. VoiceJailbreakAttack VoiceJailbreakAttack Public

    Code for Voice Jailbreak Attacks Against GPT-4o.

    Python 31 1

  3. JailbreakLLMs JailbreakLLMs Public

    A dataset consists of 6,387 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 666 jailbreak prompts).

    11

  4. ZeroFake ZeroFake Public

    Python 11 1

  5. Conversation_Reconstruction_Attack Conversation_Reconstruction_Attack Public

    This is the public code repository for the paper 'Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models'

    Python 9 1

  6. SecurityNet SecurityNet Public

    JavaScript 8

Repositories

Showing 10 of 24 repositories

Top languages

Loading…

Most used topics

Loading…