AI Tech Suite

Trojan Detection Challenge 2023 (LLM Edition)

Click to visit website

About

The Trojan Detection Challenge 2023 (LLM Edition) is a NeurIPS 2023 competition focused on advancing methods for detecting hidden functionality in large language models (LLMs). The competition features two tracks: Trojan Detection and Red Teaming. The Trojan Detection Track challenges participants to identify triggers for hidden behaviors (trojans) in LLMs. The Red Teaming Track tasks participants with developing automated methods to elicit specific undesirable behaviors. There's a $30,000 prize pool, and winning teams will be invited to co-author a publication and present at a NeurIPS workshop. The competition uses open-source LLMs (Pythia and Llama-2-chat) and encourages participants to share their methods.

Features

• red teaming

• competition

• llm safety

• trojan detection

FAQs

What are the current rules?

The rules are available here: [Here](index.html#rules).

Can the organizers change the rules?

Yes, with participant consent if urgently needed.

How do I contact the organizers?

Who can participate in the competition?

The competition is open to the public.

When is the deadline to register?

You can register anytime during the competition.

How many people can I have in my team?

Teams can have any number of members. Solo teams are allowed.

Where can I download data and submit results?

See the [Getting Started](start.html) page.

How many submissions can each team enter per competition track?

5 submissions per day during validation, 5 total in the test phase. Only one account per team.

Are participants required to share the details of their method?

We encourage sharing; winning teams must share with organizers.

What are the details for the Trojan Detection Track?

Details are here: [Here](tracks.html#trojan-detection).

What are the details for the Red Teaming Track?

Details are here: [Here](tracks.html#red-teaming).

Why are you using the baselines you have chosen?

The baselines are well-known text optimization and red teaming methods from the academic literature.

Why are you using the LLMs you have chosen?

For Trojan Detection, open-source Pythia LLMs are used for broader participation; Llama-2-chat for Red Teaming due to robustness.

Why are you using the particular trojan attack you have chosen?

We use the simplest trojan attack for its resemblance to the red teaming task, fostering connections between communities.

Is it "trojans" or "Trojans"?

Both are used in the literature; "trojans" is used for better flow.

Job Opportunities

There are currently no job postings for this AI tool.

Social Media

Average Rating: 0.0

5 Stars:

0 Ratings

4 Stars:

0 Ratings

3 Stars:

0 Ratings

2 Stars:

0 Ratings

1 Star:

0 Ratings

User Ratings

No ratings available.

Sign In to Rate this Tool

1 Star2 Stars3 Stars4 Stars5 Stars

Featured Tools

HeyHoney

NSFW AI sex chatbot offering both free and premium subscriptions.

View Details

NoFilterGPT

Anonymous, uncensored AI chat with AES encryption and no logs. Offers free and pro plans.

View Details

Wayin AI

Wayin AI summarizes videos, supports multiple languages, and allows interactive Q&A via chatbot and screenshot queries.

View Details

CapMonster Cloud

Automated CAPTCHA recognition service using AI.

View Details

Pokecut

Pokecut is a free AI-powered photo editor with tools for background removal, changing, and enhancement. Pro plans offer extra features and credits.

View Details

GitGab

Connect your Github repos to ChatGPT & Claude for code assistance, bug finding, and documentation. Free trial available.

View Details

TryNectar AI

Create and interact with a customizable AI girlfriend. Features include AI chat, roleplay, and image generation. NSFW content available.

View Details

Smart Cookie Trivia

A trivia website with questions in multiple categories. Play now and expand your knowledge!

View Details

Arbor

Arbor is an automated carbon accounting platform that helps businesses measure, analyze, and reduce their product's carbon footprint quickly and accurately.

View Details

PhotoLog

PhotoLog offers secure, client-side encrypted media storage with mini-site creation, easy sharing, and various storage plans.

View Details

Apptest.ai

AI-powered mobile app testing platform with a test automation cloud (Ptero) and a no-code test scenario authoring tool (Stego).

View Details

Saner.AI

AI-powered productivity assistant for ADHD and knowledge workers, centralizing notes, tasks, and AI tools to enhance focus and efficiency.

View Details

Trojan Detection Challenge 2023 (LLM Edition)

Click to visit website

About

Platform

Keywords

Task

Features

FAQs

What are the current rules?

Can the organizers change the rules?

How do I contact the organizers?

Who can participate in the competition?

When is the deadline to register?

How many people can I have in my team?

Where can I download data and submit results?

How many submissions can each team enter per competition track?

Are participants required to share the details of their method?

What are the details for the Trojan Detection Track?

What are the details for the Red Teaming Track?

Why are you using the baselines you have chosen?

Why are you using the LLMs you have chosen?

Why are you using the particular trojan attack you have chosen?

Is it "trojans" or "Trojans"?

Job Opportunities

Social Media

User Ratings

Sign In to Rate this Tool

Featured Tools

HeyHoney

NoFilterGPT

Wayin AI

CapMonster Cloud

Pokecut

GitGab

TryNectar AI

Smart Cookie Trivia

Arbor

PhotoLog

Apptest.ai

Saner.AI

Trojan Detection Challenge 2023 (LLM Edition)

Click to visit website

About

Platform

Keywords

Task

Features

FAQs

What are the current rules?

Can the organizers change the rules?

How do I contact the organizers?

Who can participate in the competition?

When is the deadline to register?

How many people can I have in my team?

Where can I download data and submit results?

How many submissions can each team enter per competition track?

Are participants required to share the details of their method?

What are the details for the Trojan Detection Track?

What are the details for the Red Teaming Track?

Why are you using the baselines you have chosen?

Why are you using the LLMs you have chosen?

Why are you using the particular trojan attack you have chosen?

Is it "trojans" or "Trojans"?

Job Opportunities

Social Media

User Ratings

Sign In to Rate this Tool

Featured Tools

HeyHoney

NoFilterGPT

Wayin AI

CapMonster Cloud

Pokecut

GitGab

TryNectar AI

Smart Cookie Trivia

Arbor

PhotoLog

Apptest.ai

Saner.AI