Skip to content
View BillChan226's full-sized avatar
🐝
learning
🐝
learning

Highlights

  • Pro

Organizations

@AI-secure

Block or report BillChan226

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BillChan226/README.md

Hi there, I'm Zhaorun Personal Website 👋

Connect with me:

HZ HZ | GoogleScholar HZ | Twitter


🏖️ My Research Interests

  • Trustworthy deployment and safe interactions with large foundation models and agents from both a theoretical and empirical perspective.
  • enhancing LLM's trustworthiness via retrieval-augmented generation (RAG) and robustness certificates for hallucination, alignment, jailbreaks and privacy.

GitHub stats Language Stats

Pinned Loading

  1. SafeWatch Public

    [ICLR 2025] Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations"

    Python 29

  2. AI-secure/AgentPoison Public

    [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

    Python 111 13

  3. HALC Public

    [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"

    Python 86 1

  4. MJ-Bench/MJ-Bench Public

    Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

    Jupyter Notebook 43 5

  5. AI-secure/MMDT Public

    Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models

    Jupyter Notebook 18 2

391 contributions in the last year

Contribution Graph
Day of Week April May June July August September October November December January February March
Sunday
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More

Activity overview

Contributed to BillChan226/MJ-Bench, AI-secure/AgentPoison, MJ-Bench/MJ-Bench.github.io and 19 other repositories
Loading A graph representing BillChan226's contributions from April 07, 2024 to April 11, 2025. The contributions are 98% commits, 2% issues, 0% pull requests, 0% code review.   Code review 2% Issues   Pull requests 98% Commits

Contribution activity

April 2025

Created 2 repositories
Loading