Discussion
Loading...

Post

Log in
  • About
  • Code of conduct
  • Privacy
  • About Bonfire
Casey
Casey
@[email protected]  ·  activity timestamp last week

Introducing WebAccessBench, a novel benchmark for AI language models to assess #accessibility quality and WCAG conformance in generated web interfaces under realistic prompting conditions.

I did a bit of research and found that LLMs are incredibly bad at basic digital accessibility tasks. You can compare models and read the full white paper at https://conesible.de/wab.

Overall data suggests massive implications for society at large, and major discrimination of people with disabilities. #a11y

A sharepic that lists all benchmarked models and their score in a bar chart. Find them listed at https://conesible.de/wab. Beneath is a preview of the whitepaper PDF.
A sharepic that lists all benchmarked models and their score in a bar chart. Find them listed at https://conesible.de/wab. Beneath is a preview of the whitepaper PDF.
A sharepic that lists all benchmarked models and their score in a bar chart. Find them listed at https://conesible.de/wab. Beneath is a preview of the whitepaper PDF.
Accessibility Is Civil Rights. AI Must Stop Shipping Barriers.
  • Copy link
  • Flag this post
  • Block

bonfire.mavnn.eu

News and community around mavnn.eu projects.

bonfire.mavnn.eu: About · Code of conduct · Privacy ·
Bonfire social · 1.0.1 no JS en
Automatic federation enabled
Log in
  • Explore
  • About
  • Public Groups
  • Code of Conduct