IBM

company

Verified

https://www.ibm.com/

AI & ML interests

Enterprise AI and ML, Foundation Models, Responsible AI

Recent Activity

DhavalPatel submitted a paper 2 days ago

DiagnosticIQ: A Benchmark for LLM-Based Industrial Maintenance Action Recommendation from Symbolic Rules

DhavalPatel submitted a paper 5 days ago

SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks

LeoYML submitted a paper about 2 months ago

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

View all activity

Papers

SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks

DiagnosticIQ: A Benchmark for LLM-Based Industrial Maintenance Action Recommendation from Symbolic Rules

View all Papers

ibm 's Spaces 7

BenchBench Leaderboad

Compare benchmarks for language models

Unitxt

Risk Atlas Nexus

Evaluate AI risks with common risk taxonomies

JuStRank

Display ranked LLM judges based on performance metrics

README

Biomed-multi-alignment unified demo with PPI and TDI examples

Demo for MAMMAL approch on multiple domains

Llm Rank Themselves

Rank and compare language models using benchmarks