Dyadic: A Scalable Platform for Human-Human and Human-AI Conversation Research

cs.HC cs.AI cs.CL David M. Markowitz · Mar 23, 2026

What it does

Why it matters

It attempts to solve the methodological gap in conversation research by providing turnkey tools for experimental manipulation, live monitoring, and in-situ survey delivery during ongoing chats. The core value proposition is lowering...

Main concern

Community signal

0 up · 0 down

AI Review AI reviewed

Plain-language introduction

Dyadic is a web-based platform for studying human-human and human-AI conversations through text or voice-based interaction. It attempts to solve the methodological gap in conversation research by providing turnkey tools for experimental manipulation, live monitoring, and in-situ survey delivery during ongoing chats. The core value proposition is lowering barriers to entry for researchers studying dyadic interaction processes without requiring programming expertise.

Critical review

Verdict

Bottom line

This tutorial paper serves as a functional overview and documentation of a software platform rather than a peer-reviewed scientific evaluation. The contribution is primarily technical and infrastructural—presenting Dyadic as a practical solution for conversation researchers who lack the resources to build custom experimental software. The paper succeeds as documentation but makes broad claims about scalability, flexibility, and utility that remain unsubstantiated by empirical tests, benchmarks, or comparative studies against existing alternatives.

“The primary purpose of this tutorial is to advance conversation research in the social sciences by presenting a tool for its study.”

paper · Section 1

What holds up

The platform description reveals genuinely useful features that address practical pain points in conversation research, particularly the ability to inject surveys in situ during conversations rather than relying on post-hoc recall. The slot-based architecture enabling within-room role manipulations is technically sound, and the inclusion of response latency metrics ($t_{first\,keystroke}$, $t_{reply}$) and typing behavior logs demonstrates awareness of granular behavioral measures valued in interaction research. The commitment to browser-based deployment without software installation removes friction for participant recruitment.

“A more precise and granular way to evaluate how people think and feel about a conversation, their partner, or their experience is to measure these dynamics throughout the conversation. Dyadic allows researchers to embed surveys directly within a chat session.”

paper · Section 3.3

“Response latency — first keystroke: Time from message delivery to recipient's first keystroke logged per message.”

paper · Section 3.4

Main concerns

The paper commits an important category error: presenting documentation as if it were empirical scholarship. There is no validation that Dyadic actually improves research quality, reduces costs, or enables discoveries that extant tools cannot. Claims of scalability are asserted without performance benchmarks or stress-testing data under high concurrency. The reproducibility concerns regarding API-dependent AI are noted but dismissed rather than addressed—changes to models or discontinuation of services could render studies irreproducible, yet the platform offers no versioning safeguards or local model hosting solutions except vague references to Hugging Face. The tool is commercial-adjacent (requiring paid API keys for full functionality) yet presented as a public good without acknowledging vendor lock-in risks.

Furthermore, claims about mobile compatibility are hedged with explicit warnings that the mobile experience has not been fully optimized, yet the paper simultaneously markets this as a feature. The security discussion focuses on transport encryption and password hashing but omits whether conversation data are encrypted at rest, where servers are physically located, or GDPR/HIPAA compliance status—critical omissions for sensitive conversational data collection.

“Currently, Dyadic is also best suited for desktop studies; hopefully, future iterations of the tool will build out a phone-specific interface.”

paper · Section 5

“It is important to note that access is not inherently free for API use. On a complementary note, changes to the APIs or models could limit replicability or reproducibility.”

paper · Section 5

Evidence and comparison

The evidence provided is descriptive rather than comparative or evaluative. No experiments demonstrate that studies conducted via Dyadic yield different or superior outcomes compared to existing platforms (e.g., oTree, jsPsych, custom Lab.js implementations, or even Qualtrics chat widgets). The table comparing features against unstated alternatives is incomplete without identifying what those alternatives are. The paper cites the need to bridge siloed approaches between AI-mediated communication and psychology of language research (Boyd & Markowitz, 2025, 2026), but offers no empirical demonstration that this bridging actually occurs in practice using Dyadic. The granularity of behavioral measures (typing latencies, mouse clicks) is claimed to facilitate linking verbal behavior to interpersonal outcomes, but no example analyses or case studies demonstrate this linkage.

“to bridge traditionally siloed approaches that study interaction (e.g., those within computer-mediated communication, AI-Mediated Communication and psychology of language research)”

paper · Section 1

“Response latency — reply send: Time from message delivery to when the recipient sends their reply.”

paper · Table 1

Reproducibility

Reproducibility is compromised by several factors. First, Dyadic is a hosted proprietary platform—not open source—meaning researchers cannot inspect the code, host their own instances, or verify that experimental logic executes as described. The author admits the platform is AI-augmented (coded with Claude Code), introducing opacity about the software's decision-making processes. Second, AI-mediated studies depend on remote API services (OpenAI, Anthropic, Google) where model weights, behaviors, and availability change without notice; as the paper acknowledges, changes to APIs could limit reproducibility (Gundersen & Kjensmo, 2018). Third, while data exports are available in CSV format, the platform lacks preregistration integration, version control for study configurations, or standardized metadata schemas. Hyperparameters for AI behaviors (system prompts, response delays) are configurable but not automatically archived in a machine-readable format, placing the burden on researchers to manually document these settings in methods sections.

“Obtaining an API token is free but using the API costs money for certain platforms... changes to the APIs or models could limit replicability or reproducibility”

paper · Section 5

“Initial underlying code was written by him and significantly augmented by Claude Code.”

paper · Section 6

Abstract

Conversation is ubiquitous in social life, but the empirical study of this interactive process has been thwarted by tools that are insufficiently modular and unadaptive to researcher needs. To relieve many constraints in conversation research, the current tutorial presents an overview and introduction to a new tool, Dyadic (https://www.chatdyadic.com/), a web-based platform for studying human-human and human-AI conversations using text-based or voice-based chats. Dyadic is distinct from other platforms by offering studies with multiple modalities, AI suggestions (e.g., in human-human studies, AI can suggest responses to a participant), live monitoring (e.g., researchers can evaluate, in real time, chats between communicators), and survey deployment (e.g., Likert-type scales, feeling thermometers, and open-ended text boxes can be sent to humans for in situ evaluations of the interaction), among other consequential features. No coding is required to operate Dyadic directly, and integrations with existing survey platforms are offered.

Challenge the Review

Pick a starting point or write your own. Challenges run in the background, so you can keep reading while the AI investigates.

Challenges are public to read, but only signed-in members can post them. Your challenge text is stored with your account for moderation, but usernames are not shown in the public thread.

No challenges yet. Disagree with the review? Ask the AI to revisit a specific claim.