1. Introduction
The design of dexterous robotic hands has advanced considerably over the past two decades, yet a fundamental methodological challenge persists. The prevailing design paradigm remains predominantly empirical: engineers iteratively refine prototypes through intuition and trial-and-error, a process ill-suited to the high-dimensional, tightly constrained design space that dexterous hands occupy [1,2,3]. This approach inevitably embeds compromises among critical performance indicators, and as a result, existing dexterous hands tend to cluster into two distinct design philosophies. One category prioritizes anthropomorphic fidelity, emphasizing kinematic similarity and a high number of degrees of freedom (DoF), but often at the expense of dynamic speed and responsiveness [4]. The other achieves impressive speed or force output through compact, simplified mechanisms, yet sacrifices human-like scale, dexterous versatility, and coordinated multi-finger control [5,6]. Bridging this divide by achieving a design that simultaneously maintains human-like size and dexterity while substantially elevating dynamic performance has remained an open and difficult problem.
Recent advances in biomimetic structures, lightweight materials, actuation modalities, and transmission mechanisms have significantly broadened the accessible design space [7,8,9,10,11,12]. However, these innovations have largely been absorbed into the same forward-design framework: a specific configuration is proposed first, and its performance is evaluated afterward. What remains absent is a mature methodology capable of moving in the reverse direction, starting from explicit, quantitative dynamic performance targets and systematically deriving the optimal combination of actuation, transmission, and structural parameters. To address this gap, our group previously proposed a data-driven inverse design framework [13]. By aggregating design variables and performance metrics from a large corpus of existing dexterous hands, we constructed a “design variables–performance metrics” association map using Cramér’s V correlation analysis. This map provides quantitative, performance-targeted guidance for hardware design decisions. However, the framework was proposed only as a concept: it has not been validated on a complete humanoid dexterous hand system, and it remains untested whether physical hardware designed with it can approach or match human-level dynamic performance on structured high-frequency tasks.
The present study provides this critical validation. We apply the data-driven inverse design framework to develop, for the first time, a fully anthropomorphic dexterous hand optimized specifically for high-frequency dynamic performance. Starting from the quantitative association map, we identify an optimal region of the design space and translate it into a concrete hardware configuration. The resulting system, termed the Beyond Hand, retains a human-like 20-DoF architecture and anthropomorphic scale while achieving dynamic capabilities that, under the specific test conditions (rhythm-game and Tetris tasks), are comparable to or better than the best performance observed in our intermediate-level human participant cohort (n = 5) in terms of task completion speed and timing consistency. Its design embodies a biomimetics-informed engineering strategy: the key features independently identified through data-driven analysis (tendon-driven force transmission, proximal actuator placement, and lightweight skeletal structures) closely mirror the biomechanical organization of the human hand [14,15,16]. This convergence between statistical optimization and biological evolution suggests that data-driven inverse design can recover principles that nature itself has refined over millions of years, providing an independent line of evidence to support biomimetic design decisions.
The main contributions of this work are as follows:
First system-level validation of data-driven inverse design. We demonstrate that a quantitative design-performance association map can successfully guide the complete hardware development of a humanoid dexterous hand, translating explicit dynamic performance targets into an optimal physical configuration.
Development of the Beyond Hand platform. The resulting hand is a compact, 20-DoF anthropomorphic system at human scale, with all actuators placed in the forearm to minimize distal inertia. On the structured rhythm-game and Tetris tasks, its high-frequency dynamic capability is comparable to the best observed performance of our intermediate-level human participant cohort (n = 5).
Rigorous experimental benchmarking against human baselines and existing designs. We evaluate the hand through standardized frequency-response characterization across all 15 finger joints, rhythm-game tasks demanding millisecond-level multi-finger coordination at frequencies up to 12 Hz, and Tetris-style single-finger manipulation tests under a fixed wrist. On these structured tasks, the hand shows clear advantages in speed, timing accuracy, and high-frequency stability relative to our intermediate-level human participant cohort (n = 5) and established dexterous hand designs.
The remainder of this paper is organized as follows.
Section 2 describes the materials and methods, including the data-driven inverse design framework, the biomimetics-informed derivation of the Beyond Hand configuration, and the experimental protocols for performance evaluation.
Section 3 presents the results, covering the association map, the design outcome, standardized performance metrics, and high-dynamic manipulation tests.
Section 4 discusses the implications of the convergence between data-driven and evolutionary design principles, the specific dimensions in which the Beyond Hand extends beyond biological capabilities, and the limitations and future directions of this work.
Section 5 concludes the paper.
2. Materials and Methods
2.1. Data-Driven Inverse Design Framework
This work builds upon a dataset established in our prior study [13]. We have now expanded this dataset through a systematic search across Google, Google Scholar, ScienceDirect, IEEE Xplore, and Scopus, covering more than 370 articles. After screening for relevance, data completeness, and cross-validation across multiple sources, a total of 77 distinct dexterous hands were included in the final statistical analysis. The inclusion criteria required that each hand be either commercially available or described in sufficient detail in the literature, and that at least partial quantitative or semi-quantitative information could be extracted.
For data normalization, where different units or measurement methods were reported, we converted all values to consistent metric units (e.g., Newtons for force, degrees per second for speed, kilograms for weight, and dimensionless ratios for compactness). For missing and inconsistent data across sources, we applied a three-step strategy: first, reasonable completion based on similar designs or measurements taken from published diagrams; second, exclusion of data points with excessive missing information or values that could not be reasonably inferred; third, merging of duplicated information from different sources to ensure consistency. To minimize dataset bias, we included all identifiable dexterous hands that met the inclusion criteria, without any selective omission based on design preferences or performance outcomes.
The resulting dataset, comprising 77 distinct dexterous hands, systematically documents key design variables (the number of DoF, driving form, transmission mode, structural form, and structural material) alongside five performance metrics: maximum speed, fingertip force, compactness, weight, and maximum single-finger motion range. The exact composition of the 77 hands and their respective data quality are shown in the table in Part 3 of the Supplementary Video S1.
To analyze the associations between design variables and performance metrics, we conducted a quantitative correlation analysis using Cramér’s V, a statistical measure appropriate for categorical data [13]. Prior to this analysis, all continuous performance metrics were converted into two discrete categories based on functionally meaningful thresholds derived from general data on human hand performance: fingertip force was classified as <12 N or ≥12 N; maximum speed as <200°/s or ≥200°/s; weight as <0.5 kg or ≥0.5 kg for hands without forearms, and as <1.5 kg or ≥1.5 kg for integrated hand-forearm systems; maximum single-finger motion range as below the human level or at/above it; and compactness (defined as the finger length-to-thickness ratio) as <5.5 or ≥5.5. These thresholds are grounded in typical human hand capabilities reported in the literature. We acknowledge that human hand performance varies considerably across individuals, and therefore the exact boundary between “low” and “high” categories is inherently somewhat uncertain. To evaluate the potential influence of threshold selection on the correlation results, we conducted a sensitivity analysis by varying each threshold within a reasonable range (e.g., ±10–20% of the original value) and recalculating Cramér’s V. The results showed that the strength and direction of the correlations remained consistent. Thus, the main conclusions of our correlation analysis are not sensitive to the exact choice of cutoff values.
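The binarization, Cramér’s V calculation, and threshold sweep described above can be sketched as follows. This is a minimal illustration rather than the analysis code used in the study: the function names `cramers_v` and `binarize` are hypothetical, no bias correction is applied, and the eight records are invented placeholders, not entries from the 77-hand dataset.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Cramér's V for two equal-length sequences of categorical labels."""
    n = len(xs)
    row = Counter(xs)
    col = Counter(ys)
    joint = Counter(zip(xs, ys))
    k = min(len(row), len(col))
    if n == 0 or k < 2:
        return 0.0  # undefined with a single category; reported as 0 here
    chi2 = 0.0
    for r, n_r in row.items():
        for c, n_c in col.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * (k - 1)))


def binarize(values, threshold):
    """Map a continuous metric to 'low' / 'high' around a threshold."""
    return ["high" if v >= threshold else "low" for v in values]


# Placeholder records (hypothetical, for illustration only).
transmission = ["tendon", "tendon", "tendon", "linkage",
                "linkage", "linkage", "tendon", "linkage"]
max_speed_deg_s = [320.0, 250.0, 410.0, 120.0, 150.0, 90.0, 180.0, 260.0]

# Nominal threshold from the text: 200 deg/s.
v_nominal = cramers_v(transmission, binarize(max_speed_deg_s, 200.0))

# Sensitivity sweep: scale the threshold by ±10% and ±20% and recompute V.
sensitivity = {
    scale: cramers_v(transmission, binarize(max_speed_deg_s, 200.0 * scale))
    for scale in (0.8, 0.9, 1.0, 1.1, 1.2)
}

print(f"V at nominal threshold: {v_nominal:.3f}")
for scale, v in sensitivity.items():
    print(f"threshold x{scale:.1f}: V = {v:.3f}")
```

With real data, the same sweep would be repeated for each performance metric and each design variable, and the resulting values compared against the nominal ones.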
Correspondingly, the key design variables were categorized as follows: the number of DoF was classified as <3 or ≥3; structural form was classified by motor location as built-in or external; driving form was classified as fully actuated or underactuated; and structural material was classified as metallic or non-metallic. Transmission modes were classified into first-level and second-level categories. The first-level mode refers to the transmission from the actuator to the finger, encompassing tendon-driven, gear-driven, ball screw-driven, and linkage-driven systems. The second-level mode refers to the transmission within the finger itself, which includes tendon-driven, gear-driven, belt-driven, and linkage-driven systems. For clarity in the subsequent analysis and visualization, transmission modes across both levels were consolidated into two representative categories: tendon-driven and linkage-driven.
The resulting correlation matrix was visualized as a Sankey diagram, which maps the couplings between design variables and performance metrics. The width of each flow corresponds to the calculated Cramér’s V value, with wider links indicating stronger correlations. Full results of this analysis are presented in Section 3.1.
2.2. Biomimetics-Informed Inverse Design
We take a high-dynamic rhythm-based game task as the specific design target. This task imposes multiple stringent requirements on the dexterous hand, including extreme speed, high acceleration, structural compactness, and wide response bandwidth—all of which correspond directly to the performance metrics analyzed in the association map.
The key design principles emerging from the data-driven analysis—the strong correlation between tendon-driven transmission and speed, the benefits of lightweight metallic structures for force output, and the trade-off between DoF count and system weight—point unequivocally toward a specific region of the design space. To understand why this particular combination of features emerges as optimal, we examine the anatomical basis of the human hand’s remarkable dynamic capabilities. The human hand achieves its combination of speed and dexterity primarily through a tendon-driven architecture: long flexor and extensor tendons, originating from muscle bellies located in the forearm, transmit forces across multiple joints to the phalanges. This arrangement confers two critical advantages. First, by placing the majority of muscle mass proximally in the forearm rather than within the hand itself, the mass and inertia of the distal segments are minimized, enabling rapid acceleration and direction changes. Second, the multi-articular routing of tendons across successive finger joints provides a natural mechanical coupling that simplifies coordinated motion patterns while maintaining compliant, energy-efficient force transmission. The convergence between these well-established biomechanical principles and the statistical patterns independently identified in our association map is striking: the data-driven analysis recovers, through purely quantitative means, the same design strategy that biological evolution arrived at over millions of years.
Guided by this dual foundation of biological insight and quantitative data, we derive the following configuration for the Beyond Hand. To achieve extreme speed, we select a tendon-driven mode as the first-level transmission, which closely mimics the long flexor tendons of the human hand and aligns with the map’s strong correlation between tendon drive and high-speed performance. To reduce transmission losses and increase structural stiffness, we adopt a linkage-driven mode as the second-level transmission. This hybrid tendon–linkage architecture parallels the human hand’s combination of flexible tendons for force transmission and rigid skeletal segments for structural support. To balance fingertip force and structural compactness, we use aluminum alloy as the main structural material, analogous to the role of bone tissue in providing lightweight, high-strength frameworks. In terms of overall configuration, we maintain a human-like 20-DoF design and—following the biological strategy of reducing distal inertia—place all actuators in the forearm, remote from the hand itself.
Table 1 summarizes the systematic mapping between these design choices and their biological counterparts [17,18].
2.3. Standardized Performance Evaluation Methods
To characterize the frequency response and key physical capabilities of the Beyond Hand, a series of standardized benchtop tests were conducted.
2.3.1. Frequency Response Characterization
Sinusoidal frequency-sweep experiments from 1 to 16 Hz were performed on all joints of the five fingers. The test encompassed the metacarpophalangeal (MCP), proximal interphalangeal (PIP), and abduction/adduction (ABAD) joints of each finger; for the thumb, the corresponding carpometacarpal (CMC), MCP, and interphalangeal (IP) joints were evaluated. Motion amplitudes were set to ±20° for abduction/adduction joints and 0–80° for flexion/extension joints, consistent across all fingers. Command trajectories were sent at 100 Hz, and joint angle sensors sampled at 160 Hz. The cutoff frequency for each joint was identified using the standard criteria of a −3 dB magnitude drop and a −90° phase lag. Each frequency sweep test was repeated three times per joint. The three runs produced highly consistent results; the best-performing run (i.e., with the most stable tracking and lowest phase lag) is presented as representative.
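As an illustration of how the two cutoff criteria can be read off a sweep, the sketch below interpolates the first frequency at which the magnitude falls to −3 dB and the first at which the phase reaches −90°. The helper names and the sample sweep values are hypothetical, and the text does not state how the two criteria were combined, so both frequencies are simply returned.

```python
from math import log10


def first_crossing(freqs, values, level):
    """Linearly interpolate the first frequency where values drop to `level`."""
    for i, v in enumerate(values):
        if v <= level:
            if i == 0:
                return freqs[0]
            f0, f1 = freqs[i - 1], freqs[i]
            v0, v1 = values[i - 1], v
            return f0 + (level - v0) * (f1 - f0) / (v1 - v0)
    return None  # the level is never reached within the sweep


def cutoff_frequencies(freqs, gain_ratio, phase_deg):
    """Return (-3 dB frequency, -90 degree frequency) for one joint."""
    mag_db = [20.0 * log10(g) for g in gain_ratio]
    return (first_crossing(freqs, mag_db, -3.0),
            first_crossing(freqs, phase_deg, -90.0))


# Hypothetical sweep for a single joint (not measured data).
freqs_hz = [1, 2, 4, 8, 10, 12, 16]
gain = [1.00, 0.99, 0.95, 0.80, 0.72, 0.65, 0.45]   # output / command amplitude
phase = [-5, -10, -22, -60, -85, -100, -130]        # degrees

f_mag, f_phase = cutoff_frequencies(freqs_hz, gain, phase)
print(f"-3 dB crossing: {f_mag:.2f} Hz, -90 deg crossing: {f_phase:.2f} Hz")
```

Linear interpolation between sweep points is one simple choice; a finer sweep or a fitted transfer function would give a less grid-dependent estimate.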
Each joint of each finger is driven by an HT2806 motor (rated torque 0.04 N·m, rated speed 1200 RPM, maximum no-load speed 2200 RPM, stall torque 0.14 N·m; Changzhou Hetai Motor & Electrical Appliance Co., Ltd., Changzhou, China). The tendon is wound around a 5 mm diameter shaft connected to the motor. A cascaded PID control structure is implemented on an STM32F407 microcontroller (STMicroelectronics, Geneva, Switzerland). The inner current loop runs at 20 kHz, the middle velocity loop at 20 kHz, and the outermost position/impedance loop at 1000 Hz. During the 1–16 Hz sweep tests, the system operated in position-controlled mode with a sinusoidal reference trajectory.
The round-trip communication latency between the host computer and the lower-level controller was measured as 3.6 ± 0.3 ms. The total system pure delay from command issuance by the host computer to joint motion feedback via the encoder was measured as 12 ± 4 ms. A proportional controller tuned for critical damping (Kp = 60) was used for the outermost position loop; the inner velocity and current loops employ PI control. No electronic damping was applied; the system relies primarily on the natural mechanical damping of the joints.
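To put the measured pure delay in context, a time delay τ adds a phase shift of −360·f·τ degrees at frequency f without changing the magnitude. The short sketch below evaluates this for the reported 12 ± 4 ms range; it assumes, for illustration only, that the whole delay appears in the measured command-to-encoder response. At the nominal 12 ms, the delay alone accounts for roughly 43° of lag at 10 Hz. The function name is hypothetical.

```python
def delay_phase_deg(freq_hz, delay_s):
    """Phase shift in degrees contributed by a pure time delay."""
    return -360.0 * freq_hz * delay_s


# Lower bound, nominal value, and upper bound of the reported delay (ms).
for delay_ms in (8.0, 12.0, 16.0):
    lags = {f: round(delay_phase_deg(f, delay_ms / 1000.0), 1)
            for f in (1, 10, 16)}
    print(f"delay {delay_ms:4.1f} ms -> phase (deg) by frequency (Hz): {lags}")
```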
2.3.2. Key Performance Metrics and Benchmarking
In addition to frequency response, the peak angular velocity of the PIP joint, maximum fingertip force under a bent posture, system weight, finger compactness ratio (finger length-to-thickness), and fingertip repeatability were measured. To contextualize these results, a systematic benchmarking was performed against both the human hand and three representative state-of-the-art dexterous hands (detailed in Section 3.3).
2.3.3. Radar-Chart Normalization Method
For the radar-chart comparison in Section 3.3.3, all six performance indicators (DoF ratio, fingertip force ratio, weight ratio, speed ratio, compactness ratio, and frequency response ratio) were normalized to the human hand as the baseline, where the human hand is assigned a value of 1. Specifically, each ratio was computed as: Ratio = (value of the dexterous hand)/(corresponding value of the human hand). No additional weighting was applied; all six metrics were treated equally to provide an unbiased, unweighted comparison across designs.
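The normalization above amounts to an element-wise division by the human reference values. A minimal sketch, with a hypothetical function name and placeholder numbers that are not the values used in the radar chart:

```python
def normalize_to_human(hand, human):
    """Divide each metric of a dexterous hand by the human reference value."""
    missing = sorted(set(human) - set(hand))
    if missing:
        raise KeyError(f"missing metrics: {missing}")
    return {name: hand[name] / human[name] for name in human}


# Placeholder values for illustration only (not data from the study).
human_ref = {"dof": 20.0, "fingertip_force": 10.0, "weight": 0.5,
             "speed": 200.0, "compactness": 5.0, "frequency_response": 5.0}
robot = {"dof": 20.0, "fingertip_force": 15.0, "weight": 1.0,
         "speed": 400.0, "compactness": 5.0, "frequency_response": 10.0}

ratios = normalize_to_human(robot, human_ref)
for name, value in ratios.items():
    print(f"{name}: {value:.2f}")
```

Because no weighting or inversion is applied, a ratio above 1 means "larger than the human value", which is favorable for metrics such as speed but not for weight.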
The benchmark dexterous hands (Inspired Hand, Shadow Hand, DLR-HIT II Hand) were tested under different experimental conditions and reported in separate literature sources. Therefore, direct comparison under identical testing protocols is not feasible. The radar chart is intended as an indicative, qualitative comparison based on the best available data reported in the literature, rather than a strictly controlled benchmarking study.
2.4. High-Dynamic Manipulation Task Design
To assess the Beyond Hand’s performance under realistic manipulation scenarios, two experimental tasks were designed, each targeting a distinct aspect of dynamic capability.
2.4.1. Rhythm-Game Task
A music rhythm-game task was employed to evaluate multi-finger coordination and high-speed dynamic response. The robotic hand was required to execute 15 different tapping patterns according to sliding-bar cues on a screen, pressing the corresponding key within a strictly constrained time window upon slider arrival at a judgment area. The absolute time deviation between each press and the target beat was categorized into four levels: within 50 ms (“Best”), 50–100 ms (“Good”), 100–150 ms (“Cool”), and more than 150 ms (“Miss”). Overall accuracy was defined as the proportion of presses rated “Best” when all presses were ideally timed. The total slider density reached up to 34 notes per second, with a maximum single-lane density of 12 notes per second. The robotic hand performed the task three independent times. The outcomes across the three runs were similar; the best result (the run with the highest overall accuracy) is reported. A video demonstration is provided in the Supplementary Materials (Video S1, part 1).
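The four timing levels can be expressed as a simple classifier on the press-to-beat deviation. The sketch below is illustrative: the function names are hypothetical, the handling of values exactly on a boundary (50, 100, 150 ms) is an assumption, and the summary statistic shown, the fraction of presses rated “Best”, is only one possible reading of the accuracy definition above.

```python
def rate_press(deviation_ms):
    """Classify one key press by its absolute deviation from the target beat."""
    d = abs(deviation_ms)
    if d <= 50:
        return "Best"
    if d <= 100:
        return "Good"
    if d <= 150:
        return "Cool"
    return "Miss"


def best_fraction(deviations_ms):
    """Fraction of presses rated 'Best' (assumed summary statistic)."""
    ratings = [rate_press(d) for d in deviations_ms]
    return ratings.count("Best") / len(ratings)


# Hypothetical deviations in milliseconds (negative = early press).
sample = [-12, 30, 48, -75, 110, 8, -160, 20]
print([rate_press(d) for d in sample])
print(f"fraction rated Best: {best_fraction(sample):.3f}")
```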
2.4.2. Tetris-Style Manipulation Task
A single-finger dynamic experiment was designed based on a Tetris game to evaluate high-speed single-finger agility and reachable workspace under a fixed wrist. The middle finger alone performed rapid presses on the “↑” key (first row) and “←” and “→” keys (second row). The block descent speed increased progressively: 150 ms per grid cell (0–30 s), 120 ms (31–45 s), 100 ms (46–60 s), and 75 ms (61–75 s). The task was repeated three times under the same conditions. The results were consistent across repetitions; the best performance (the run with the fastest and most accurate key presses) is reported. A video demonstration is provided in the Supplementary Materials (Video S1, part 2).
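The descent schedule can be written as a lookup from elapsed time to the interval per grid cell, which makes the tightening time budget explicit (from about 6.7 cells per second at the start to about 13.3 at the end). The sketch is illustrative: the names are hypothetical, and treating each stage as ending inclusively at 30, 45, 60, and 75 s is an assumption, since the text leaves the one-second gaps between stages unspecified.

```python
# (stage end time in seconds, milliseconds per grid cell)
DESCENT_SCHEDULE = [(30.0, 150), (45.0, 120), (60.0, 100), (75.0, 75)]


def descent_interval_ms(elapsed_s):
    """Return the block descent interval (ms per cell) at a given elapsed time."""
    if elapsed_s < 0 or elapsed_s > DESCENT_SCHEDULE[-1][0]:
        raise ValueError("elapsed time outside the 0-75 s task window")
    for stage_end_s, interval_ms in DESCENT_SCHEDULE:
        if elapsed_s <= stage_end_s:
            return interval_ms


def cells_per_second(elapsed_s):
    """Descent rate implied by the schedule at a given elapsed time."""
    return 1000.0 / descent_interval_ms(elapsed_s)


for t in (10, 40, 50, 70):
    print(f"t = {t:2d} s: {descent_interval_ms(t)} ms/cell, "
          f"{cells_per_second(t):.2f} cells/s")
```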
2.5. Human Participant Baseline
To benchmark the Beyond Hand against human performance, five healthy doctoral students (age range 24–27 years, mean 25.2 years) participated in the rhythm-game task. Their skill level with rhythm games was intermediate: they could achieve high scores on pieces of moderate difficulty but performed considerably worse on fast-tempo tracks.
Each participant performed 10 repeated trials of the same manipulation tasks. For each participant, the best outcome across the 10 trials was recorded as the representative human performance, as our aim was to compare the upper bound of human performance with that of the robotic system under identical task conditions. Therefore, we report the individual best performance rather than a group mean with variance.
Written informed consent was obtained from all participants prior to the experiments. The study was conducted in accordance with the ethical guidelines of our institution. Under our institutional policies, this low-risk behavioral study (non-invasive, healthy adult volunteers) was exempt from formal ethics committee review.
4. Discussion
The most salient finding of this study is the convergence between design principles identified through purely data-driven analysis and those embodied in the human hand. The Cramér’s V-based association map, constructed without biological bias, independently singled out tendon-driven transmission, proximal actuator placement, and lightweight rigid structures as dominant features for high-speed dynamic performance. That the same principles characterize a system refined by millions of years of evolution strongly suggests that the high-dimensional design space of dexterous hands is governed by fundamental physical constraints admitting only a narrow set of high-performance solutions. This convergence validates the inverse design framework and offers a broader insight: well-constructed engineering datasets can recapitulate certain outcomes of evolutionary optimization, providing independent evidence for biomimetic design.
We attempted to quantitatively compare tendon routing efficiency, inertia distribution, and energy efficiency between the human hand and the Beyond Hand, but encountered substantial practical difficulties. For tendon routing efficiency, the human hand involves complex multi-articular interactions, passive elastic tissues, and nonlinear friction properties that are extremely difficult to measure or model with sufficient accuracy. For inertia distribution, the human hand’s mass properties vary significantly across individuals and cannot be directly obtained from literature without invasive measurements. For energy efficiency, there is no established standard for comparing metabolic energy consumption in biological hands with electrical power consumption in robotic hands under equivalent tasks. Therefore, in this study we focus on demonstrating morphological and kinematic convergence (e.g., degrees of freedom, finger proportions, tendon routing topology), which are more readily comparable. Quantitative comparisons of efficiency, inertia distribution, and energy consumption remain important but challenging directions for future work.
Before interpreting the association map, we note several limitations of Cramér’s V analysis. Correlation does not imply causation. Cramér’s V only quantifies the strength of a symmetric association and does not indicate any directional or causal relationship. Therefore, the inverse design framework presented here is intended as an exploratory and descriptive tool to identify potential design–performance linkages, not as a predictive or prescriptive causal model. Moreover, given the currently limited amount of available dexterous hand data, the estimated correlations may change as more data accumulate in the future. Our findings should thus be viewed as preliminary insights.
The experimental results also clarify where engineered systems can extend beyond biological capabilities. Human participants in the rhythm-game task showed a sharp performance decline above 4–6 Hz due to neuromuscular and cognitive limits. The Beyond Hand, free of these bottlenecks, sustained >90% accuracy up to 12 Hz. The uniformly stable frequency response across all 15 joints—cutoff frequencies tightly clustered around 10 Hz—further demonstrates that proximal actuation benefits the whole hand evenly. These results do not diminish the human hand’s unmatched versatility; rather, they identify timing consistency, repetition rate, and fatigue-free operation as areas where robotic systems complement biological performance.
Limitations of the game-based validation tasks. The Tetris and rhythm-game experiments primarily evaluate repetitive, high-speed tapping rather than the full scope of dexterous manipulation, such as grasp adaptation, contact-rich interaction, or handling object uncertainty. These game-based tasks were chosen intentionally to benchmark the hand’s peak speed, precision, and motion consistency under highly dynamic but structured conditions—where the contact sequence and target positions are known in advance. They are not intended to demonstrate the hand’s capability in unstructured tasks such as adaptive grasping, in-hand manipulation, or interaction with deformable or uncertain objects. Therefore, the results should be interpreted as evidence of high-speed motor performance rather than comprehensive dexterous manipulation ability. Future work will include more ecologically valid manipulation tasks (e.g., grasping unknown objects, tool use, collaborative assembly) to fully assess practical utility.
Regarding mechanical durability, after extended operation we observed some abrasion at the pin-hole interfaces of the linkages. Nevertheless, the assembly accuracy remained within acceptable limits for the intended manipulation tasks. To mitigate wear, we inserted copper sleeves at all pin-hole joints and applied sufficient lubrication, which substantially reduces friction and slows down wear. Based on these observations, we recommend routine maintenance practices, including periodic inspection of tendon tension, re-lubrication of copper sleeves, and replacement of sacrificial wear parts after extended use.
In a related direction, origami-inspired soft robotic designs have demonstrated how simulation-driven morphological selection can achieve dynamic locomotion [26], a design paradigm that shares conceptual parallels with the data-driven inverse design framework presented in this work.
Recent advances in underactuated tendon-driven hands with integrated tactile sensing, such as the Tactile SoftHand-A, have demonstrated that antagonistic tendon mechanisms combined with 3D-printed tactile structures can achieve robust grasping across diverse objects [27]. This work provides a relevant reference for future iterations of the Beyond Hand, suggesting that tactile embodiment can be incorporated without sacrificing mechanical simplicity.
Several specific limitations point to future directions. First, the Beyond Hand lacks tactile sensing, which prevents fine force modulation and adaptive grasping [28,29]. Second, the association map, while effective, depends on the current dataset’s composition and size; its statistical resolution will improve as more dexterous hand designs are added. Third, the validation tasks, though demanding, remain structured; unstructured environments with varied objects and physical interactions need to be addressed. Integrating tactile sensing and real-time feedback control into the architecture will be a necessary next step. Finally, the convergence observed here suggests that inverse design frameworks, applied to larger and more diverse datasets, could eventually generate quantitative hypotheses in comparative biomechanics, closing the loop between engineering and biology.
Limitations regarding computational cost, generalizability, and multi-objective optimization. The proposed inverse design framework is computationally lightweight: the core Cramér’s V analysis on the dataset of 77 dexterous hands completes within seconds on a standard PC (e.g., Intel i5, 16 GB RAM) using the SciPy library in Python 3.11.7. The dominant effort is the one-time dataset construction (literature search, data extraction, normalization), which is performed offline. The framework can be extended to other robotic platforms (e.g., humanoid or quadruped robots), as the principle of correlating categorical design variables with performance metrics is platform-independent. However, the framework does not inherently resolve trade-offs when performance metrics conflict (e.g., speed vs. force, weight vs. durability). It identifies which design variables are associated with each metric, providing a basis for informed trade-off decisions. When objectives are non-conflicting, simultaneous optimization is straightforward; when conflicts exist, explicit multi-objective optimization (e.g., Pareto front analysis) would be required. The latter is not implemented in the current work and remains an important future direction.
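For readers unfamiliar with Pareto front analysis, the sketch below shows the basic idea for two conflicting objectives that are both to be maximized. It is not part of the present framework, which does not implement multi-objective optimization; the function names and the candidate (speed, force) pairs are hypothetical.

```python
def dominates(a, b):
    """True if design a is at least as good as b everywhere and better somewhere."""
    return (all(x >= y for x, y in zip(a, b))
            and any(x > y for x, y in zip(a, b)))


def pareto_front(points):
    """Keep only the designs that no other design dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical candidates: (max speed in deg/s, fingertip force in N).
candidates = [(300, 8), (250, 12), (200, 15), (220, 10), (180, 14)]

front = pareto_front(candidates)
print(front)
```

Designs on the front represent the available speed-force compromises; choosing among them still requires a task-specific preference, such as the speed emphasis of the rhythm-game target.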
5. Conclusions
This study addressed the challenge of simultaneously achieving human-like scale and dexterity with elevated dynamic performance in dexterous hand design. By constructing a quantitative association map between design variables and performance metrics, we established a data-driven inverse design framework that translates explicit dynamic targets into an optimized hardware configuration—replacing the empirical trial-and-error paradigm. The resulting Beyond Hand preserves a 20-DoF anthropomorphic form while embodying the biomechanical principles of tendon-driven transmission, proximal actuation, and lightweight rigid structures.
Standardized frequency-response tests across all 15 finger joints revealed stable and well-balanced dynamic characteristics, with magnitude attenuation below 3 dB at 14 Hz and cutoff frequencies tightly clustered around 10 Hz. In high-dynamic manipulation tasks, the Beyond Hand maintained over 90% overall accuracy in rhythm-game operations at frequencies up to 12 Hz and outperformed the human participants in our study under the same structured task conditions. The Tetris-style single-finger task further demonstrated rapid, reliable multi-target key-press capability, with fingertip velocities reaching approximately 3000 mm/s under progressively tightening time constraints and a fixed wrist.
Beyond validating specific performance targets, this work shows that purely data-driven analysis can independently recover design principles convergent with those of biological evolution, underscoring the value of quantitative performance-driven methodologies. The proposed framework is scalable and will be refined as the underlying design dataset grows. Future extensions will incorporate tactile sensing, adaptive grasping in unstructured environments, and real-time feedback control [30]. The Beyond Hand thus serves as both a validation platform and a blueprint for next-generation high-performance biomimetic robotic hands, with promising applications in service robotics, rehabilitation, high-speed industrial manipulation, and human–robot collaborative scenarios.
We also acknowledge that the human comparison in this study involved only five intermediate-level participants performing a structured rhythm-game task. Therefore, the finding that the Beyond Hand outperformed these participants should not be overgeneralized to all human capabilities. The human hand remains unmatched in versatility, sensory feedback, and adaptive manipulation.