Integrated jerk as an indicator of affinity for artificial agent kinematics: laptop and virtual reality experiments involving index finger motion during two-digit grasping

James Hirose; Atsushi Nishikawa; Yosuke Horiba; Shigeru Inui; Todd C. Pataky

doi:10.7717/peerj.9843

Integrated jerk as an indicator of affinity for artificial agent kinematics: laptop and virtual reality experiments involving index finger motion during two-digit grasping

James Hirose¹, Atsushi Nishikawa², Yosuke Horiba³, Shigeru Inui³, Todd C. Pataky ⁴

1Department of Biomedical Engineering, Shinshu University, Ueda, Nagano, Japan

2Department of Mechanical Science and Bioengineering, Osaka University, Suita, Osaka, Japan

3Department of Advance Textile and Kansei Engineering, Shinshu University, Ueda, Nagano, Japan

4Department of Human Health Sciences, Kyoto University, Sakyo-ku, Kyoto, Japan

DOI: 10.7717/peerj.9843

Published: 2020-09-15
Accepted: 2020-08-11
Received: 2020-03-02

Academic Editor: Leonardo Gollo

Subject Areas: Kinesiology, Psychiatry and Psychology, Human-Computer Interaction
Keywords: Kinematics, Artificial motion naturalness, Grasping, Uncanny valley

Copyright: © 2020 Hirose et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Hirose J, Nishikawa A, Horiba Y, Inui S, Pataky TC. 2020. Integrated jerk as an indicator of affinity for artificial agent kinematics: laptop and virtual reality experiments involving index finger motion during two-digit grasping. PeerJ 8:e9843 https://doi.org/10.7717/peerj.9843

The authors have chosen to make the review history of this article public.

Abstract

Uncanny valley research has shown that human likeness is an important consideration when designing artificial agents. It has separately been shown that artificial agents exhibiting human-like kinematics can elicit positive perceptual responses. However the kinematic characteristics underlying that perception have not been elucidated. This paper proposes kinematic jerk amplitude as a candidate metric for kinematic human likeness, and aims to determine whether a perceptual optimum exists over a range of jerk values. We created minimum-jerk two-digit grasp kinematics in a prosthetic hand model, then added different amplitudes of temporally smooth noise to yield a variety of animations involving different total jerk levels, ranging from maximally smooth to highly jerky. Subjects indicated their perceptual affinity for these animations by simultaneously viewing two different animations side-by-side, first using a laptop, then separately within a virtual reality (VR) environment. Results suggest that (a) subjects generally preferred smoother kinematics, (b) subjects exhibited a small preference for rougher-than minimum jerk kinematics in the laptop experiment, and that (c) the preference for rougher-than minimum-jerk kinematics was amplified in the VR experiment. These results suggest that non-maximally smooth kinematics may be perceptually optimal in robots and other artificial agents.

Introduction

The uncanny valley is a phenomenon introduced in the robotics literature by Masahiro Mori, explaining that our affinity toward objects generally increases as the object becomes more human-like; however, at a certain human-likeness our affinity drops drastically and we feel a sense of eeriness (Mori, MacDorman & Kageki, 2012). With appearance being quite human but being incompletely so, the viewer senses that something is unnatural and the object is seen as odd, creepy, and/or terrifying.

Kinematics may be important to consider in conjunction with the uncanny valley. Mori pointed out that moving objects can amplify affinity relative to non-moving objects, both in positive and negative senses (such as a corpse being animated becomes much more eerie) (Mori, MacDorman & Kageki, 2012). From this idea, one could surmise that different levels of human likeness in the objects’ kinematics may modulate uncanny valley responses.

The uncanny valley has arguably been overcome in many applications involving static, lifelike imagery such as computer-generated humans (Alexander et al., 2010; Perry, 2014), but animated human-like objects (mainly robots) are arguably far less human-like. A variety of research has shown the importance of kinematics in human–robot interactions. For example, it has been shown that both humanoid features and human movements are important factors for facilitating human–robot social interactions (Kilner, Paulignan & Blakemore, 2003; Oztop, Chaminade & Franklin, 2004a; Chaminade et al., 2005; Glasauer et al., 2010). Whereas the biological movement in the latter study (Chaminade et al., 2005) was from a motion captured trajectory, a separate but similar study (Huber et al., 2008b) showed that human–robot interaction improved when the robot moved with minimum-jerk trajectories as approximations to human kinematics (Flash & Hogan, 1985), instead of with mechanical trapezoid trajectories. This research suggests that kinematics, and potentially jerk specifically, may interact with the uncanny valley to form an overall affinity for moving artificial agents.

Jerk is the third time derivative of position, and applies to both linear and angular motion. It can be interpreted as the rate of change of acceleration, or perhaps more clearly as proportional to the rate of change of force. Jerk has been shown to be relevant to human–robot interaction (Huber et al., 2008a), and theoretical predictions of movement based on a minimum jerk criterion have been shown to qualitatively reproduce many features of human movement (Flash & Hogan, 1985; Furuna & Nagasaki, 1993; Viviani & Flash, 1995). Minimizing jerk is potentially useful movement strategy because high jerk implies rapid force changes and thus both wasted muscular energy and compromised mechanical control. Jerky motion is additionally characteristic of movement disorders (Latash, 2012; Kavanagh, Wedderburn-Bisshop & Keogh, 2016), implying that lower jerk, if perceivable by humans, may also be perceived as healthier and/or more natural than jerky movements.

The purpose of this study was to quantify the changes in affinity for a moving artificial agent as a function of kinematic jerk. We postulate that jerk may be an important affinity-relevant variable based on previous studies which hypothesize that humans follow minimum jerk trajectories (Flash & Hogan, 1985) and that smooth trajectory endpoints are associated with greater affinity (Huber et al., 2008b). Through this quantification a secondary goal was to find the optimal smoothness, if any, for artificial agent kinematics.

This study used the hand as an artificial agent model because the hand is more recognizable as human-like than a limb, and because using a hand avoids problems associated with uncanny valley-like responses to heads and faces. We aimed to examine our affinity toward robot hand prostheses by measuring affinity as a function of smoothnesses from very jerky kinematics, jerkier than a robot, to minimum-jerk kinematics. To secure simplicity for this experiment design, and provide functional findings that can be implemented into robotic designs, we aimed for a simple 1-DoF grasp motion. We chose to examine two-digit index-thumb motion as the simplest possible 1-DOF motion that is still recognizable as human-like.

Experiment 1: human and robot kinematics

The purpose of this experiment was to roughly quantify the expected range of kinematic jerk values for robot and human finger grasping motions. Specifically the human jerk magnitude that was used for kinematic references and basic kinematic motion robot hand device was used to generate the minimum-jerk value and trajectory.

Methods

A sample of 21 human subjects was tested. All subjects (including experiment 2 and 3) were requited from Shinshu University and provided written informed consent following the procedures of Shinshu University’s ethical review board (approval number: 216).

Two main devices were used for this experiment. The first was a six-camera motion capture system (VENUS 3D Flex13, Nobby Tech. Ltd., Tokyo, Japan) which recorded the positions of reflective markers at a sampling rate of 120 Hz. The second device was an open-source robotic hand (HACKberry, exiii Inc., Tokyo, Japan) (Fig. 1). The individual hand segments were 3D-printed using polylactic acid filament and, unlike the original open-source design, only the wrist segments and more distal segments were used (i.e., the forearm segments were not used). Also unlike the original open-source design, which was battery-powered, the device was modified to accept power using a standard wall adapter. Switches on the back of the hand were used to close the fingers into a grasp-like posture, and then return the fingers to the extended posture depicted in Fig. 1.

Figure 1: Assembled HACKberry system.

Download full-size image

DOI: 10.7717/peerj.9843/fig-1

Reflective markers were placed on the medial side of the human at the following locations: proximal interphalangeal joint (PIP joint), metacarpophalangeal joint (MP joint), and metacarpal bone (MP bone) (Fig. 2). The motion capture system’s six cameras were placed around the table, yielding a total capture volume of approximately 50 cm³.

Figure 2: Marker placements on a human subject hand.

Download full-size image

DOI: 10.7717/peerj.9843/fig-2

Protocol

A small eraser (1.0 × 1.6 × 4.5 cm) was placed on the table with its wide face down, and was used both as a reference point and as a guide for the subjects in order to generate a two finger grasp motion. Each participant held their thumb tip close to the eraser, and keeping an ‘L’ shape with their two fingers, they extended their index finger while facing their hand’s dorsal surface upwards. Marker positions were recorded for a minimum of 60 s at 120 Hz. Subjects were informed to grasp the eraser as they normally do and maintain the grasp until they are instructed to release it and return to their initial position. The interval between movements were 5 s and the grasp-release motions were repeated six times.

A similar motion recording was done on the HACKberry device to measure the grasp range and its time of movement. Like the human hand measurement, markers were placed onto the corresponding locations of the HACKberry device and its thumb and index finger will grasp the same object. The grasp motion was measured five times.

Analysis

Data were processed in Python 3 (Python Software Foundation). From the six grasp-release motions that the human subjects each performed, five grasp trials with the minimal marker swapping were selected for analysis. Each start and end point of the motion were manually determined and the average absolute jerk being calculated as: (1) $Average absolute jerk = \frac{1}{T} \sum_{t = t_{0}}^{T} |\frac{d^{3} θ (t)}{d t^{3}}|$ where T is the total time of the finger motion, t₀ is the time at which the finger starts to move, θ is the angle of the MP joint, and θ’s third derivative was estimated numerically using adjacent time samples. All joint angle data were processed with a low-pass fileter using a fifth-order, zero delay Butterworth filer with a cutoff frequency of 10 Hz.

The HACKberry device was measured for its grasp time length and its angular range by determining the initial and final MP joint angle from the average values. Just like the human subject data, the HACKberry’s initial and final point of its motion were manually determined.

Lastly, based on HACKberry’s measured data, we created a minimum-jerk trajectory. Flash & Hogan (1985) describe a general minimum-jerk trajectory (x_minjerk), where an object travels from x_i to x_f in time =d seconds, as: (2) $x_{minjerk} (t) = x_{i} + (x_{f} - x_{i}) (10 {(\frac{t}{d})}^{3} - 15 {(\frac{t}{d})}^{4} + 6 {(\frac{t}{d})}^{5})$ and the measured initial and final points, and the average grasp duration was applied accordingly, and the absolute average jerk magnitude is calculated using Eq. (1).

Results

As Fig. 3 shows, the average absolute human jerk for the task used in this paper was 0.50 × 10⁵deg∕s³. The HACKberry performed its two-digit 1-degree of freedom (DoF) grasp motion from 161.4 to 96.9 deg in 0.4 s. Thus the average absolute jerk magnitude for the minimum-jerk trajectory for the HACKberry device was 0.20 × 10⁵deg∕s³.

Figure 3: Absolute human jerk values relative to the minimum-jerk trajectory (MJT).
Each point represents one subject’s mean (averaged across five trials). The dashed-line represents the grand mean (across subjects). The shaded area represents standard deviation (SD).

Download full-size image

DOI: 10.7717/peerj.9843/fig-3

Brief discussion

Among the results, a subject produced a mean value (0.190 × 10⁵deg∕s³) that is just under the minimal jerk value (0.20 × 10⁵deg∕s³). This is because the subject moved its finger slower and/or in smaller pinch angle than the time and/or the travel angle of the minimum-jerk trajectory. Also note that the parameters for the minimum-jerk trajectory was based on the kinematics of the HACKberry robot hand.

Note that this experiment does not seek to identify differences between artificial and human kinematics. An artificial agent was used simply because jerk cannot be controlled precisely in real human motion. The human jerk data from Experiment 1 had just one purpose: to quantify an approximate range of jerk values that can be expected in a 1-DoF two finger grasp motion.