Unfortunately, I think that is just because the Leap is fitting fairly sparse observed data to a more detailed skeletal model. Perhaps referencing positions to the hand basis will make things easier? For example, here's an excerpt from a Java Processing sketch that transforms the joint positions into the hand's frame of reference and then draws an ellipse at each finger joint (everything except the ellipse drawing and the normalize helper is part of the Leap API):
// Build a transform from world coordinates into the hand's frame of reference:
// the hand basis gives the rotation, the palm position gives the origin, and
// rigidInverse() flips it so it maps world points into the hand frame.
Frame frame = controller.frame();
Hand hand = frame.hands().get(0);
currentHand = hand; // sketch-level field tracking the most recent hand

Matrix handTransform = hand.basis();
handTransform.setOrigin(hand.palmPosition());
handTransform = handTransform.rigidInverse();

for (Finger finger : hand.fingers()) {
    for (Bone.Type boneType : Bone.Type.values()) {
        Bone bone = finger.bone(boneType);
        // Joint at the base of this bone, expressed in the hand frame.
        Vector transformed = handTransform.transformPoint(bone.prevJoint());
        Vector normalized = normalize(transformed);
        ellipse(normalized.getX(), normalized.getZ(), 5, 5);
        if (boneType == Bone.Type.TYPE_DISTAL) { // also draw the tip of the distal phalanx
            Vector transformedN = handTransform.transformPoint(bone.nextJoint());
            Vector normalizedN = normalize(transformedN);
            ellipse(normalizedN.getX(), normalizedN.getZ(), 5, 5);
        }
    }
}
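The normalize call isn't part of the Leap API; it's a sketch helper that maps hand-frame millimeters onto the Processing canvas. A minimal version, assuming a 300 mm range centered on the palm (an arbitrary value to tune), might look like:

// Hypothetical helper: map hand-frame mm to canvas pixels.
// The 300 mm range is an assumed value, not a Leap constant.
Vector normalize(Vector v) {
    float range = 300.0f;
    float x = width * (v.getX() / range + 0.5f);
    float z = height * (v.getZ() / range + 0.5f);
    return new Vector(x, 0, z);
}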
This isolates the movement of the fingers from the movement of the hand. You still get some coupled movement from the other fingers, but I think it should be easier to find the minimum motion you can reliably detect as intentional.
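For example, a rough motion gate along those lines could compare each fingertip's hand-frame position against the previous frame. The 8 mm threshold and the prevTips bookkeeping here are my own assumptions to tune, not anything from the Leap SDK:

// Hypothetical gate: a fingertip that moves more than MIN_INTENTIONAL_MM
// relative to the hand frame between frames is treated as intentional motion.
final float MIN_INTENTIONAL_MM = 8.0f; // assumed threshold; calibrate empirically
HashMap<Integer, Vector> prevTips = new HashMap<Integer, Vector>();

boolean isIntentionalMotion(Hand hand, Matrix handTransform) {
    boolean moved = false;
    for (Finger finger : hand.fingers()) {
        Vector tip = handTransform.transformPoint(finger.tipPosition());
        Vector prev = prevTips.get(finger.id());
        if (prev != null && tip.distanceTo(prev) > MIN_INTENTIONAL_MM) {
            moved = true; // this fingertip moved relative to the hand, not with it
        }
        prevTips.put(finger.id(), tip);
    }
    return moved;
}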
On another level, though, even if the Leap's detection were flawless, will your users be flawless in their performance of your gestures? Given the variability in how still people are willing to hold their hands when they tap, there will still be a smallest reliable tap distance, and I suspect it will be larger than the unwanted motion shown in your video.
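If you want a number rather than a guess, you could even calibrate that per user: ask them to hold a deliberately still hand, record the per-frame fingertip displacements in the hand frame, and set the tap threshold above that noise floor. A sketch of that idea (the 3-sigma margin is an assumption, not a Leap recommendation):

// Hypothetical calibration: given per-frame fingertip displacements (hand-frame mm)
// recorded while the user held still, pick a threshold above the noise floor.
float thresholdFromStillness(ArrayList<Float> stillDisplacements) {
    float mean = 0, var = 0;
    for (float d : stillDisplacements) mean += d;
    mean /= stillDisplacements.size();
    for (float d : stillDisplacements) var += (d - mean) * (d - mean);
    var /= stillDisplacements.size();
    return mean + 3 * sqrt(var); // mean + 3 sigma, using Processing's sqrt()
}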