Halloween Project (Garage Of Mystery)

For this year's Halloween, the kids and I decided to do something out of the opening scene of Indiana Jones, without the big rock.  We wanted to give kids a choice when they came to the house -> either get a small "fun" size candy bar or enter the garage of mystery for the chance at a full-sized candy bar.  (Incidentally, whoever thought it would be a good idea to name the smallest candy size on earth "fun" obviously was never a kid.  When I was growing up, we called it "four size," because it took four of them to make a normal candy bar.)

So if a kid wants to go into the garage of mystery, they have to get to the altar of Snickers without the motion detector or the laser-beam trip wires catching them.  The full-size Snickers would disappear if the kid was picked up by the Kinect motion detector or if they tripped too many beams.  In the diagram below, the red dots are the lasers crossing in front of the altar.

image

The first thing we did was construct the altar.

imageimage

Once the frame was set, we added a servo with a trap door to the top.  We control the servo via a Phidget Servo Controller with some basic code from the Phidget SDK (if the SDK, you know, had F# in it)

member this.servoController_Attached(args:Events.AttachEventArgs) =
    let _servoController = args.Device :?> AdvancedServo
    _servoController.servos.[0].Engaged <- true
    _servoController.servos.[0].Position <- 110.
    _isServoControllerReady <- true

member this.initializeController() =
    _servoController.Attach.Add(this.servoController_Attached)
    _servoController.``open``()

member this.moveController(position:float) =
    if _isServoControllerReady then
        _servoController.servos.[0].Position <- position

And you can see it in action here:

 

With the altar ready, we turned our attention to the laser trip wires.  We purchased a whole bunch of dollar-store pen lasers and got some Phidget light sensors.  We then created a frame for each side of the garage -> one to mount the lasers and one to mount the light sensors.

imageimage

And then we added some basic code from the Phidget SDK (if the SDK, you know, had F# in it)

member this.interfaceKit_Attached(args: Events.AttachEventArgs) =
    let _interfaceKit = args.Device :?> InterfaceKit
    _interfaceKit.sensors
    |> Seq.cast
    |> Seq.map(fun s -> s :> InterfaceKitAnalogSensor)
    // Seq.iter eagerly applies the side effect (a lazy Seq.map piped to ignore would never run)
    |> Seq.iter(fun s -> s.Sensitivity <- 20)
    _isInterfaceKitReady <- true

member this.interfaceKit_SensorChange(e: SensorChangeEventArgs) =
    let eventArgs = new LightSensorChangeEventArgs(e.Index, e.Value)
    lightSensorChange.Trigger(eventArgs)

member this.initializeInterfaceKit() =
    _interfaceKit.Attach.Add(this.interfaceKit_Attached)
    _interfaceKit.SensorChange.Add(this.interfaceKit_SensorChange)
    _interfaceKit.``open``()
    _interfaceKit.waitForAttachment()

Note that we are trapping the event from the light sensor and then re-raising it through our own event.
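Here is that plumbing in isolation (a minimal sketch; the Garage class and OnSensorChange member are stand-ins, but LightSensorChangeEventArgs mirrors the code above):

// A sketch of the event plumbing: define our own EventArgs, publish an F# Event,
// and re-raise the translated Phidget event through it. The WPF code-behind then
// subscribes to LightSensorChange instead of the raw Phidget event.
type LightSensorChangeEventArgs(sensorIndex:int, lightAmount:int) =
    inherit System.EventArgs()
    member this.SensorIndex = sensorIndex
    member this.LightAmount = lightAmount

type Garage() =
    let lightSensorChange = new Event<LightSensorChangeEventArgs>()
    [<CLIEvent>]
    member this.LightSensorChange = lightSensorChange.Publish
    // called from the Phidget SensorChange handler
    member this.OnSensorChange(index:int, value:int) =
        lightSensorChange.Trigger(new LightSensorChangeEventArgs(index, value))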

With the light sensors in place, we turned our attention to the Kinect motion sensor.  I first considered Rob Miles's idea of comparing successive color frames to see if there was movement, but because I am using F# and F# does not support pointers the way C# does, the performance was too choppy.  You can see the Stack Overflow thread here.  So I could either jump over to C# or figure out a different way in F#.  I went with option B and used the skeleton frame, which has a Z index.  By comparing the Z index over time, I can see how fast a person is moving toward the altar.  The Kinect code was pretty much from the SDK (if the SDK, you know, had F# in it)

member this.kinectSensor_ColorFrameReady(args: ColorImageFrameReadyEventArgs) =
    use colorFrame = args.OpenColorImageFrame()
    if not (colorFrame = null) then
        let colorData = Array.zeroCreate<byte> colorFrame.PixelDataLength
        colorFrame.CopyPixelDataTo(colorData)
        let width = colorFrame.Width
        let height = colorFrame.Height
        let stride = colorFrame.Width * colorFrame.BytesPerPixel
        let eventArgs = new ColorDataReadyEventArgs(colorData, width, height, stride)
        colorDataReady.Trigger(eventArgs)

member this.KinectSensor_SkeletonFrameReady(args: SkeletonFrameReadyEventArgs) =
    use skeletonFrame = args.OpenSkeletonFrame()
    if not (skeletonFrame = null) then
        let skeletons = Array.zeroCreate<Skeleton> skeletonFrame.SkeletonArrayLength
        skeletonFrame.CopySkeletonDataTo(skeletons)
        let skeletons1 = skeletons |> Array.filter (fun s -> s.TrackingState = SkeletonTrackingState.Tracked)
        if skeletons1.Length > 0 then
            skeletonChanged.Trigger(skeletons1.[0])

member this.initializeKinect() =
    _kinectSensor.ColorStream.Enable()
    _kinectSensor.ColorFrameReady.Subscribe(this.kinectSensor_ColorFrameReady) |> ignore
    _kinectSensor.SkeletonStream.Enable()
    _kinectSensor.SkeletonFrameReady.Subscribe(this.KinectSensor_SkeletonFrameReady) |> ignore
    _kinectSensor.Start()

In the UI, I then checked for skeleton movement; if the person moved too fast, they would trigger the Snickers trap door to open:

void garage_SkeletonChanged(object sender, Skeleton skeleton)
{
    if (_skeletonPoint.Z > 0)
    {
        float zDelta = _skeletonPoint.Z - skeleton.Position.Z;
        if (zDelta >= _zDeltaThreshold)
        {
            _numberOfSkeletonHits += 1;
            skeletonChangedProgressBar.Dispatcher.Invoke(new Action(() => skeletonChangedProgressBar.Value = _numberOfSkeletonHits));
        }
        if (_numberOfSkeletonHits >= _numberOfHitsForAlarm)
        {
            _garage.moveController(_openPosition);
        }
        skeletonCanvas.Children.Clear();
        drawSkelton(skeleton);
    }
    _skeletonPoint = skeleton.Position;
}

With the result like this:

With the hard parts done, it was time to create a UI.  I went with C# here because I am using WPF, and the support for WPF and the Kinect is best in C#.  I created a WPF application and built a UI:

<Window x:Class="ChickenSoftware.Halloween.UI.MainWindow"
        xmlns="http://schemas.microsoft.com/winfx/2006/xaml/presentation"
        xmlns:x="http://schemas.microsoft.com/winfx/2006/xaml"
        Title="MainWindow" Height="600" Width="650" >
    <Grid Height="600" Width="650" VerticalAlignment="Top" HorizontalAlignment="Left" >
        <Image Name="kinectVideo" Height="480" Width="640" Margin="10,0,0,120" />
        <Canvas Name="skeletonCanvas" Height="480" Width="640" Margin="10,0,0,120" />
        <Rectangle x:Name="sensor0Rectange" Fill="Lime" HorizontalAlignment="Left" Height="40" Margin="10,480,0,0" Stroke="Black" VerticalAlignment="Top" Width="82"/>
        <Rectangle x:Name="sensor1Rectange" Fill="Lime" HorizontalAlignment="Left" Height="40" Margin="189,480,0,0" Stroke="Black" VerticalAlignment="Top" Width="83"/>
        <Rectangle x:Name="sensor2Rectange" Fill="Lime" HorizontalAlignment="Left" Height="40" Margin="367,480,0,0" Stroke="Black" VerticalAlignment="Top" Width="82"/>
        <Rectangle x:Name="sensor3Rectange" Fill="Lime" HorizontalAlignment="Left" Height="40" Margin="537,480,0,0" Stroke="Black" VerticalAlignment="Top" Width="83"/>
        <ProgressBar x:Name="skeletonChangedProgressBar" HorizontalAlignment="Left" Height="40" Margin="10,528,0,0" VerticalAlignment="Top" Width="392" Foreground="#FFB00606"/>
        <Button x:Name="resetButton" Content="Reset" HorizontalAlignment="Left" Height="37" Margin="537,528,0,0"
                VerticalAlignment="Top" Width="83" Click="resetButton_Click"/>
        <Button x:Name="EjectButton" Content="Eject!" HorizontalAlignment="Left" Height="37" Margin="429,528,0,0"
                VerticalAlignment="Top" Width="83" Click="EjectButton_Click"/>
    </Grid>
</Window>

I then added some code to handle all of the events that the Phidgets and the Kinect send to the UI and do something useful with them.  For example, the light sensor change fills in the appropriate box on the screen (note that Phidget events arrive on a different thread, so you need Dispatcher.Invoke):

void garage_LightSensorChange(object sender, LightSensorChangeEventArgs args)
{
    // the switch already guarantees the index, so the threshold check is hoisted out of each case
    if (args.LightAmount < _lightSensorThreshold)
    {
        switch (args.SensorIndex)
        {
            case 0:
                _sensor0Tripped = true;
                sensor0Rectange.Dispatcher.Invoke(new Action(() => sensor0Rectange.Fill = new SolidColorBrush(Colors.Red)));
                break;
            case 1:
                _sensor1Tripped = true;
                sensor1Rectange.Dispatcher.Invoke(new Action(() => sensor1Rectange.Fill = new SolidColorBrush(Colors.Red)));
                break;
            case 2:
                _sensor2Tripped = true;
                sensor2Rectange.Dispatcher.Invoke(new Action(() => sensor2Rectange.Fill = new SolidColorBrush(Colors.Red)));
                break;
            case 3:
                _sensor3Tripped = true;
                sensor3Rectange.Dispatcher.Invoke(new Action(() => sensor3Rectange.Fill = new SolidColorBrush(Colors.Red)));
                break;
        }
    }
    CheckForIntruder();
}

With this associated method:

private void CheckForIntruder()
{
    Int32 numberOfSensorsTripped = 0;

    if (_sensor0Tripped == true)
        numberOfSensorsTripped += 1;
    if (_sensor1Tripped == true)
        numberOfSensorsTripped += 1;
    if (_sensor2Tripped == true)
        numberOfSensorsTripped += 1;
    if (_sensor3Tripped == true)
        numberOfSensorsTripped += 1;
    if (numberOfSensorsTripped >= _numberOfSensorsForAlarm)
        _garage.moveController(0);
}

This code would be so much better in F# using pattern matching, but because of the UI code, I kept it in C#.  I might refactor the non-visual components later.  The one thing that did surprise me is how hard the Kinect V1 SDK makes it to separate the UI components from the domain components.  Phidgets, on the other hand, have a very clear separation of concerns.
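For the curious, here is roughly what I mean; a sketch only, assuming the four tripped flags get folded into a list:

// A sketch of CheckForIntruder with pattern matching: fold the tripped
// flags into a count, then match on the count instead of chained ifs.
// _garage comes from the existing UI code.
let checkForIntruder (trippedSensors: bool list) (numberOfSensorsForAlarm: int) =
    let trippedCount = trippedSensors |> List.filter id |> List.length
    match trippedCount with
    | n when n >= numberOfSensorsForAlarm -> _garage.moveController(0.)  // drop the trap door
    | _ -> ()

// e.g. checkForIntruder [_sensor0Tripped; _sensor1Tripped; _sensor2Tripped; _sensor3Tripped] _numberOfSensorsForAlarm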

We then added some sides to the altar of Snickers

image

And we were good to go.  The final result looks like this (the smoke machine was an added touch):

All of the code is on GitHub here.  If you create your own garage of mystery, please drop me a line -> I would love to see what other makers come up with.

Parsing Microsoft MVP Pages Part 2

The final piece of the Terminator app (V1) is to associate MVP names with the pictures I uploaded to Sky Biometry via the MVPId.  I already blogged about how to parse the MVP search page and get the photos for Sky Biometry, and this was a similar task.  The key for each photo is the MVPId.  Once a person's photo is sent to Sky Biometry, the response contains the photo used to match and the person's Id.  Ideally, we would also show the person's name.

The first step was to parse the MVP list the same way I did before:

let getPageContents(pageNumber:int) =
    let uri = new Uri("http://mvp.microsoft.com/en-us/search-mvp.aspx?lo=United+States&sl=0&browse=False&sc=s&ps=36&pn=" + pageNumber.ToString())
    let request = WebRequest.Create(uri)
    request.Method <- "GET"
    let response = request.GetResponse()
    let stream = response.GetResponseStream()
    let reader = new StreamReader(stream)
    reader.ReadToEnd()

Next, once the page is loaded, I needed a way of parsing the name.  I used tags like <a href="/en-us/mvp/Jamie%20Dixon-5000814" to identify MVPs.  I then layered in a regex like this:

let getMVPInfoFromPageContents(pageContents:string) =
    let pattern = "(us\\/mvp\\/)([A-Z])(.+?)(-)(\\d+)"
    let matchCollection = Regex.Matches(pageContents, pattern)
    matchCollection
    |> Seq.cast
    |> Seq.map(fun (m:Match) -> m.Value)
    |> Seq.map(fun s -> s.Split('-'))
    |> Seq.map(fun a -> a.[0], a.[1])
    |> Seq.map(fun (n,i) -> n.Substring(7), n, i)
    |> Seq.map(fun (n,ln,i) -> n.Replace("%20"," "), ln, i)
    |> Seq.map(fun (n,ln,i) -> n, "mvp.microsoft.com/en-" + ln + "-" + i, i)
    |> Seq.distinctBy(fun (n,uri,i) -> n)

And this is a great site for building regexes.

With the parsing in place, I then pulled each page, collected the results, and saved them to disk:

let getGetMVPInfos(pageNumber: int) =
    let pageContents = getPageContents(pageNumber)
    getMVPInfoFromPageContents pageContents

let pageList = [1..17]
let mvpInfos =
    pageList
    |> Seq.collect(fun i -> getGetMVPInfos(i))

let outFile = new StreamWriter(@"c:\data\mvpList.csv")
mvpInfos |> Seq.iter(fun (n,uri,i) -> outFile.WriteLine(sprintf "%s,%s,%s" n uri i))
outFile.Flush()
outFile.Close()

And with that in place, the Terminator can use the F# CSV type provider to load the list (and also find Esther Lee, the one non-MVP the Terminator is scanning for):

namespace ChickenSoftware.Terminator.Core

open System
open FSharp.Data

type nameMappingContext = CsvProvider<"C:/data/mvpList.csv">

type LocalFileSystemMvpProvider () =
    member this.GetMVPInfo (mvpId:int) =
        if mvpId = 1 then
            new MVPInfo(1, "Esther Lee", "NA", "https://pbs.twimg.com/profile_images/2487129558/3DSC_0379.jpg")
        else
            let nameList = nameMappingContext.Load("C:/data/mvpList.csv")
            let foundInfo =
                nameList.Rows
                |> Seq.filter(fun r -> r.``21505`` = mvpId.ToString())
                |> Seq.map(fun r -> new MVPInfo(Int32.Parse(r.``21505``), r.``Bill Jelen``,
                                                r.``mvp.microsoft.com/en-us/mvp/Bill%20Jelen-21505``,
                                                "http://mvp.microsoft.com/private/en-us/PublicProfile/Photo/" + r.``21505``))
                |> Seq.toArray
            if foundInfo.Length > 0 then
                foundInfo.[0]
            else
                new MVPInfo(-1, "None", "None", "None")

And then compare the two photos and get the person's name:

LocalFileSystemMvpProvider mvpProvider = new LocalFileSystemMvpProvider();
var mvpInfo = mvpProvider.GetMVPInfo(mvpId);

compareImage.Source = new BitmapImage(new Uri(mvpInfo.PhotoUri));
facialRecognitionTextBox.Text = mvpInfo.FullName + " identified with a " + matchValue.Confidence + "% confidence.";

And it kinda works

image

and kinda not

image

Parsing Microsoft MVP Pages and Uploading Photos to Sky Biometry

As a piece of the Terminator project that I am bringing to the MVP Summit, I wanted to load all of the MVP photographs into Sky Biometry and, if a person matches a photo with high confidence, terminate them.  I asked my Microsoft contact if I could get all of the MVP photos to load into the app and was politely told no.

Not being one who takes no lightly, I decided to see if I could load the photos from the MVP website.  Each MVP has a profile photo like here, and all of the MVPs are listed here with their MVP IDs specified.  So if I can get the Id from the search page and then create a Uri to the photo, I can load it into Sky Biometry.

I first created a new F# project and fired up a script window.  I created a function that gets the entire contents of a page, with the only variable being the index number of the pagination:

let getPageContents(pageNumber:int) =
    let uri = new Uri("http://mvp.microsoft.com/en-us/search-mvp.aspx?lo=United+States&sl=0&browse=False&sc=s&ps=36&pn=" + pageNumber.ToString())
    let request = WebRequest.Create(uri)
    request.Method <- "GET"
    let response = request.GetResponse()
    let stream = response.GetResponseStream()
    let reader = new StreamReader(stream)
    reader.ReadToEnd()

I then parsed the page for all instances of the MVPId.  Fortunately, I found this post that helped me understand how pattern matching works in .NET.  Note that the regex for a tag like mvpid=123456 is "mvpid=\d+":

let getMVPIdsFromPageContents(pageContents:string) =
    let pattern = "mvpid=\d+"
    let matchCollection = Regex.Matches(pageContents, pattern)
    matchCollection
    |> Seq.cast
    |> Seq.map(fun (m:Match) -> m.Value)
    |> Seq.map(fun s -> s.Split('='))
    |> Seq.map(fun a -> a.[1])

With that out of the way, I could get a Seq of all MVP IDs (at least from America) and then collect the pages together:

let getGetMVPIds(pageNumber: int) =
    let pageContents = getPageContents(pageNumber)
    getMVPIdsFromPageContents pageContents

let pageList = [1..17]
let mvpIds =
    pageList
    |> Seq.collect(fun i -> getGetMVPIds(i))

So far, so good:

image

I could then create a function that generates the MVP photo Uri:

let getMvpImageUri(mvpId: int) =
    new Uri("http://mvp.microsoft.com/private/en-us/PublicProfile/Photo/" + mvpId.ToString())

With that out of the way, it was time to send the photos to Sky Biometry for facial detection and tagging.  I used the code found in this post, with a couple of changes to account for a face possibly not being found in the photo (hence the option type) and for bad things happening (like too big a photo):

type skybiometryFaceDetection = JsonProvider<".\SkyBiometryImageJson\FaceDetection.json">
type skybiometryAddTags = JsonProvider<".\SkyBiometryImageJson\AddTags.json">
type skybiometryFaceTraining = JsonProvider<".\SkyBiometryImageJson\FaceTraining.json">

let detectFace (imageUri:string) =
    let stringBuilder = new StringBuilder()
    stringBuilder.Append(skyBiometryUri) |> ignore
    stringBuilder.Append("/fc/faces/detect.json?urls=") |> ignore
    stringBuilder.Append(imageUri) |> ignore
    stringBuilder.Append("&api_key=") |> ignore
    stringBuilder.Append(skyBiometryApiKey) |> ignore
    stringBuilder.Append("&api_secret=") |> ignore
    stringBuilder.Append(skyBiometryApiSecret) |> ignore
    try
        let faceDetection = skybiometryFaceDetection.Load(stringBuilder.ToString())
        if faceDetection.Photos.[0].Tags.Length > 0 then
            Some faceDetection.Photos.[0].Tags.[0].Tid
        else
            None
    with | :? System.Exception -> None

I then added the other two functions, to tag and to train:

let saveTag(uid:string, tid:string) =
    let stringBuilder = new StringBuilder()
    stringBuilder.Append(skyBiometryUri) |> ignore
    stringBuilder.Append("/fc/tags/save.json?uid=") |> ignore
    stringBuilder.Append(uid) |> ignore
    stringBuilder.Append("&tids=") |> ignore
    stringBuilder.Append(tid) |> ignore
    stringBuilder.Append("&api_key=") |> ignore
    stringBuilder.Append(skyBiometryApiKey) |> ignore
    stringBuilder.Append("&api_secret=") |> ignore
    stringBuilder.Append(skyBiometryApiSecret) |> ignore
    let tags = skybiometryAddTags.Load(stringBuilder.ToString())
    tags.Status

let trainFace(uid:string) =
    let stringBuilder = new StringBuilder()
    stringBuilder.Append(skyBiometryUri) |> ignore
    stringBuilder.Append("/fc/faces/train.json?uids=") |> ignore
    stringBuilder.Append(uid) |> ignore
    stringBuilder.Append("&api_key=") |> ignore
    stringBuilder.Append(skyBiometryApiKey) |> ignore
    stringBuilder.Append("&api_secret=") |> ignore
    stringBuilder.Append(skyBiometryApiSecret) |> ignore
    let training = skybiometryFaceTraining.Load(stringBuilder.ToString())
    training.Status

Upon reflection, this would have been a perfect place for Scott Wlaschin's Railway Oriented Programming (ROP), but I just created a covering function:

let saveToSkyBiometry(mvpId:string, imageUri:string) =
    let tid = detectFace(imageUri)
    match tid with
    | Some x ->
        saveTag(mvpId + "@terminatorChicken", x) |> ignore
        trainFace(mvpId + "@terminatorChicken")
    | None -> "Failure"

let results =
    mvpIds
    |> Seq.map(fun mvpId -> mvpId, getMvpImageUri(Int32.Parse(mvpId)))
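For the curious, a railway-style version of the covering function might look something like this; just a sketch, with a hand-rolled result type (this predates a built-in one) and placeholder error strings:

// A sketch of Railway Oriented Programming over the same steps: each step
// returns Success or Failure, and bind short-circuits the rest of the
// pipeline as soon as one step fails.
type RopResult<'TSuccess> =
    | Success of 'TSuccess
    | Failure of string

let bind f = function
    | Success x -> f x
    | Failure e -> Failure e

let detectFace' imageUri =
    match detectFace imageUri with
    | Some tid -> Success tid
    | None -> Failure "no face detected"

let saveToSkyBiometry' (mvpId:string) (imageUri:string) =
    detectFace' imageUri
    |> bind (fun tid -> Success (saveTag(mvpId + "@terminatorChicken", tid)))
    |> bind (fun _ -> Success (trainFace(mvpId + "@terminatorChicken")))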

I then used Seq.map to process all of the photos in order, but I quickly ran into this:

Capture

So I changed my Seq.map to a loop so I could throttle the requests:

for (mvpId, uri) in results do
    let result = saveToSkyBiometry(mvpId, uri.ToString())
    printfn "%s" result
    Thread.Sleep(TimeSpan.FromMinutes(1.))

And sure enough

Capture1Capture2

And you can see the load every hour

Capture3

You can see the full code here.

Terminator Program: With The Kinect 2

I got my hands on a Kinect 2 last week, so I decided to re-write the Terminator program using the Kinect 2 API.  Microsoft made some major changes to the domain API (no more skeleton frame; it now uses a body), but the underlying logic is still the same.  Therefore, it was reasonably easy to port the code.  There are plenty of places in the V2 API that are not documented yet, but because I did some work in the V1 API, I could still get things done.  For example, the V2 API documentation and code samples use event handlers to work with each new frame that arrives from the Kinect.  This led to some pretty laggy code.  However, by using polling on a second thread, I was able to get the performance to where it needs to be.  Also, a minor annoyance is that you have to use Win8 with the Kinect 2.

So here is the Terminator application, Gen 2.  The UI is still just a series of UI controls:

<Window x:Class="ChickenSoftware.Terminator.Gen2.UI.MainWindow"
        xmlns="http://schemas.microsoft.com/winfx/2006/xaml/presentation"
        xmlns:x="http://schemas.microsoft.com/winfx/2006/xaml"
        Title="MainWindow" Height="700" Width="650" Loaded="Window_Loaded">
    <Canvas Width="650" Height="700">
        <Image x:Name="kinectColorImage" Width="640" Height="480" />
        <Canvas x:Name="bodyCanvas" Width="640" Height="480" />
        <Button x:Name="takePhotoButton" Canvas.Left="10"
                Canvas.Top="485" Height="40" Width="125" Click="takePhotoButton_Click">Take Photo</Button>
        <TextBox x:Name="facialRecognitionTextBox" Canvas.Left="10" Canvas.Top="540" Width="125" Height="40" FontSize="8" />
        <Image x:Name="currentImage" Canvas.Left="165" Canvas.Top="485" Height="120" Width="170" />
        <Image x:Name="compareImage" Canvas.Left="410" Canvas.Top="485" Height="120" Width="170" />
    </Canvas>
</Window>

In the code behind, I set up some class-level variables.  The only real difference is that the photo moves from 640x480 to 1920x1080:

KinectSensor _kinectSensor = null;
Boolean _isKinectDisplayActive = false;
Boolean _isTakingPicture = false;
WriteableBitmap _videoBitmap = null;
Int32 _width = 1920;
Int32 _height = 1080;

When the window is loaded, a new thread is spun up that handles rendering the Kinect data:

private void Window_Loaded(object sender, RoutedEventArgs e)
{
    SetUpKinect();
    _isKinectDisplayActive = true;
    Thread videoThread = new Thread(new ThreadStart(DisplayKinectData));
    videoThread.Start();
}

Setting up the Kinect is a bit different (KinectSensor.GetDefault()) but intuitive:

internal void SetUpKinect()
{
    _videoBitmap = new WriteableBitmap(1920, 1080, 96, 96, PixelFormats.Bgr32, null);
    _kinectSensor = KinectSensor.GetDefault();
    _kinectSensor.Open();
}

The big change is in the DisplayKinectData method:

internal void DisplayKinectData()
{
    var colorFrameSource = _kinectSensor.ColorFrameSource;
    var colorFrameReader = colorFrameSource.OpenReader();
    var bodyFrameSource = _kinectSensor.BodyFrameSource;
    var bodyFrameReader = bodyFrameSource.OpenReader();

    while (_isKinectDisplayActive)
    {
        using (var colorFrame = colorFrameReader.AcquireLatestFrame())
        {
            if (colorFrame == null) continue;
            using (var bodyFrame = bodyFrameReader.AcquireLatestFrame())
            {
                if (bodyFrame == null) continue;
                //Color
                var colorFrameDescription = colorFrame.ColorFrameSource.CreateFrameDescription(ColorImageFormat.Bgra);
                var bytesPerPixel = colorFrameDescription.BytesPerPixel;
                var frameSize = colorFrameDescription.Width * colorFrameDescription.Height * bytesPerPixel;
                var colorData = new byte[frameSize];
                if (colorFrame.RawColorImageFormat == ColorImageFormat.Bgra)
                {
                    colorFrame.CopyRawFrameDataToArray(colorData);
                }
                else
                {
                    colorFrame.CopyConvertedFrameDataToArray(colorData, ColorImageFormat.Bgra);
                }
                //Body
                var bodies = new Body[bodyFrame.BodyCount];
                bodyFrame.GetAndRefreshBodyData(bodies);
                var trackedBody = bodies.FirstOrDefault(b => b.IsTracked);

                //Update
                if (_isTakingPicture)
                {
                    Dispatcher.Invoke(new Action(() => AnalyzePhoto(colorData)));
                }
                else
                {
                    if (trackedBody == null)
                    {
                        Dispatcher.Invoke(new Action(() => UpdateDisplay(colorData)));
                    }
                    else
                    {
                        Dispatcher.Invoke(new Action(() => UpdateDisplay(colorData, trackedBody)));
                    }
                }
            }
        }
    }
}

I am using a frame reader and frame source for both the color (the video image) and the body (the old skeleton).  The method to get the frame has changed -> I am now using AcquireLatestFrame().  It is nice that we are still using byte[] to hold the data.

With the data in the byte[] arrays, the display is updated.  There are two UpdateDisplay methods:

internal void UpdateDisplay(byte[] colorData)
{
    var rectangle = new Int32Rect(0, 0, _width, _height);
    _videoBitmap.WritePixels(rectangle, colorData, _width * 4, 0);
    kinectColorImage.Source = _videoBitmap;
}

internal void UpdateDisplay(byte[] colorData, Body body)
{
    UpdateDisplay(colorData);
    var drawingGroup = new DrawingGroup();
    using (var drawingContext = drawingGroup.Open())
    {
        var headPosition = body.Joints[JointType.Head].Position;
        if (headPosition.Z < 0)
        {
            headPosition.Z = 0.1f;
        }
        var adjustedHeadPosition = _kinectSensor.CoordinateMapper.MapCameraPointToDepthSpace(headPosition);
        bodyCanvas.Children.Clear();
        Rectangle headTarget = new Rectangle();
        headTarget.Fill = new SolidColorBrush(Colors.Red);
        headTarget.Width = 10;
        headTarget.Height = 10;
        Canvas.SetLeft(headTarget, adjustedHeadPosition.X + 75);
        Canvas.SetTop(headTarget, adjustedHeadPosition.Y);
        bodyCanvas.Children.Add(headTarget);
    }
}

This is pretty much like V1: the video byte[] is written to a WriteableBitmap and the body is drawn on the canvas.  Note that, as in V1, the coordinates of the body need to be adjusted to the color frame.  The API has a series of overloads that make it easy to do the translation.
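For reference, the overloads in question look like this (a sketch in F#; sensor and body are assumed to be in scope):

// A sketch of the v2 CoordinateMapper: the same camera-space joint (in meters)
// can be projected into the color frame (1920x1080) or the depth frame (512x424).
let mapper = sensor.CoordinateMapper
let headPosition = body.Joints.[JointType.Head].Position         // CameraSpacePoint
let colorPoint = mapper.MapCameraPointToColorSpace(headPosition) // ColorSpacePoint
let depthPoint = mapper.MapCameraPointToDepthSpace(headPosition) // DepthSpacePoint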

With the display working, I added in taking the photo, sending it to Azure blob storage, and having Sky Biometry analyze the results.  This code is identical to V1, with the connection strings for Azure and Sky Biometry broken out into their own methods and the sensitive values placed in the app.config:

internal void AnalyzePhoto(byte[] colorData)
{
    var bitmapSource = BitmapSource.Create(_width, _height, 96, 96, PixelFormats.Bgr32, null, colorData, _width * 4);
    JpegBitmapEncoder encoder = new JpegBitmapEncoder();
    encoder.Frames.Add(BitmapFrame.Create(bitmapSource));
    var photoImage = UploadPhotoImage(encoder);
    CompareImages(photoImage);
    _isTakingPicture = false;
}

internal PhotoImage UploadPhotoImage(JpegBitmapEncoder encoder)
{
    using (MemoryStream memoryStream = new MemoryStream())
    {
        encoder.Save(memoryStream);
        var photoImage = new PhotoImage(Guid.NewGuid(), memoryStream.ToArray());

        var customerUniqueId = new Guid(ConfigurationManager.AppSettings["customerUniqueId"]);
        var connectionString = GetAzureConnectionString();

        IPhotoImageProvider provider = new AzureStoragePhotoImageProvider(customerUniqueId, connectionString);
        provider.InsertPhotoImage(photoImage);
        memoryStream.Close();
        return photoImage;
    }
}

internal void CompareImages(PhotoImage photoImage)
{
    String skyBiometryUri = ConfigurationManager.AppSettings["skyBiometryUri"];
    String uid = ConfigurationManager.AppSettings["skyBiometryUid"];
    String apiKey = ConfigurationManager.AppSettings["skyBiometryApiKey"];
    String apiSecret = ConfigurationManager.AppSettings["skyBiometryApiSecret"];
    var imageComparer = new SkyBiometryImageComparer(skyBiometryUri, uid, apiKey, apiSecret);

    String basePhotoUri = GetBasePhotoUri();
    String targetPhotoUri = GetTargetPhotoUri(photoImage);
    currentImage.Source = new BitmapImage(new Uri(targetPhotoUri));
    compareImage.Source = new BitmapImage(new Uri(basePhotoUri));

    var matchValue = imageComparer.CalculateFacialRecognitionConfidence(basePhotoUri, targetPhotoUri);
    facialRecognitionTextBox.Text = "Match Value Confidence is: " + matchValue.Confidence.ToString();
}

With the code in place, I can then run the Terminator Gen 2:

image

I think I am doing the Sky Biometry recognition incorrectly, so I will look at that later.  In any event, working with the Kinect V2 was fairly easy because it is close enough to the V1 that the concepts translate.  I look forward to adding the targeting system this weekend!!!

Terminator Program: Part 2

Following up on my last post, I decided to send the entire photograph to Sky Biometry and have them parse the photograph and identify the individual people.  This ability is built right into their API.  For example, if you pass them this picture, you get the following JSON back.

image

I added the red highlight to show that Sky Biometry can recognize multiple people (it is an array of uids) and that each face tag has a center.x and center.y.  Reading the API documentation, this point is the center of the face tag, expressed as a percentage of the photo's width and height.

image
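In other words, getting from a face tag center back to pixels is just multiplication. A quick sketch for the 640x480 photos used here:

// A sketch of converting a Sky Biometry center point (percentages of the
// photo) back into pixel coordinates.
let toPixels (centerX:float) (centerY:float) (photoWidth:float) (photoHeight:float) =
    (centerX / 100. * photoWidth), (centerY / 100. * photoHeight)

// e.g. toPixels 10. 13.33 640. 480. evaluates to roughly (64.0, 63.98)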

So I need to translate the center point of the skeleton from the Kinect to the equivalent center point of the Sky Biometry recognition output, and then I should be able to identify individual people within the Kinect's field of vision.  Going back to the Kinect code, I ditched the DrawBoxAroundHead method and altered the UpdateDisplay method like so:

private void UpdateDisplay(byte[] colorData, Skeleton[] skeletons)
{
    if (_videoBitmap == null)
    {
        _videoBitmap = new WriteableBitmap(640, 480, 96, 96, PixelFormats.Bgr32, null);
    }
    _videoBitmap.WritePixels(new Int32Rect(0, 0, 640, 480), colorData, 640 * 4, 0);
    kinectColorImage.Source = _videoBitmap;
    var selectedSkeleton = skeletons.FirstOrDefault(s => s.TrackingState == SkeletonTrackingState.Tracked);
    if (selectedSkeleton != null)
    {
        var headPosition = selectedSkeleton.Joints[JointType.Head].Position;
        var adjustedHeadPosition =
            _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(headPosition, ColorImageFormat.RgbResolution640x480Fps30);
        var adjustedSkeletonPosition = _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(selectedSkeleton.Position, ColorImageFormat.RgbResolution640x480Fps30);

        skeletonCanvas.Children.Clear();
        Rectangle headRectangle = new Rectangle();
        headRectangle.Fill = new SolidColorBrush(Colors.Blue);
        headRectangle.Width = 10;
        headRectangle.Height = 10;
        Canvas.SetLeft(headRectangle, adjustedHeadPosition.X);
        Canvas.SetTop(headRectangle, adjustedHeadPosition.Y);
        skeletonCanvas.Children.Add(headRectangle);

        Rectangle skeletonRectangle = new Rectangle();
        skeletonRectangle.Fill = new SolidColorBrush(Colors.Red);
        skeletonRectangle.Width = 10;
        skeletonRectangle.Height = 10;
        // position the second rectangle at the skeleton position (the head rectangle above uses the head position)
        Canvas.SetLeft(skeletonRectangle, adjustedSkeletonPosition.X);
        Canvas.SetTop(skeletonRectangle, adjustedSkeletonPosition.Y);
        skeletonCanvas.Children.Add(skeletonRectangle);

        String skeletonInfo = headPosition.X.ToString() + " : " + headPosition.Y.ToString() + " -- ";
        skeletonInfo = skeletonInfo + adjustedHeadPosition.X.ToString() + " : " + adjustedHeadPosition.Y.ToString() + " -- ";
        skeletonInfo = skeletonInfo + adjustedSkeletonPosition.X.ToString() + " : " + adjustedSkeletonPosition.Y.ToString();

        skeletonInfoTextBox.Text = skeletonInfo;
    }
}

Notice that there are two rectangles because I was not sure whether the Head.Position or the Skeleton.Position would match Sky Biometry.  It turns out that I want the Head.Position for Sky Biometry (besides, the Terminator would want head shots only).

image

So I ditched the Skeleton.Position.  I then needed a way to translate Head.Position.X to SkyBiometry.X and Head.Position.Y to SkyBiometry.Y.  Fortunately, I know the size of each photograph (640x480), so calculating the percent is an exercise in altering UpdateDisplay:

private void UpdateDisplay(byte[] colorData, Skeleton[] skeletons)
{
    Int32 photoWidth = 640;
    Int32 photoHeight = 480;

    if (_videoBitmap == null)
    {
        _videoBitmap = new WriteableBitmap(photoWidth, photoHeight, 96, 96, PixelFormats.Bgr32, null);
    }
    _videoBitmap.WritePixels(new Int32Rect(0, 0, photoWidth, photoHeight), colorData, photoWidth * 4, 0);
    kinectColorImage.Source = _videoBitmap;
    var selectedSkeleton = skeletons.FirstOrDefault(s => s.TrackingState == SkeletonTrackingState.Tracked);
    if (selectedSkeleton != null)
    {
        var headPosition = selectedSkeleton.Joints[JointType.Head].Position;
        var adjustedHeadPosition =
            _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(headPosition, ColorImageFormat.RgbResolution640x480Fps30);

        skeletonCanvas.Children.Clear();
        Rectangle headRectangle = new Rectangle();
        headRectangle.Fill = new SolidColorBrush(Colors.Blue);
        headRectangle.Width = 10;
        headRectangle.Height = 10;
        Canvas.SetLeft(headRectangle, adjustedHeadPosition.X);
        Canvas.SetTop(headRectangle, adjustedHeadPosition.Y);
        skeletonCanvas.Children.Add(headRectangle);

        var skyBiometryX = ((float)adjustedHeadPosition.X / photoWidth) * 100;
        var skyBioMetryY = ((float)adjustedHeadPosition.Y / photoHeight) * 100;

        String skeletonInfo = adjustedHeadPosition.X.ToString() + " : " + adjustedHeadPosition.Y.ToString() + " -- ";
        skeletonInfo = skeletonInfo + Math.Round(skyBiometryX, 2).ToString() + " : " + Math.Round(skyBioMetryY, 2).ToString();

        skeletonInfoTextBox.Text = skeletonInfo;
    }
}

And so now I have

image

The next step is to get the Kinect photo to Sky Biometry.  I decided to use Azure blob storage as my intermediate location.  I updated the architectural diagram like so:

image

At this point, it made sense to move the project over to F# so I could better concentrate on the work that needs to be done and also get the important code out of the UI code-behind.  I fired up an F# project in my solution and added a couple of different implementations for storing photos.  To keep things consistent, I created a data structure and an interface:

namespace ChickenSoftware.Terminator.Core

open System

type public PhotoImage (uniqueId:Guid, imageBytes:byte[]) =
    member this.UniqueId = uniqueId
    member this.ImageBytes = imageBytes

type IPhotoImageProvider =
    abstract member InsertPhotoImage : PhotoImage -> unit
    abstract member DeletePhotoImage : Guid -> unit
    abstract member GetPhotoImage : Guid -> PhotoImage

My first stop was to replicate what Miles did with the save-file dialog, using a file system provider.  It is very much like a C# implementation:

namespace ChickenSoftware.Terminator.Core

open System
open System.IO
open System.Drawing
open System.Drawing.Imaging

type LocalFileSystemPhotoImageProvider(folderPath: string) =

    member this.GetPhotoImageUri(uniqueIdentifier: Guid) =
        let fileName = uniqueIdentifier.ToString() + ".jpg"
        Path.Combine(folderPath, fileName)

    interface IPhotoImageProvider with
        member this.InsertPhotoImage(photoImage: PhotoImage) =
            let fullPath = this.GetPhotoImageUri(photoImage.UniqueId)
            use memoryStream = new MemoryStream(photoImage.ImageBytes)
            let image = Image.FromStream(memoryStream)
            image.Save(fullPath)

        member this.DeletePhotoImage(uniqueIdentifier: Guid) =
            let fullPath = this.GetPhotoImageUri(uniqueIdentifier)
            File.Delete(fullPath)

        member this.GetPhotoImage(uniqueIdentifier: Guid) =
            let fullPath = this.GetPhotoImageUri(uniqueIdentifier)
            use fileStream = new FileStream(fullPath, FileMode.Open)
            let image = Image.FromStream(fileStream)
            use memoryStream = new MemoryStream()
            image.Save(memoryStream, ImageFormat.Jpeg)
            new PhotoImage(uniqueIdentifier, memoryStream.ToArray())

To call the save method, I altered the SavePhoto method in the C# project to use a MemoryStream instead of a FileStream:

private void SavePhoto(byte[] colorData)
{
    var bitmapSource = BitmapSource.Create(640, 480, 96, 96, PixelFormats.Bgr32, null, colorData, 640 * 4);
    JpegBitmapEncoder encoder = new JpegBitmapEncoder();
    encoder.Frames.Add(BitmapFrame.Create(bitmapSource));
    using (MemoryStream memoryStream = new MemoryStream())
    {
        encoder.Save(memoryStream);
        PhotoImage photoImage = new PhotoImage(Guid.NewGuid(), memoryStream.ToArray());

        String folderUri = @"C:\Data";
        IPhotoImageProvider provider = new LocalFileSystemPhotoImageProvider(folderUri);

        provider.InsertPhotoImage(photoImage);
        memoryStream.Close();
    }
    _isTakingPicture = false;
}

And sure enough, it saves the photo to disk:

image

One problem that took me 20 minutes to uncover is that if you get your file system path wrong, you get the unhelpful exception:

image

This has been well-bitched about on Stack Overflow, so I won't comment further.

With the file system up and running, I turned my attention to Azure.  Like the file system provider, it is very close to a C# implementation:

namespace ChickenSoftware.Terminator.Core

open System
open System.IO
open Microsoft.WindowsAzure.Storage
open Microsoft.WindowsAzure.Storage.Blob

type AzureStoragePhotoImageProvider(customerUniqueId: Guid, connectionString: string) =

    member this.GetBlobContainer(blobClient:Blob.CloudBlobClient) =
        let container = blobClient.GetContainerReference(customerUniqueId.ToString())
        if not (container.Exists()) then
            container.CreateIfNotExists() |> ignore
            let permissions = new BlobContainerPermissions()
            permissions.PublicAccess <- BlobContainerPublicAccessType.Blob
            container.SetPermissions(permissions)
        container

    member this.GetBlockBlob(uniqueIdentifier: Guid) =
        let storageAccount = CloudStorageAccount.Parse(connectionString)
        let blobClient = storageAccount.CreateCloudBlobClient()
        let container = this.GetBlobContainer(blobClient)
        let photoUri = this.GetPhotoImageUri(uniqueIdentifier)
        container.GetBlockBlobReference(photoUri)

    member this.GetPhotoImageUri(uniqueIdentifier: Guid) =
        uniqueIdentifier.ToString() + ".jpg"

    interface IPhotoImageProvider with
        member this.InsertPhotoImage(photoImage: PhotoImage) =
            let blockBlob = this.GetBlockBlob(photoImage.UniqueId)
            use memoryStream = new MemoryStream(photoImage.ImageBytes)
            blockBlob.UploadFromStream(memoryStream)

        member this.DeletePhotoImage(uniqueIdentifier: Guid) =
            let blockBlob = this.GetBlockBlob(uniqueIdentifier)
            blockBlob.Delete()

        member this.GetPhotoImage(uniqueIdentifier: Guid) =
            let blockBlob = this.GetBlockBlob(uniqueIdentifier)
            if blockBlob.Exists() then
                blockBlob.FetchAttributes()
                use memoryStream = new MemoryStream()
                blockBlob.DownloadToStream(memoryStream)
                let photoArray = memoryStream.ToArray()
                new PhotoImage(uniqueIdentifier, photoArray)
            else
                failwith "photo not found"

And when I pop it into the WPF application,

private void SavePhoto(byte[] colorData)
{
    var bitmapSource = BitmapSource.Create(640, 480, 96, 96, PixelFormats.Bgr32, null, colorData, 640 * 4);
    JpegBitmapEncoder encoder = new JpegBitmapEncoder();
    encoder.Frames.Add(BitmapFrame.Create(bitmapSource));
    using (MemoryStream memoryStream = new MemoryStream())
    {
        encoder.Save(memoryStream);
        PhotoImage photoImage = new PhotoImage(Guid.NewGuid(), memoryStream.ToArray());

        Guid customerUniqueId = new Guid("7282AF48-FB3D-489B-A572-2EFAE80D0A9E");
        String connectionString =
            "DefaultEndpointsProtocol=http;AccountName=XXX;AccountKey=XXX";
        IPhotoImageProvider provider = new AzureStoragePhotoImageProvider(customerUniqueId, connectionString);

        provider.InsertPhotoImage(photoImage);
        memoryStream.Close();
    }
    _isTakingPicture = false;
}

I can now write my images to Azure.

image

With that out of the way, I can now have Sky Biometry pick up my photo, analyze it, and push the results back.  I went ahead and added the .fs module that I had already created for this blog post.  I then added FSharp.Data via NuGet and was ready to roll.  In the SavePhoto event handler, after saving the photo to blob storage, it calls Sky Biometry to compare against a base image that has already been trained:

private void SavePhoto(byte[] colorData)
{
    var bitmapSource = BitmapSource.Create(640, 480, 96, 96, PixelFormats.Bgr32, null, colorData, 640 * 4);
    JpegBitmapEncoder encoder = new JpegBitmapEncoder();
    encoder.Frames.Add(BitmapFrame.Create(bitmapSource));
    PhotoImage photoImage = UploadPhotoImage(encoder);

    String skyBiometryUri = "http://api.skybiometry.com";
    String uid = "Kinect@ChickenFace";
    String apiKey = "XXXX";
    String apiSecret = "XXXX";

    var imageComparer = new SkyBiometryImageComparer(skyBiometryUri, uid, apiKey, apiSecret);
    String basePhotoUri = "XXXX.jpg";
    String targetPhotoUri = "XXXX/" + photoImage.UniqueId + ".jpg";

    currentImage.Source = new BitmapImage(new Uri(basePhotoUri));
    compareImage.Source = new BitmapImage(new Uri(targetPhotoUri));

    var matchValue = imageComparer.CalculateFacialRecognitionConfidence(basePhotoUri, targetPhotoUri);
    FacialRecognitionTextBox.Text = "Match Value is: " + matchValue.ToString();
    _isTakingPicture = false;
}

And I am getting a result back from Sky Biometry.

image

Finally, I added in the Sky Biometry X and Y coordinates for the photo and compared them to the ones calculated from the Kinect skeleton tracking:

currentImage.Source = new BitmapImage(new Uri(basePhotoUri));
compareImage.Source = new BitmapImage(new Uri(targetPhotoUri));

var matchValue = imageComparer.CalculateFacialRecognitionConfidence(basePhotoUri, targetPhotoUri);

var selectedSkeleton = skeletons.FirstOrDefault(s => s.TrackingState == SkeletonTrackingState.Tracked);
if (selectedSkeleton != null)
{
    var headPosition = selectedSkeleton.Joints[JointType.Head].Position;
    var adjustedHeadPosition =
        _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(headPosition, ColorImageFormat.RgbResolution640x480Fps30);

    var skyBiometryX = ((float)adjustedHeadPosition.X / 640) * 100;
    var skyBioMetryY = ((float)adjustedHeadPosition.Y / 480) * 100;

    StringBuilder stringBuilder = new StringBuilder();
    stringBuilder.Append("Match Value is: ");
    stringBuilder.Append(matchValue.Confidence.ToString());
    stringBuilder.Append("Sky Biometry X: ");
    stringBuilder.Append(matchValue.X.ToString());
    stringBuilder.Append("Sky Biometry Y: ");
    stringBuilder.Append(matchValue.Y.ToString());
    stringBuilder.Append("Kinect X: ");
    stringBuilder.Append(Math.Round(skyBiometryX, 2).ToString());
    stringBuilder.Append("Kinect Y: ");
    stringBuilder.Append(Math.Round(skyBioMetryY, 2).ToString());
    FacialRecognitionTextBox.Text = stringBuilder.ToString();
}

_isTakingPicture = false;

And the results are encouraging -> it looks like I can use the X and Y to identify different people on the screen:

Match Value is: 53
Sky Biometry X: 10
Sky Biometry Y: 13.33

Kinect X: 47.5
Kinect Y: 39.79
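Since both points are percentages of the same photo, pairing a skeleton with a face tag should be a nearest-neighbor check. A sketch of the idea in F# (the faceTags list of (uid, x, y) tuples is hypothetical):

// A sketch of matching the Kinect head position to the nearest Sky Biometry
// face tag; both are percentages of the photo, so Euclidean distance works.
let findNearestFace (kinectX:float) (kinectY:float) (faceTags:(string * float * float) list) =
    faceTags
    |> List.map (fun (uid, x, y) ->
        uid, sqrt ((x - kinectX) ** 2. + (y - kinectY) ** 2.))
    |> List.minBy snd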

Up next will be pointing the laser and the target…


Terminator Program: Part 1

I am starting to work on a new Kinect application for TRINUG's code camp.  I wanted to extend the facial recognition application I did using Sky Biometry and have the Kinect identify people in its field of view.  Then, I want to give the verbal command "Terminate XXX", where XXX is the name of a recognized person.  That would activate a couple of servos via a Netduino and point a laser pointer at that person, and perhaps make a blaster sound.  The <ahem> architectural diagram </ahem> looks like this

image

Not really worrying about how far I will get (the fun is in the process, no?), I picked up Rob Miles’s excellent book Start Here: Learn The Kinect API and plugged in my Kinect.

The first thing I did was see if I could get running video from the Kinect -> which was very easy.  I created a new C#/WPF application and replaced the default markup with this:

<Window x:Class="ChickenSoftware.Terminiator.UI.MainWindow"
        xmlns="http://schemas.microsoft.com/winfx/2006/xaml/presentation"
        xmlns:x="http://schemas.microsoft.com/winfx/2006/xaml"
        Title="MainWindow" Height="545" Width="643"
        Loaded="Window_Loaded" Closing="Window_Closing">
    <Grid>
        <Image x:Name="kinectColorImage" Width="640" Height="480" />
    </Grid>
</Window>

And in the code-behind, I added the following code.  The only thing that is kinda tricky is that there are two threads: the main UI thread and the thread that processes the Kinect data.  Interestingly, it is easy to pass data from the Kinect thread to the main UI thread -> just invoke the delegate and pass in the byte array.

Boolean _isKinectDisplayActive = false;
KinectSensor _sensor = null;
WriteableBitmap _videoBitmap = null;

private void Window_Loaded(object sender, RoutedEventArgs e)
{
    SetUpKinect();
    Thread videoThread = new Thread(new ThreadStart(DisplayKinectData));
    _isKinectDisplayActive = true;
    videoThread.Start();
}

private void Window_Closing(object sender, System.ComponentModel.CancelEventArgs e)
{
    _isKinectDisplayActive = false;
}

private void SetUpKinect()
{
    _sensor = KinectSensor.KinectSensors[0];
    _sensor.ColorStream.Enable();
    _sensor.Start();
}

private void DisplayKinectData()
{
    while (_isKinectDisplayActive)
    {
        using (ColorImageFrame colorFrame = _sensor.ColorStream.OpenNextFrame(10))
        {
            if (colorFrame == null) continue;
            var colorData = new byte[colorFrame.PixelDataLength];
            colorFrame.CopyPixelDataTo(colorData);
            Dispatcher.Invoke(new Action(() => UpdateDisplay(colorData)));
        }
    }
    _sensor.Stop();
}

private void UpdateDisplay(byte[] colorData)
{
    if (_videoBitmap == null)
    {
        _videoBitmap = new WriteableBitmap(640, 480, 96, 96, PixelFormats.Bgr32, null);
    }
    _videoBitmap.WritePixels(new Int32Rect(0, 0, 640, 480), colorData, 640 * 4, 0);
    kinectColorImage.Source = _videoBitmap;
}

And I have a live-feed video

image

With that out of the way, I went to add picture-taking capability.  I altered the XAML like so:

<Window x:Class="ChickenSoftware.Terminiator.UI.MainWindow"
        xmlns="http://schemas.microsoft.com/winfx/2006/xaml/presentation"
        xmlns:x="http://schemas.microsoft.com/winfx/2006/xaml"
        Title="MainWindow" Height="545" Width="643"
        Loaded="Window_Loaded" Closing="Window_Closing">
    <Grid>
        <Image x:Name="kinectColorImage" Width="640" Height="480" />
        <Button x:Name="takePhotoButton" Margin="0,466,435,10" Click="takePhotoButton_Click">Take Photo</Button>
    </Grid>
</Window>

And added this to the code behind:

Boolean _isTakingPicture = false;
BitmapSource _pictureBitmap = null;

private void takePhotoButton_Click(object sender, RoutedEventArgs e)
{
    _isTakingPicture = true;
    SaveFileDialog dialog = new SaveFileDialog();
    dialog.FileName = "Snapshot";
    dialog.DefaultExt = ".jpg";
    dialog.Filter = "Pictures (.jpg)|*.jpg";

    if (dialog.ShowDialog() == true)
    {
        String fileName = dialog.FileName;
        using (FileStream fileStream = new FileStream(fileName, FileMode.Create))
        {
            JpegBitmapEncoder encoder = new JpegBitmapEncoder();
            encoder.Frames.Add(BitmapFrame.Create(_pictureBitmap));
            encoder.Save(fileStream);
        }
    }
}

 

And altered the DisplayKinectData method to poll the _isTakingPicture flag:

private void DisplayKinectData()
{
    while (_isKinectDisplayActive)
    {
        using (ColorImageFrame colorFrame = _sensor.ColorStream.OpenNextFrame(10))
        {
            if (colorFrame == null) continue;
            var colorData = new byte[colorFrame.PixelDataLength];
            colorFrame.CopyPixelDataTo(colorData);
            Dispatcher.Invoke(new Action(() => UpdateDisplay(colorData)));

            if (_isTakingPicture)
            {
                Dispatcher.Invoke(new Action(() => SavePhoto(colorData)));
            }
        }
    }
    _sensor.Stop();
}

And now I have screen capture ability.

image

With that out of the way, I needed a way of identifying the people in the Kinect's field of vision and taking their pictures individually.  I altered the XAML like so:

<Window x:Class="ChickenSoftware.Terminiator.UI.MainWindow"
        xmlns="http://schemas.microsoft.com/winfx/2006/xaml/presentation"
        xmlns:x="http://schemas.microsoft.com/winfx/2006/xaml"
        Title="MainWindow" Height="545" Width="643"
        Loaded="Window_Loaded" Closing="Window_Closing">
    <Grid>
        <Image x:Name="kinectColorImage" Width="640" Height="480" />
        <Button x:Name="takePhotoButton" Margin="0,466,435,10" Click="takePhotoButton_Click">Take Photo</Button>
        <Canvas x:Name="skeletonCanvas" Width="640" Height="480" />
        <TextBox x:Name="skeletonInfoTextBox" Margin="205,466,10,10" />
    </Grid>
</Window>

And altered the Setup method like so:

private void SetUpKinect()
{
    _sensor = KinectSensor.KinectSensors[0];
    _sensor.ColorStream.Enable();
    _sensor.SkeletonStream.Enable();
    _sensor.Start();
}

And then I altered the UpdateDisplay method to take in both the color byte array and the skeleton array and display the head and skeleton locations.  Note that there is a built-in function called MapSkeletonPointToColorPoint(), which takes the skeleton coordinate position and translates it to the color coordinate position.  I know that it is needed, but I have no idea how it works -> magic, I guess.  (Under the hood, it projects the 3D skeleton point, measured in meters, into 2D color-frame pixel coordinates using the sensor's calibration.)

private void UpdateDisplay(byte[] colorData, Skeleton[] skeletons)
{
    if (_videoBitmap == null)
    {
        _videoBitmap = new WriteableBitmap(640, 480, 96, 96, PixelFormats.Bgr32, null);
    }
    _videoBitmap.WritePixels(new Int32Rect(0, 0, 640, 480), colorData, 640 * 4, 0);
    kinectColorImage.Source = _videoBitmap;
    var selectedSkeleton = skeletons.FirstOrDefault(s => s.TrackingState == SkeletonTrackingState.Tracked);
    if (selectedSkeleton != null)
    {
        var headPosition = selectedSkeleton.Joints[JointType.Head].Position;
        var adjustedHeadPosition =
            _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(headPosition, ColorImageFormat.RgbResolution640x480Fps30);
        var adjustedSkeletonPosition = _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(selectedSkeleton.Position, ColorImageFormat.RgbResolution640x480Fps30);

        String skeletonInfo = headPosition.X.ToString() + " : " + headPosition.Y.ToString() + " -- ";
        skeletonInfo = skeletonInfo + adjustedHeadPosition.X.ToString() + " : " + adjustedHeadPosition.Y.ToString() + " -- ";
        skeletonInfo = skeletonInfo + adjustedSkeletonPosition.X.ToString() + " : " + adjustedSkeletonPosition.Y.ToString();

        skeletonInfoTextBox.Text = skeletonInfo;
    }
}

And the invocation of UpdateDisplay now looks like this:

private void DisplayKinectData()
{
    while (_isKinectDisplayActive)
    {
        using (ColorImageFrame colorFrame = _sensor.ColorStream.OpenNextFrame(10))
        {
            if (colorFrame == null) continue;
            using (SkeletonFrame skeletonFrame = _sensor.SkeletonStream.OpenNextFrame(10))
            {
                if (skeletonFrame == null) continue;

                var colorData = new byte[colorFrame.PixelDataLength];
                var skeletons = new Skeleton[skeletonFrame.SkeletonArrayLength];

                colorFrame.CopyPixelDataTo(colorData);
                skeletonFrame.CopySkeletonDataTo(skeletons);

                if (_isTakingPicture)
                {
                    Dispatcher.Invoke(new Action(() => SavePhoto(colorData)));
                }
                Dispatcher.Invoke(new Action(() => UpdateDisplay(colorData, skeletons)));
            }
        }
    }
    _sensor.Stop();
}

And the results are what you expect:

image

With the ability to identify individuals, I then wanted to take individual photos of each person and feed them to Sky Biometry.  To that end, I added a method to draw a rectangle around each person and then (somehow) take a snapshot of the contents within the rectangle.  Drawing the rectangle was a straightforward WPF exercise:

private void DrawBoxAroundHead(Skeleton selectedSkeleton)
{
    skeletonCanvas.Children.Clear();
    var headPosition = selectedSkeleton.Joints[JointType.Head].Position;
    var shoulderCenterPosition = selectedSkeleton.Joints[JointType.ShoulderCenter].Position;

    var adjustedHeadPosition =
        _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(headPosition, ColorImageFormat.RgbResolution640x480Fps30);
    var adjustedShoulderCenterPosition =
        _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(shoulderCenterPosition, ColorImageFormat.RgbResolution640x480Fps30);
    var delta = adjustedHeadPosition.Y - adjustedShoulderCenterPosition.Y;
    var centerX = adjustedHeadPosition.X;
    var centerY = adjustedHeadPosition.Y;

    Line topLine = new Line();
    topLine.Stroke = new SolidColorBrush(Colors.Red);
    topLine.StrokeThickness = 5;
    topLine.X1 = centerX + (delta * -1);
    topLine.Y1 = centerY - (delta * -1);
    topLine.X2 = centerX + delta;
    topLine.Y2 = centerY - (delta * -1);
    skeletonCanvas.Children.Add(topLine);

    Line bottomLine = new Line();
    bottomLine.Stroke = new SolidColorBrush(Colors.Red);
    bottomLine.StrokeThickness = 5;
    bottomLine.X1 = centerX + (delta * -1);
    bottomLine.Y1 = centerY + (delta * -1);
    bottomLine.X2 = centerX + delta;
    bottomLine.Y2 = centerY + (delta * -1);
    skeletonCanvas.Children.Add(bottomLine);

    Line rightLine = new Line();
    rightLine.Stroke = new SolidColorBrush(Colors.Red);
    rightLine.StrokeThickness = 5;
    rightLine.X1 = centerX + (delta * -1);
    rightLine.Y1 = centerY - (delta * -1);
    rightLine.X2 = centerX + (delta * -1);
    rightLine.Y2 = centerY + (delta * -1);
    skeletonCanvas.Children.Add(rightLine);

    Line leftLine = new Line();
    leftLine.Stroke = new SolidColorBrush(Colors.Red);
    leftLine.StrokeThickness = 5;
    leftLine.X1 = centerX + delta;
    leftLine.Y1 = centerY - (delta * -1);
    leftLine.X2 = centerX + delta;
    leftLine.Y2 = centerY + (delta * -1);
    skeletonCanvas.Children.Add(leftLine);
}

And then adding that call in UpdateDisplay:

if (selectedSkeleton != null)
{
    var headPosition = selectedSkeleton.Joints[JointType.Head].Position;
    var adjustedHeadPosition =
        _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(headPosition, ColorImageFormat.RgbResolution640x480Fps30);
    var adjustedSkeletonPosition = _sensor.CoordinateMapper.MapSkeletonPointToColorPoint(selectedSkeleton.Position, ColorImageFormat.RgbResolution640x480Fps30);

    DrawBoxAroundHead(selectedSkeleton);

    String skeletonInfo = headPosition.X.ToString() + " : " + headPosition.Y.ToString() + " -- ";
    skeletonInfo = skeletonInfo + adjustedHeadPosition.X.ToString() + " : " + adjustedHeadPosition.Y.ToString() + " -- ";
    skeletonInfo = skeletonInfo + adjustedSkeletonPosition.X.ToString() + " : " + adjustedSkeletonPosition.Y.ToString();

    skeletonInfoTextBox.Text = skeletonInfo;
}

Gives me this:

image

Which is great, but now I am stuck.  I need a way of isolating the contents of that rectangle in the byte array that I am feeding to the bitmap encoder, and I don't know how to trim the array.  Instead of trying to learn any more WPF and graphics programming, I decided to take a different tack and send the photograph in its entirety to Sky Biometry and let it figure out the people in the photograph.  How I did that is the subject of my next blog post…
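For anyone who does want to trim the array, it amounts to copying the rectangle out row by row.  Here is a rough, untested sketch; the 640x480 Bgr32 frame layout and the rectangle are assumptions on my part:

// Untested sketch: copy the pixels inside a rectangle out of a 640x480
// Bgr32 frame (4 bytes per pixel) into a smaller array. The rectangle is
// assumed to lie entirely within the frame. (Int32Rect is from System.Windows.)
private static byte[] CropColorData(byte[] source, int frameWidth, Int32Rect rect)
{
    const int bytesPerPixel = 4; // Bgr32
    var cropped = new byte[rect.Width * rect.Height * bytesPerPixel];
    for (int row = 0; row < rect.Height; row++)
    {
        int sourceIndex = ((rect.Y + row) * frameWidth + rect.X) * bytesPerPixel;
        int targetIndex = row * rect.Width * bytesPerPixel;
        Array.Copy(source, sourceIndex, cropped, targetIndex, rect.Width * bytesPerPixel);
    }
    return cropped;
}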
Programming the Kinect

I recently went back to programming the Kinect for a little off-base fun.  I wanted to see how hard it would be to move the Kinect up and down and then capture a picture.  Forgoing TDD, I created a WPF project that referenced the Kinect SDK.

I then added some buttons to control the up/down of the Kinect and a button to take a picture of the streaming image.  I threw in two image controls: one to show the streaming image and one to show the still picture that was taken.

image

Here is the designer for those of you who are XAML-Impaired:

image

I then wired up the code behind to stream the images:

public MainWindow()
{
    InitializeComponent();          
    kinectSensor.DepthStream.Enable();
    kinectSensor.ColorStream.Enable();
    kinectSensor.AllFramesReady += new EventHandler<AllFramesReadyEventArgs>(kinectSensor_AllFramesReady);
    kinectSensor.Start();
}
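One thing the snippet doesn't show is where kinectSensor comes from; presumably it is a field that grabs the first connected sensor, something like this (an assumption on my part, along with the takePicture flag used later):

// Assumed field declarations (not shown in the post): grab the first
// connected sensor and declare the take-picture flag.
// (FirstOrDefault requires a using System.Linq; directive.)
private readonly KinectSensor kinectSensor =
    KinectSensor.KinectSensors.FirstOrDefault(s => s.Status == KinectStatus.Connected);
private bool takePicture = false;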

void kinectSensor_AllFramesReady(object sender, AllFramesReadyEventArgs e)
{
    // dispose the frame when we are done with it so the stream doesn't stall
    using (ColorImageFrame currentFrame = e.OpenColorImageFrame())
    {
        if (currentFrame != null)
        {
            byte[] pixelData = new byte[currentFrame.PixelDataLength];
            currentFrame.CopyPixelDataTo(pixelData);
            BitmapSource bitMapSource = BitmapImage.Create(currentFrame.Width,
                currentFrame.Height, 96, 96, PixelFormats.Bgr32, null,
                pixelData, currentFrame.Width * currentFrame.BytesPerPixel);
            this.streamingVideoImage.Source = bitMapSource;
        }
    }
}

And here is the output:

image

I then added in the ability to take a picture:

void kinectSensor_AllFramesReady(object sender, AllFramesReadyEventArgs e)
{
    // Same code as before, then:
    this.streamingVideoImage.Source = bitMapSource;
    if (takePicture)
    {
        this.takePictureImage.Source = bitMapSource;
        takePicture = false;
    }
}

And here is the output:

image
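The click handler that flips the flag isn't shown; it is presumably just this (the button name is assumed):

// Assumed handler for the take-picture button: set the flag and let the
// next AllFramesReady callback capture the frame.
private void takePictureButton_Click(object sender, RoutedEventArgs e)
{
    takePicture = true;
}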

Feelin’ good, I then went to add the ability to move the Kinect up and down.  I read this article, whose point of emphasis is that you shouldn’t run the motor often or continuously.  I thought of moving the sensor in steps of 5 degrees at a time like so:

private void upButton_Click(object sender, RoutedEventArgs e)
{
    
    if (kinectSensor.ElevationAngle < kinectSensor.MaxElevationAngle - 5)
    {
        kinectSensor.ElevationAngle = kinectSensor.ElevationAngle + 5;
    }
}

With a complementary function for down.  Sure enough, it worked.

imageimageimage

The only thing I don’t like is that the picture freezes during the adjustment.  In addition, if you follow the MSDN article, where you should throw in a Thread.Sleep(1000) after each change, the effect is kinda hokey.  I then thought about putting in an image like this image while the camera is adjusting and the thread sleeps, just so the user knows that the camera is adjusting.  I whipped up a function like this:

private void downButton_Click(object sender, RoutedEventArgs e)
{
    
    if (kinectSensor.ElevationAngle > kinectSensor.MinElevationAngle + 5)
    {
        kinectSensor.ElevationAngle = kinectSensor.ElevationAngle - 5;
    }
    XOutTheStreamingImage();
    Thread.Sleep(1000);
}
private void XOutTheStreamingImage()
{
    this.InvalidateVisual();
    BitmapImage xImage = new BitmapImage();
    xImage.BeginInit();
    xImage.UriSource = new Uri(@"C:\Users\Jamie\Documents\Visual Studio 2010\Projects\Tff.KinectExample_Solution\Tff.KinectExample\X.PNG");
    xImage.EndInit();
    this.streamingVideoImage.Source = xImage;
    this.InvalidateVisual();
}
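As an aside, that hard-coded path would break on any machine but mine; if X.PNG were added to the project as a Resource, a pack URI would be more portable (an aside, not what the post does):

// Assumes X.PNG is in the project with Build Action = Resource
xImage.UriSource = new Uri("pack://application:,,,/X.PNG");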

One problem is that it doesn’t work.  A larger problem is that I don’t know why.  The image never shows, though the thread is sleeping.  I then put the XOutTheStreamingImage call before the sensor change.  Nope.  I then detached the event handler and then reattached it:

private void upButton_Click(object sender, RoutedEventArgs e)
{
    kinectSensor.AllFramesReady -= new EventHandler<AllFramesReadyEventArgs>(kinectSensor_AllFramesReady);
    XOutTheStreamingImage();
    if (kinectSensor.ElevationAngle < kinectSensor.MaxElevationAngle - 5)
    {
        kinectSensor.ElevationAngle = kinectSensor.ElevationAngle + 5;
    }
    Thread.Sleep(1000);
    kinectSensor.AllFramesReady += new EventHandler<AllFramesReadyEventArgs>(kinectSensor_AllFramesReady);
}

Nope.  I then detached it and left it detached.  It then kinda worked.  If I hit up/down a couple of times, I got it to work:

image

This kind of random behavior smells like a multi-threading issue.  If the AllFramesReady event fires on a different thread, then the X image would only show for 1/6 of a second.  If the elevation angle change also happens on a different thread, then the sleep wouldn’t matter.  I fired up ILSpy and, sure enough, check out the locks:

image

And after searching for “Thread”, I found this:

image

Sure enough, the frame is processed on a different thread.  And check out the Initialize method:

image

So it looks like I can’t coordinate my X image between the elevation change and the frameReady event.  Instead, I can unhook the event handler and show the image:

kinectSensor.AllFramesReady -= new EventHandler<AllFramesReadyEventArgs>(kinectSensor_AllFramesReady);
XOutTheStreamingImage();
if (kinectSensor.ElevationAngle < kinectSensor.MaxElevationAngle - 5)
{
    kinectSensor.ElevationAngle = kinectSensor.ElevationAngle + 5;
}

Sure enough, that works, but the X only flashes for an instant.  I then tried displaying the X for a second after the screen is invalidated:
private void XOutTheStreamingImage()
{
    this.InvalidateVisual();
    BitmapImage xImage = new BitmapImage();
    xImage.BeginInit();
    xImage.UriSource = new Uri(@"C:\Users\Jamie\Documents\Visual Studio 2010\Projects\Tff.KinectExample_Solution\Tff.KinectExample\X.PNG");
    xImage.EndInit();
    this.streamingVideoImage.Source = xImage;
    this.InvalidateVisual();
    Thread.Sleep(1000);
}

However, that doesn’t work.  The thread sleeps BEFORE the screen refreshes, so I am stuck with the last image from the Kinect.  The way the API is set up, I am thinking there is no way to:

  • Stop the Kinect from capturing images
  • Update the screen
  • Move the Kinect angle
  • Start the Kinect capturing images again

One idea that might sidestep the problem is sketched below.
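The only workaround I can think of (untested, so take it as a sketch) is to never block the UI thread at all: swap in the X image, move the motor, and use a DispatcherTimer to re-attach the frame handler a second later, giving WPF a chance to render in between:

// Untested sketch: avoid Thread.Sleep so the UI thread stays free to render
// the X image; a DispatcherTimer (System.Windows.Threading) re-attaches the
// frame handler after one second.
private void upButton_Click(object sender, RoutedEventArgs e)
{
    kinectSensor.AllFramesReady -= kinectSensor_AllFramesReady;
    XOutTheStreamingImage();
    if (kinectSensor.ElevationAngle < kinectSensor.MaxElevationAngle - 5)
    {
        kinectSensor.ElevationAngle = kinectSensor.ElevationAngle + 5;
    }
    var timer = new DispatcherTimer { Interval = TimeSpan.FromSeconds(1) };
    timer.Tick += (s, args) =>
    {
        timer.Stop();
        kinectSensor.AllFramesReady += kinectSensor_AllFramesReady;
    };
    timer.Start();
}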

I think I need my thinking chair…

Kinect SDK

I purchased a Kinect last week so that I could start messing around with its API.

1) There are two versions of the Kinect: the XBOX 360 one and the Windows one.  The only difference between the two that I could gather is that the Windows one comes with the SDK pre-loaded and allows you to distribute your software commercially.  Since I am just a hobbyist, I went with the XBOX one, which is $100 cheaper.

2) The Kinect for the XBOX 360 requires an additional power cord to connect to your computer.  You don’t need to buy the cord separately, though, as it comes included.  I made that mistake (and compounded it by buying from the Microsoft store at a premium).

3) There are a couple of different SDKs floating around out there: the 1.0 SDK and the 1.5 SDK.  You will want to use the 1.5 one (because newer is always better), and there is a HUGE difference in the APIs between the two versions, to the point that anything you wrote against 1.0 is useless.

4) I started digging into programming the Kinect with this book.  After reading the SDK samples and documentation, though, the book really isn’t necessary.  The SDK is really well documented and is probably the best place to start learning about the technology.

5) Once I dove into programming the Kinect, I realized that this is no small task.  For C#, the amount of code you need to write and its complexity are higher than in any other Microsoft technology I have seen.  You will need to know about bit shifts, the low-level details of graphics classes, and advanced data structures.  For example, here is a snippet from the Kinect Explorer solution:

// Converts a 16-bit grayscale depth frame which includes player indexes into a 32-bit frame
// that displays different players in different colors
private void ConvertDepthFrame(short[] depthFrame, DepthImageStream depthStream)
{
    int tooNearDepth = depthStream.TooNearDepth;
    int tooFarDepth = depthStream.TooFarDepth;
    int unknownDepth = depthStream.UnknownDepth;

    // Test that the buffer lengths are appropriately correlated, which allows us to use only one
    // value as the loop condition.
    if ((depthFrame.Length * 4) != this.depthFrame32.Length)
    {
        throw new InvalidOperationException();
    }

    for (int i16 = 0, i32 = 0; i32 < this.depthFrame32.Length; i16++, i32 += 4)
    {
        int player = depthFrame[i16] & DepthImageFrame.PlayerIndexBitmask;
        int realDepth = depthFrame[i16] >> DepthImageFrame.PlayerIndexBitmaskWidth;
        
        if (player == 0 && realDepth == tooNearDepth)
        {
            // white 
            this.depthFrame32[i32 + RedIndex] = 255;
            this.depthFrame32[i32 + GreenIndex] = 255;
            this.depthFrame32[i32 + BlueIndex] = 255;
        }
        else if (player == 0 && realDepth == tooFarDepth)
        {
            // dark purple
            this.depthFrame32[i32 + RedIndex] = 66;
            this.depthFrame32[i32 + GreenIndex] = 0;
            this.depthFrame32[i32 + BlueIndex] = 66;
        }
        else if (player == 0 && realDepth == unknownDepth)
        {
            // dark brown
            this.depthFrame32[i32 + RedIndex] = 66;
            this.depthFrame32[i32 + GreenIndex] = 66;
            this.depthFrame32[i32 + BlueIndex] = 33;
        }
        else
        {
            // transform 13-bit depth information into an 8-bit intensity appropriate
            // for display (we disregard information in most significant bit)
            byte intensity = (byte)(~(realDepth >> 4));

            // tint the intensity by dividing by per-player values
            this.depthFrame32[i32 + RedIndex] = (byte)(intensity >> IntensityShiftByPlayerR[player]);
            this.depthFrame32[i32 + GreenIndex] = (byte)(intensity >> IntensityShiftByPlayerG[player]);
            this.depthFrame32[i32 + BlueIndex] = (byte)(intensity >> IntensityShiftByPlayerB[player]);
        }
    }
}
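To unpack the bit twiddling above: each 16-bit depth value packs a player index into its low bits, with the actual depth shifted above it.  A toy example using the SDK’s bitmask constants (the sample values are made up):

// Toy illustration of the packed depth format: the low bits hold the player
// index, the remaining bits hold the depth.
short packed = (short)((1200 << DepthImageFrame.PlayerIndexBitmaskWidth) | 2);
int player = packed & DepthImageFrame.PlayerIndexBitmask;          // 2
int realDepth = packed >> DepthImageFrame.PlayerIndexBitmaskWidth; // 1200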

My goal is to have enough to work with to present at TriNug’s code camp in November.  That might be a stretch…