I am relatively new to Media Foundation and have just finished building my first single-webcam streaming application by following 'Developing Microsoft Media Foundation Applications'. I now have multiple webcam sources, and the number of webcams can change at run time. I need to combine them with the video mixer into a single composite video stream, much like multiple live CCTV feeds shown on one screen. At the same time, I need to be able to access each individual video stream and highlight it, bring it into focus, or zoom in on it when the cursor hovers over it (that is, listen for mouse events per video stream).
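To make the layout concrete, here is a rough sketch of what I have in mind, not working Media Foundation code. `NormRect`, `GridLayout`, and `HitTest` are names I made up for illustration. My assumption is that rectangles like these are what would be handed to the mixer per stream (possibly via something like `IMFVideoMixerControl::SetStreamOutputRect`, which takes a normalized rectangle), but I am not sure that is the right approach.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a normalized rectangle; all values are in [0, 1]
// relative to the composite output.
struct NormRect {
    float left, top, right, bottom;
};

// Lay out `count` streams in a near-square grid, filled row by row.
std::vector<NormRect> GridLayout(int count) {
    std::vector<NormRect> rects;
    if (count <= 0) return rects;

    const int cols = static_cast<int>(std::ceil(std::sqrt(static_cast<double>(count))));
    const int rows = (count + cols - 1) / cols;  // enough rows to hold every stream

    for (int i = 0; i < count; ++i) {
        const int r = i / cols;
        const int c = i % cols;
        rects.push_back({static_cast<float>(c) / cols,
                         static_cast<float>(r) / rows,
                         static_cast<float>(c + 1) / cols,
                         static_cast<float>(r + 1) / rows});
    }
    return rects;
}

// Return the index of the stream under the normalized cursor position,
// or -1 if the cursor is over an empty cell or outside the output.
int HitTest(const std::vector<NormRect>& rects, float x, float y) {
    for (std::size_t i = 0; i < rects.size(); ++i) {
        const NormRect& rc = rects[i];
        if (x >= rc.left && x < rc.right && y >= rc.top && y < rc.bottom) {
            return static_cast<int>(i);
        }
    }
    return -1;
}
```

The idea is that when a webcam is added or removed I would recompute the grid, and on a mouse move I would divide the cursor position by the window's client size and call `HitTest` to find which stream to highlight or zoom.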
I need a set of guidelines for working with the media session, topology, video mixer, and video presenter to implement the functionality above. Any help is greatly appreciated. I am not looking to use DirectX for this; I would like to stay within the native Media Foundation architecture.