Posted onOct 22, 2020 • Originally published ataaron-powell.com onOct 21, 2020

Building a Video Chat App, Part 2 - Accessing Cameras

#javascript #azure #webdev

On myTwitch channel we’re continuing to build our video chat application onAzure Communication Services (ACS).

For today’s post, we’re going to look at the next major milestone, accessing your camera and microphone.

How Browsers Access Devices

We're going to use the ACS SDK to do this, but before we get there let's first understand how we access cameras and microphones in the browser. Browsers have had this functionality for a while now, it came about as a need for the WebRTC specification, since that allows you to do what we're doing, run a video stream through the browser, and it works using thenavigator.mediaDevices API which replacednavigator.getUserMedia.

This API is promised based, so it works nicely withasync/await, and will return us theMediaStream available to the browser.

There is a catch though, the user has to consent to providing access to the devices, which makes sense as you don't want any random website to be able to access your camera and mic without you knowing about it, do you? The user will see a prompt like so:

In "raw JavaScript" we'd write something like this:

navigator.mediaDevices.getUserMedia({audio:true,video:true}).then(function(stream){/* use the stream */}).catch(function(err){/* handle the error */});

If the user denies the request then thecatch of the promise is triggered (or if they've previously denied it), otherwise you'll end up in theMediaStream for the camera/mic they have selected. TheMediaStream can be provided to a<video> element and you can look at yourself.

Accessing Devices with ACS

Now that we understand the fundamentals, let's look at how we use this in the ACS SDK to get one step closer to establishing out video call.

We'll need to add some npm packages to our UI:

npminstall--save @azure/communication-calling @azure/communication-common

With these packages, we're going to need four APIs,AzureCommunicationUserCredential,CallClient,CallAgent andDeviceManager.

To make the important parts of this available throughout our application, we're going to create aReact Context to hold it, so let's get started with that.

Defining Our Context

Let's create a file calleduseCallingContext.tsx since we'll have the context in there as well as a hook to access context, and define our context:

import{AudioDeviceInfo,CallAgent,CallClient,DeviceManager,VideoDeviceInfo}from"@azure/communication-calling";import{AzureCommunicationUserCredential}from"@azure/communication-common";importReact,{useState,useEffect,useContext}from"react";importuseTokenfrom"./useToken";exporttypeCallingProps={micList?:AudioDeviceInfo[];cameraList?:VideoDeviceInfo[];callAgent?:CallAgent;deviceManager?:DeviceManager;};constCallingContext=React.createContext<CallingProps>({});

The context will have available on it the list of cameras and mics, along with theCallAgent andDeviceManager instances since they will be useful later.

Since the logic to setup all the data available on the context only happens once, we'll implement the context provider within this file to, so let's do that.

exportconstCallingContextProvider=(props:{children:React.ReactNode;})=>{return(<CallingContext.Providervalue={/* todo */}>{props.children}</CallingContext.Provider>);};

Lastly, we'll expose a hook to make it easy to access the context elsewhere in the application:

exportconstuseCallingContext=()=>useContext(CallingContext);

Great, we're now ready to implement the context provider.

Implementing the Context Provider

The context provider here is key, as it's the thing that'll be responsible for getting the devices and making them available elsewhere in our application, and for that we're going to need some local state.

exportconstCallingContextProvider=(props:{children:React.ReactNode;})=>{consttoken=useToken();const[,setClient]=useState<CallClient>();const[callAgent,setCallAgent]=useState<CallAgent>();const[deviceManager,setDeviceManager]=useState<DeviceManager>();const[cameraList,setCameraList]=useState<VideoDeviceInfo[]>();const[micList,setMicList]=useState<AudioDeviceInfo[]>();

We're going to need the token that is generated for the user in}}">Part 1, and we're doing that through a custom hook:

import{useState,useEffect}from"react";exporttypeTokenResponse={token:string;expiresOn:Date;communicationUserId:string;};constuseToken=()=>{const[token,setToken]=useState("");useEffect(()=>{construn=async()=>{constres=awaitfetch("/api/issueToken");consttokenResponse:TokenResponse=awaitres.json();setToken(tokenResponse.token);};run();},[]);returntoken;};exportdefaultuseToken;

Then we've got some more state for the different parts of the ACS SDK that we're going to expose, except for theCallClient which we only need to establish the other parts of the API.

We'll use an effect hook to set this up, that'll be triggered when the token is available to us:

useEffect(()=>{construn=async(callClient:CallClient,token:string)=>{consttokenCredential=newAzureCommunicationUserCredential(token);letcallAgent:CallAgent|undefined=undefined;try{callAgent=awaitcallClient.createCallAgent(tokenCredential);constdeviceManager=awaitcallClient.getDeviceManager();constresult=awaitdeviceManager.askDevicePermission(true,true);if(result.audio){setMicList(deviceManager.getMicrophoneList());}if(result.video){setCameraList(deviceManager.getCameraList());}setCallAgent(callAgent);setDeviceManager(deviceManager);}catch{if(callAgent){callAgent.dispose();}}};if(token){constcallClient=newCallClient();setClient(callClient);run(callClient,token);}},[token]);

Ok, that's a lot of code, let's break it down piece by piece, starting at the bottom:

if(token){constcallClient=newCallClient();setClient(callClient);run(callClient,token);}

This is a check to make sure that the user token has been issued, and once it has been we're going to call anasync function (run), because an effect hook can't take an async function directly, and therun function is really where things happen.

First off, this function is going to create the credentials for ACS from the token provided:

consttokenCredential=newAzureCommunicationUserCredential(token);

Next, we'll setup atry/catch block to access the devices, and remember that the reason we'd do it this way is so that if the user declines the request to access devices, we can gracefully handle the error (theasync/await unwraps a promisescatch into thecatch of thetry/catch block).

We'll create thecallAgent using the credentials:

callAgent=awaitcallClient.createCallAgent(tokenCredential);

We're notactually using thecallAgent yet, it's what we use to connect to calls, but we need to create an instance of itbefore we access theDeviceManager.I'm unclear as towhy it's this way, and it's something I'm going to raise with the ACS team.

With ourcallAgent created, it's now time to access theDeviceManager, which will give us all the devices:

constdeviceManager=awaitcallClient.getDeviceManager();constresult=awaitdeviceManager.askDevicePermission(true,true);if(result.audio){setMicList(deviceManager.getMicrophoneList());}if(result.video){setCameraList(deviceManager.getCameraList());}

From thedeviceManager, which we get fromcallClient.getDeviceManager, we need to request permissions from the user to access their device list usingaskDevicePermissions. This method takes two arguments, whether you want audio and video access, and for our case we do. Assuming the user grants permissions, we can then usedeviceManager.getMicrophoneList anddeviceManager.getCameraList to get arrays ofAudioDeviceInfo andVideoDeviceInfo that we can present to the user for their selection.

This is the same as if you were to call theenumerateDevices method fromMediaDevices, but the SDK takes the liberty of splitting the enumerated devices into their appropriate types. What's important to know about this is that youmust callaskDevicePermissions first, otherwise you'll get an array with a single unknown device. That's becauseenumerateDevices, which is what's used internally by the SDK, accesses the available deviceswithout prompting for consent and if consent hasn't been provided, you can't get the devices.

Conclusion

Our React context is all ready for integration into the application. We've learnt how to get started using the ACS SDK and itsDeviceManager to request permission for the devices and then display the full list of them.

If you want to catch up on the whole episode, as well as look at how we integrate this into the overall React application, you can catch the recording onYouTube, along with thefull playlist