The present document relates to an electronic device, in particular to a docking station and/or to a smart speaker, which may be used within a household, notably within a kitchen, to provide support regarding a household task such as cooking.
An electronic device, such as a smart speaker and/or a docking station, may be used within a household, notably within a kitchen, to provide support for a household task, such as cooking, to a user of the electronic device. The user may use the electronic device for playing back an audio signal (e.g., music or audio instructions for a recipe or an answer from a voice assistant).
The electronic device may be placed on the worktop of a kitchen, next to a user working (e.g., cooking) at the worktop. The present document addresses the technical problem of enabling a particularly comfortable and reliable user-interaction with the electronic device. The technical problem is solved by the independent claim. Preferred examples are described in the dependent claims.
According to an aspect, an electronic device, notably a docking station configured to dock a user device and/or a smart speaker, is described. The electronic device may be configured to be converted from a smart speaker into a smart display by docking a user device, such as a tablet or smartphone. The electronic device may be designed to be placed on a table or a worktop within a kitchen. Furthermore, the electronic device may be configured to hold a DIN A4 or DIN A5 sized user device (e.g., with a display size of up to 11 inches or 12 inches). For this purpose, the electronic device may comprise a back support.
The electronic device typically comprises at least one active acoustic speaker driver (notably a loudspeaker) which is configured to render an audio signal. The speaker driver may be located in an acoustic chamber within a main body of the electronic device. The acoustic chamber and/or the main body may have a cylindrical form. The main body may exhibit a longitudinal axis which may be oriented in parallel to the ground (e.g., a worktop) onto which the electronic device is placed. The main body may e.g., have a diameter between 4cm and 8cm. Furthermore, the main body may e.g., have a length along the longitudinal axis between 10cm and 25cm.
In addition, the electronic device may comprise one or more feet extending downwards from the main body and enabling the main body to be placed on the ground (e.g., a worktop). The one or more (panel-shaped) feet may extend along the longitudinal axis of the main body (e.g., from one end to the other end of the main body). The electronic device may exhibit a front foot at a front side of the electronic device (facing the user of the electronic device, when the user is using the electronic device). Alternatively, or in addition, the electronic device may exhibit a rear foot (at the rear side of the electronic device). By providing one or more feet, the electronic device may be placed on a ground in a stable manner.
The one or more feet may be configured to hold the main body elevated at an elevation distance from the ground. The elevation distance may e.g., be between 1cm and 15cm. By positioning the main body (which comprises one or more speaker drivers, e.g., loudspeakers) at a certain elevation distance from the ground, an electronic device having a high audio quality and intelligibility may be provided. Furthermore, by holding the main body elevated from the ground, the main body may be protected from liquids on the ground in an efficient and reliable manner.
As indicated above, the (elevated) main body may comprise one or more (active or passive) speaker drivers for rendering an audio signal. The one or more speaker drivers are preferably oriented at least partially towards the ground. In particular, the one or more speaker drivers may be arranged such that a main emission direction of an audio signal that is emitted by the one or more speaker drivers is oriented towards the ground. By way of example, the main emission direction may form an angle with ground having a magnitude which is greater than 0°, notably greater than 20°.
By providing an electronic device having an elevated main body with one or more speaker drivers that are oriented towards the ground that the electronic device is standing on, the quality and the intelligibility of emitted audio signals may be increased, thereby increasing the comfort-of-use of the electronic device.
As indicated above, the electronic device may comprise a back support. The back support may extend upwards from the main body, enabling a user device to be placed in conjunction with and/or onto the electronic device. The back support may exhibit an angle between 45° and 75° relative to the ground that the electronic device is standing on. The back support may form a flat backplane which is at least partially in contact with the backside of a (flat) user device that is placed onto the electronic device. In other words, the back support may hold a user device from the back.
The electronic device may be designed such that the back support can be detached from the main body or attached to the main body by a user of the electronic device (e.g., using one or two hands). In particular, detaching and/or attaching may be performed without using a tool. As a result of this, a particularly comfortable electronic device may be provided.
Hence, an electronic device is described which comprises a (cylindrical) main body with an acoustic chamber with one or more speaker drivers for rendering an audio signal. The main body further comprises one or more microphones configured to capture a microphone signal.
The electronic device may comprise a control unit which is configured to analyze the microphone signal captured by the one or more microphones in order to detect a voice command for controlling the electronic device and/or for controlling the user device placed in conjunction with the electronic device. Furthermore, the control unit may be configured to cause an audio signal to be rendered by the one or more speaker drivers. Hence, the electronic device may be configured to perform audio rendering and capturing of voice commands in parallel.
The electronic device comprises one or more means for at least partially isolating the one or more microphones from vibrations of the acoustic chamber (caused during rendering of an audio signal by the one or more speaker drivers). By doing this, the quality of capturing microphone signals may be improved, thereby improving the reliability of voice control of the electronic device.
The main body may comprise an outer housing which encloses the acoustic chamber. The outer housing may have a (circular) cylindrical shape. Furthermore, the outer housing may comprise a speaker grill covering the one or more speaker drivers (which are placed within the acoustic chamber).
The outer housing may be spatially separated from the acoustic chamber via a mid-chamber, and the mid-chamber may be configured to isolate vibrations of the acoustic chamber from the outer housing. For this purpose, the mid-chamber may form a gap of 1 mm or more between the acoustic chamber and the outer housing. Alternatively, or in addition, the mid-chamber may comprise and/or may be filled with a vibration isolating material (e.g., a foam). By separating the outer housing and the acoustic chamber via a mid-chamber, vibrations of the acoustic chamber may be isolated in a particularly reliable manner.
The one or more microphones are preferably mechanically attached to the outer housing. In particular, the one or more microphones may be mounted to an electronic board, and the electronic board may be attached to the outer housing, in particular using one or more screws. On the other hand, the one or more microphones are typically not in contact with the acoustic chamber. By doing this, the one or more microphones may be isolated from vibrations of the acoustic chamber in a particularly reliable manner.
The one or more microphones and the one or more speaker drivers may be arranged around the central longitudinal axis of the main body at an angular distance of 90° or more (notably between 90° and 180°). As indicated above, the one or more speaker drivers may be at least partially oriented towards the ground that the electronic device is standing on. In this case, the one or more microphones may at least partially be oriented away from the ground. By doing this, the impact of audio rending on the captured microphone signals may be further reduced, thereby further improving the reliability of voice control of the electronic device and/or of the user device which is docked to the electronic device.
The one or more feet of the electronic device may each comprise a vibration isolating section, notably made of silicone, arranged to be in contact with the ground that the electronic device is standing on. By doing this, the quality of the isolation of vibrations may be further improved.
The electronic device may comprise one or more rubber pads configured to isolate vibrations at one or more screw connections of the electronic device, in particular at one or more screw connections for mechanically attaching an electronic board, notably the main electronic board, of the electronic device to the main body. By doing this, the quality of the isolation of vibrations may be further improved.
The control unit may be configured to determine whether or not a user device has been placed in conjunction with the electronic device (e.g., onto the back support). The control unit may be further configured to automatically establish a communication link with the user device (in order to dock the user device to the electronic device), if it is determined that the user device has been placed in conjunction with the electronic device. As a result of this, voice control of a user device may be enabled in a particularly efficient and reliable manner.
It should be noted that the methods and systems including its preferred embodiments as outlined in the present document may be used stand-alone or in combination with the other methods and systems disclosed in this document. In addition, the features outlined in the context of a system are also applicable to a corresponding method. Furthermore, all aspects of the methods and systems outlined in the present document may be arbitrarily combined. In particular, the features of the claims may be combined with one another in an arbitrary manner.
The invention is explained below in an exemplary manner with reference to the accompanying drawings, wherein
- Figure 1a shows a (smart) docking station in a perspective view;
- Figure 1b shows an example docking station in a side view;
- Figure 1c shows a front view of the docking station;
- Figures 2a to 2d show different views of different layers of the main body of the docking station;
- Figures 3a and 3b illustrate a placement of the microphone array relative to the one or more speaker drivers;
- Figures 4a and 4b show example means for reducing vibrations of the docking station; and
- Figures 5a and 5b show an example isolation of vibrations between the microphone array and the acoustic chamber.
As outlined above, the present document is directed at increasing the quality and reliability of user-interaction of a user with a docking station. The following aspects are described in the context of a docking station. It should be noted that these aspects are also applicable to an electronic device in general.
Figs. 1a to 1c show anexample docking station 100. The (smart)docking station 100 comprises amain body 106, which may have a cylindrical form as illustrated inFig. 1a. Themain body 106 may comprise one or more electronic components such as acontrol unit 120, e.g., a microcontroller, of thedocking station 100. Furthermore, themain body 106 may comprise acontrol panel 103 with one or more control elements, notably control buttons. Thecontrol panel 103 may enable a user to interact with thedocking station 100, e.g., for speaker volume control or for quick touch interaction with the voice assistant (stop listening, start listening).
Alternatively, or in addition to acontrol panel 103, thedocking station 100 may comprise a gesture sensor 104 (located e.g., on the main body 106), which is configured to sense gesture data regarding a (hand) gesture performed by a user of thedocking station 100. Thedocking station 100 may be controlled in dependence of the gesture data (i.e., of the measurement data of the gesture sensor 104). In addition, thedocking station 100 may comprise one or morelight elements 150, notably light emitting diodes (LED) and/or anLED strip 151, e.g., for providing status information regarding the status of thedocking station 100 to a user of thedocking station 100.
Themain body 106 may further comprise one or more active speaker drivers (which are also referred to herein as loudspeakers) 108 which are configured to emit an audio signal. The one ormore loudspeakers 108 may be located at a face side and/or at a shell surface of the (cylindrical)main body 106 or of the (cylindrical) acoustic chamber comprised within themain body 106. Themain body 106 may enclose an acoustic chamber for improving the sound quality of the one ormore loudspeakers 108. In addition, thedocking station 100 may comprise one or more (passive)acoustic radiators 107 on the (cylindrical) surface of themain body 106 and/or at the one or more face sides of themain body 106.
Thedocking station 100 may comprise one or morebase feet 105, 111, notably abase foot 105 at a front side facing the user and/or arear foot 111 at a rear side of thedocking station 100. The one or morebase feet 105, 111 may be attached to themain body 106, and may extend from themain body 106 towards theground 160, onto which thedocking station 100 is placed. The one or morebase feet 105, 111 may extend along the longitudinal axis of the (cylindrical)main body 106. In particular, thefeet 105, 111 may have the form of a plate or plane. As such thefeet 105, 111 may be referred to herein as foot plates.
The one or morebase feet 105, 111 may each comprise anisolator element 109, e.g., comprising silicone, at the end of therespective foot 105, 111, which is in contact with theground 160 that thedocking station 100 is placed on. The use of anisolator element 109 allows thedocking station 100 to be placed in a stable manner, preventing it from sliding over the surface that thedocking station 100 is placed on (notably in case of touch interaction with thedocking station 100 and/or when audio is played back). Furthermore, thedocking station 100 may be isolated against (acoustic) vibrations.
In addition, thedocking station 100 may comprise a holder, notably a back support, 102 which extends from themain body 106 upwards (i.e., away from theground 160 that thedocking station 100 is placed on). Theback support 102 may exhibit an angle with regards to theground 160 between 45° and 75°, when thedocking station 100 is placed on theground 160. Theback support 102 may be designed as a support for a typical tablet PC (e.g., with a screen size between 8 inches and 13 inches), also being able to support smaller devices such as phones starting at 5 inches. Theback support 102 may extend along the longitudinal axis of the main body 106 (from one face to the other face of the main body 106).
Theback support 102 may exhibit one or moreisolator stripes 101 which may extend horizontally across theback support 102, or may have at least two support pads. Furthermore, theback support 102 may comprise anisolator section 112 at the top end of theback support 102. The use ofisolator stripes 101 and/or of anisolator section 112 allows an object, such as a tablet PC, to be placed onto theback support 102 in a stable manner. Furthermore, isolation of acoustic vibrations may be provided.
In addition, thedocking station 100 may comprise abase support 124, as shown inFig. 1b, which is located at the contact point between themain body 106 and theback support 102. Thebase support 124 may form a gap within which the lower edge of a user device, notably of a tablet PC, may be placed, in order to hold the user device in a stable manner on theback support 102. Thebase support 124 may comprise an isolating material, in order to allow vibrations of the docking station 100 (e.g., due to rending of an audio signal via aloudspeaker 108 of the docking station 100) to be isolated. Alternatively, or in addition, thebase support 124 may comprise one or more presence sensors (e.g., weight sensors) for detecting the presence of a user device on theback support 102.
Thecontrol unit 120 of thedocking station 100 may be configured to automatically establish a communication link with the user device, if it is determined that the user device has been placed in conjunction with, notably into theholder 102 of, thedocking station 100. By way of example, a wireless communication link, such as a Bluetooth link, may be established between thedocking station 100 and the user device.
Furthermore, thecontrol unit 120 may be configured to receive an audio signal from the user device via the communication link, and to cause one ormore speaker drivers 107, 108 of thedocking station 100 to render the audio signal that has been received from the user device. Alternatively, or in addition, thecontrol unit 120 may be configured to control the user device via the communication link (e.g., using gesture control and/or speech control).
Thedocking station 100 may comprise one ormore microphones 110, e.g., at themain body 106, which are configured to capture acoustic microphone signals, notably speech signals, e.g., in order to enable speech control of thedocking station 100 and/or of a user device that is coupled with thedocking station 100. Thedocking station 100, notably thecontrol unit 120 of thedocking station 100, may be configured to separate an audio signal that has been emitted by thedocking station 100 from an acoustic microphone signal that has been captured by the one ormore microphones 110 of thedocking station 100, thereby enabling a reliable speech control, even when rendering an audio signal. The one ormore microphones 110 may be oriented away from the one ormore speakers 108 and/or the one ormore radiators 107, for enabling a particularly reliable separation of an emitted audio signal from a captured microphone signal. In particular, the angle between the one ormore microphones 110 and the one ormore speaker drivers 107, 108 may be greater than 90°.
Figure 1b shows a side view of thedocking station 100. Thedocking station 100 may comprise anelectronics housing 121 for one or more electronic components (e.g., for the control unit 120). Thehousing 121 may be located between thefront foot 105 and therear foot 111 of thedocking station 100. Furthermore, thedocking station 100 may comprise a data and/or power interface 122 (e.g., a USB interface) for data communication and/or for power supply (e.g., at the housing 121). In addition, thedocking station 100 may comprise asection 123 for placing a power cord for a power supply of thedocking station 100.
Figure 1c shows a front view of thedocking station 100. Themain body 106 of thedocking station 100 may comprise variousdifferent control elements 141, 142, 143, 144 such as acontrol button 141 for activating or deactivating a virtual assistant,volume buttons 142, 143 and/or an on/off-button 144 for activating and for deactivating the one ormore microphones 110.
Thedocking station 100 is typically dedicated for the use within a kitchen. Thedocking station 100 may be used in conjunction with a smartphone and/or tablet PC which is placed on theback support 102 of thedocking station 100. Furthermore, thedocking station 100 may be configured to act standalone as a smart speaker, e.g., for rendering audio signals that are provided by an internet server and/or by a smartphone and/or tablet PC.
Fig. 2a shows further details regarding themain body 106 of thedocking station 100. Themain body 106 comprises anacoustic chamber 201 which houses the one ormore speaker drivers 107, 108 (also referred to as transducers). Furthermore, themain body 106 comprises anouter housing 202 which surrounds theacoustic chamber 201 and which is preferably isolated from theacoustic chamber 201, in order to reduce the transfer of vibrations from theacoustic chamber 201 to theouter housing 202. In particular, a mid-chamber 203 may be located between theacoustic chamber 201 and theouter housing 202, as illustrated inFigs. 2b to 2d. The mid-chamber 203 may be filled with a layer of isolating material.
In the present document various techniques are described to improve voice interaction via the one ormore microphones 110 with thedocking station 100. In this context, echo, reverberation, additive noise, distortion, etc. of thedocking station 100 may be reduced. The microphone array comprising one ormore microphones 110 may be isolated from theacoustic chamber 202 by reducing, in particular by minimizing, the mechanical coupling between the one ormore microphones 110 and the one ormore speaker transducers 107, 108. Alternatively, or in addition, the firing angle of the one ormore transducers 107, 108 may be adjusted to more than ninety degrees with regards to the one ormore microphones 110. Alternatively, or in addition, screws with rubber sealings and/or silicone feet may be used to reduce the mechanical vibrations.
As outlined in the context ofFigs. 2a to 2d, theacoustic chamber 201 may be nested within theouter housing 202 of themain body 106 of thedocking station 100. Theacoustic chamber 201 may be separated from theouter housing 202 by acylindrical mid-chamber 203, which separates theacoustic chamber 201 from the microphone array comprising the one ormore microphones 110. Theouter housing 202, the mid-chamber 203 and theacoustic chamber 201 may be separated by still air, respectively, thereby further reducing mechanical coupling between the one ormore loudspeaker transducers 107, 108 and the one ormore microphones 110. As illustrated inFig. 2b a nested cylinder approach may be used to isolate theouter housing 202, the microphone array comprising the one ormore microphones 110 and theacoustic chamber 201.
Alternatively, or in addition, thefiring angle 301 of the one ormore transducers 108 may be more than or equal to 90° away from the microphone array comprising the one or more microphones 110 (as illustrated inFig. 3a and 3b). This arrangement prevents the music or speech, i.e., theaudio signal 302, that thedevice 100 is playing out, to be coupled back into the device's 100microphones 110.
Furthermore, the mechanical vibrations may be reduced through the device structure, as illustrated inFigs. 4a and 4b. In particular, the screws holding the structure together may make use ofrubber sealings 401. Furthermore, thedevice 100 may be equipped withfeet 105, 111 comprisingsilicone sections 403 for reducing reverberation. Alternatively, or in addition, anelectronic board 406 of thedocking station 100 may be attached to thedocking station 100, notably to themain body 106 and/or to thefeet 105, 111, using screw connections with rubber pads and/or rubber caps 405.
Figs. 5a and 5b shows one ormore microphones 110 of thedocking station 100, which are attached to theouter housing 202 of thedocking station 100, thereby isolating the one ormore microphones 110 from vibrations of theacoustic chamber 201. In particular, the one ormore microphones 110 may be mounted on anelectronic board 502, and theelectronic board 502 may be attached to theouter housing 202, e.g., using one ormore screws 501.
The means for mechanically separating the one ormore microphones 110 from the one ormore speakers 108 allows preventing the false triggering of voice commands when the user is listening to anaudio signal 302. On the other hand, the user is enabled to trigger voice commands while anaudio signal 302 is being played. The described measures allow providing a seamless audio experience to the user. Furthermore, the measures allow providing a reliable voice activateddevice 100 through accurate wake up word detection and command recognition (even in noisy environments).
It should be noted that the description and drawings merely illustrate the principles of the proposed methods and systems. Those skilled in the art will be able to implement various arrangements that, although not explicitly described or shown herein, embody the principles of the invention and are included within its spirit and scope. Furthermore, all examples and embodiment outlined in the present document are principally intended expressly to be only for explanatory purposes to help the reader in understanding the principles of the proposed methods and systems. Furthermore, all statements herein providing principles, aspects, and embodiments of the invention, as well as specific examples thereof, are intended to encompass equivalents thereof.