may then be used train a rotation matrix R for generating hash vectors. The rotation matrix may be initialized, such as by initializing it to a random rotation. The rotation matrix may then be trained by sequentially performing the following updates:
B=sign(

R)
U,S,V=SVD(B^T

)
R=VU^T
where signs( ) returns matrix of 1's and −1's according to the sign of corresponding elements of the input and SVD( ) performs a singular value decomposition of the input. This sequence of operations may be performed until a convergence criterion has been met. Each row of the final matrix B contains a resource hash vector for a corresponding resource and the final matrix B may have values of only 1 and −1. In some implementations, the matrix B may be converted to a matrix of 1s and 0s by converting all the −1s to 0s or performing some other similar operation. The quantization model comprises rotating a vector with rotation matrix R and then performing the sign( ) operation on the resulting vector.

FIGS. 9A and 9B illustrate systems for suggesting a resource to a CSR using hash vectors. At the beginning of a session with a CSR a first message is received. The first message may be from either the customer or the CSR. The received message is processed to compute a context hash vector, and then the context hash vector is used to retrieve resources fromresources data store140.

Insystem901 ofFIG. 9A, messagefeature extraction component810 computes features for the message,message embedding component820 computes a message embedding from the features,session context830 computes a context vector for the session (which may include only the first message at this point),encoding component840 computes an approximate context hash vector, andquantization component910 computes a context hash vector.Quantization component910 may receive the approximate context hash vector (which may have real values) and may perform a rotation and then compute the sign( ) of the rotated vector to generate the context hash vector (which may have values of only 1 and −1). In some implementations, the context hash vector may be converted to a vector of 1s and 0s similar to the processing of the resource hash vectors above.

Insystem902, the context hash vector is used to obtain resources to suggest to a CSR.Search component920 receives a context hash vector and retrieves resources fromresources data store140 by comparing the context hash vector to resource hash vectors in the data store.

In some implementations,search component920 may retrieve all resources where the resource hash vector of the resource is equal to the context hash vector by performing a query using the context hash vector.

In some implementations,search component920 may retrieve all resources where the resource hash vector of the resource is within a Hamming radius of the context hash vector. A Hamming radius of a hash vector may comprise all other vectors where the number of different elements is less than or equal to a specified value. A Hamming radius of 1 for a context hash vector would include a resource hash vector that is identical to the context hash vector and all resource hash vectors whose elements are the same as the context hash vector for all but one element. For example, for a context hash vector of [1, 0, 1, 0], resource hash vectors within a Hamming distance of 1 would include [1, 0, 1, 0]; [0, 0, 1, 0]; [1, 1, 1, 0]; [1, 0, 0, 0]; and [1, 0, 1, 1].Search component920 may determine all resource hash vectors within a Hamming radius of the context hash vector and retrieve corresponding resources fromresources data store140.

In some implementations,search component920 may implement an inverted index to speed up retrieval of resources using a context hash vector. An inverted index may include a list of resources corresponding to each possible resource hash vector, and allow for fast retrieval of resources fromresources data store140.

In some implementations,system902 may includepost-processing component930.Post-processing component930 may receive a list of resources fromsearch component920 and perform additional processing to determine which resources to present to a CSR.Post-processing component930 may rerank the resources received fromsearch component920 or may select no resources so that no suggestions are presented to a CSR. In some implementations,post-processing component930 may use a translation language model that was trained to translate between messages and resources used in response to messages; may apply any statistical machine translation techniques that indicate a match between a message and a resource, may use a ranking support vector machine that processes TFIDF features of a previous message, or may use any other known reranking techniques.

The resource suggestions may then be presented to a CSR, such as by presenting information about the suggested resources in a user interface, such as the user interface ofFIG. 2. The CSR may then use the suggested resources in a conversation with the customer. For example, where the suggested resource is a message, the CSR may send that message to the customer or may modify it and then send it to the customer.

FIG. 10 is a flowchart of an example implementation of training one or more models for computing hash vectors for suggesting resources. InFIG. 10, the ordering of the steps is exemplary and other orders are possible, not all steps are required and, in some implementations, some steps may be omitted or other steps may be added. The process of the flowcharts may be implemented, for example, by any of the computers or systems described herein.

Atstep1010, a message sequence is obtained from a training corpus, such as the message sequence ofFIG. 4 or a portion of the message sequence ofFIG. 4. The message sequence may include any number of messages between a customer and a CSR. The message sequence may also include other information, such as other resources used by a CSR during the session between the customer and the CSR. For example, the message sequence may be messages1-3 ofFIG. 4.

Atstep1020, an approximate context hash vector is computed for the message sequence. The approximate context hash vector may be computed using any of the techniques described above. For example, an approximate context hash vector may be computed iteratively for each message in the sequence, where each approximate context hash vector is computed using the approximate context hash vector from the previous iteration. Computing the approximate context hash vector may also include processing other resources used by a CSR, such as when the CSR uses an image in responding to the customer. In some implementations, an approximate context hash vector may be computed by messagefeature extraction component810,message embedding component820,session context component830, andencoding component840.

Atstep1030, a response of the CSR to the last message in the message sequence is obtained from the training data. For example, the response of the CSR may bemessage4 ofFIG. 4.

Atstep1040, an approximate resource hash vector is computed for the response. The approximate resource hash vector may be computed using any of the techniques described above. In some implementations, an approximate resource hash vector for the response may be computed by resourcefeature extraction component815,resource embedding component825, andencoding component841.

Atstep1050, a different resource is obtained that is different from the response of the CSR to customer atstep1040. For example, a resource may be selected randomly fromresources data store140.

Atstep1060, an approximate resource hash vector is computed for the different resource obtained atstep1050. The approximate resource hash vector for the different resource may be computed using any of the techniques described above. In some implementations, approximate resource hash vector for the different resource may be computed by resourcefeature extraction component816,resource embedding component826, andencoding component842.

In some implementations, multiple different resources may be used as described above. Accordingly, steps1050 and1060 may be performed for each different resource that is used.

Atstep1070, model parameters are updated using the approximate context hash vector, the approximate resource hash vector for the response of the CSR, and one or more approximate resource hash vectors for the different resources. The model parameters may be updated using any of the techniques described herein.

Atstep1080, it is determined whether the training process has completed. Any appropriate criteria may be used to determined when the training process has completed. For example, where the model parameters have converged to stable values (e.g., the differences with a previous iteration are small), it may be determined that the training has completed. Where training has not completed, processing may return to step1010 to, for example, obtain and process the following message in a message sequence.

Atstep1090, training is completed and the trained one or more models may be further processed (e.g., compute approximate resource hash vectors, train a quantization model, and compute resource hash vectors) and used for suggesting resources.

FIG. 11 is a flowchart of an example implementation of suggesting resources using hash vectors. InFIG. 11, the ordering of the steps is exemplary and other orders are possible, not all steps are required and, in some implementations, some steps may be omitted or other steps may be added. The process of the flowcharts may be implemented, for example, by any of the computers or systems described herein.

Atstep1110, a message is received from either a customer or a CSR. For example, the message may be sent from a customer to the CSR or from the CSR to the customer. The message may be the first message between them or it may follow other messages between them.

Atstep1120, a semantic representation is computed from the message. The semantic representation may be any representation of the message that indicates a meaning of the message, although the semantic representation may not be understandable by a person. The semantic representation may be a vector of real numbers or may take another form, such as a matrix or a tensor. In some implementations, the semantic representation may be a message embedding computed by messagefeature extraction component810 andmessage embedding component820.

Atstep1130, a context vector is computed for the session using the semantic representation of the message. The context vector may be any representation of the session that indicates a meaning of the session (for example, the meaning of the session may include a meaning of the current message and a meaning of previous messages in the session). The context vector may be computed using a context vector from a previous iteration that processed a previous message in the session. The context vector may be a vector of real numbers or may be in another format, such as a matrix or a tensor (the term context vector is used for clarity of presentation but a context matrix or context tensor may be computed instead). In some implementations, the context vector may be computed usingsession context component830.

Atstep1140, a context hash vector is computed for the session. The context hash vector may be any hash vector that indicates a meaning of the session (for example, the meaning of the session may include a meaning of the current message and a meaning of previous messages in the session). The context hash vector may be a vector where each element of the vector takes one of two values, such as 0 or 1. In some implementations, a context hash matrix or a context hash tensor may be computed instead of a context hash vector. In some implementations, the context hash vector may be computed usingencoding component840 andquantization component910.

Atstep1150, one or more resources are obtained using the context hash vector. For example, one or more resources may be retrieved from a data store of resources where a resource hash vector of a resource matches or is close to the context hash vector. In some implementations, resources may be obtained where the resource hash vectors are within a Hamming distance of the context hash vector. In some implementations, the one or more resources may be obtained bysearch component920.

Atstep1160, post-processing may be performed on the obtained resources as described above. In some implementations, the post-processing may be performed bypost-processing component930.

Atstep1170, one or more resources are caused to be presented to the CSR. For example, the one or more resources may be presented to the CSR using the user interface ofFIG. 2. The CSR may then use the resource in responding to the customer as described above.

FIG. 12 illustrates components of one implementation of acomputing device1200 for implementing any of the techniques described above. InFIG. 12, the components are shown as being on asingle computing device1200, but the components may be distributed among multiple computing devices, such as a system of computing devices, including, for example, an end-user computing device (e.g., a smart phone or a tablet) and/or a server computing device (e.g., cloud computing).

Computing device

1200 may include any components typical of a computing device, such as volatile ornonvolatile memory1210, one ormore processors1211, and one or more network interfaces1212.Computing device1200 may also include any input and output components, such as displays, keyboards, and touch screens.Computing device1200 may also include a variety of components or modules providing specific functionality, and these components or modules may be implemented in software, hardware, or a combination thereof. Below, several examples of components are described for one example implementation, and other implementations may include additional components or exclude some of the components described below.

Computing device

1200 may have asupport component1220 that provides functionality for allowing a customer and a CSR to interact with each other in a support session, such as presenting user interfaces to a customer or CSR, allowing messages to be transmitted between the customer and the CSR, or presenting suggestions to a CSR.Computing device1200 may have asuggestion component1230 that may identify resources as possible suggestions for a CSR, such as by processing a message transmitted between a customer and a CSR, computing a context for the session, and retrieving resources from a data store using the computed context.Computing device1200 may have amodel training component1240 that train mathematical models, such as artificial neural networks, for suggesting resources based on a context of a session.

Computing device

1200 may include or have access to various data stores, such as

data stores

140 and510. Data stores may use any known storage technology, such as files or relational or non-relational databases. For example,computing device1200 may have aresources data store140 and a trainingcorpus data store510, as described above.

For clarity of presentation, the techniques described above have been presented in the context of a session between a customer and a CSR where the customer and CSR are exchanging messages with each other. The techniques described above, however, are not limited to that particular example, and other applications are possible.

The techniques described above may be applied to any two entities exchanging messages with each other. For example, two individuals may be exchanging messages with each other, resources may be suggested to either user, and the suggested resources may include a message to send in response or something else, such as a URL to a website with information relevant to the conversation.

The techniques described above may be applied to interactions other than messages. For example, the interactions between two entities may be in the form of audio and/or video and resources may be suggested to the entities by processing the audio and/or video to determine a context of the session and suggest resources to the entities.

The techniques described above may be applied to interactions that proceed in non-linear ways, such as a directed acyclic graph (in comparison to linear, sequential exchanges in a messaging session). For interactions that proceed as a directed acyclic graph, a recursive neural network (e.g., with long short-term memory units) may be used that is adapted to process nodes of a directed acyclic graph.

The techniques described above may be combined with any of the techniques described in U.S. patent application Ser. No. 15/254,008 filed on Sep. 1, 2016, now issued as U.S. Pat. No. 9,715,496 on Jul. 25, 2017, and U.S. patent application Ser. No. 15/383,707, filed on Dec. 19, 2016 and entitled “Word Hash Language Model”, each of which is herein incorporated by reference in its entirety for all purposes. For example, any of the techniques described herein may be provided as part of a third-party semantic processing service whereby a third party provides semantic processing services to a company to assist the company in providing customer service to its customers.

The methods and systems described herein may be deployed in part or in whole through a machine that executes computer software, program codes, and/or instructions on a processor. “Processor” as used herein is meant to include at least one processor and unless context clearly indicates otherwise, the plural and the singular should be understood to be interchangeable. The present invention may be implemented as a method on the machine, as a system or apparatus as part of or in relation to the machine, or as a computer program product embodied in a computer readable medium executing on one or more of the machines. The processor may be part of a server, client, network infrastructure, mobile computing platform, stationary computing platform, or other computing platform. A processor may be any kind of computational or processing device capable of executing program instructions, codes, binary instructions and the like. The processor may be or include a signal processor, digital processor, embedded processor, microprocessor or any variant such as a co-processor (math co-processor, graphic co-processor, communication co-processor and the like) and the like that may directly or indirectly facilitate execution of program code or program instructions stored thereon. In addition, the processor may enable execution of multiple programs, threads, and codes. The threads may be executed simultaneously to enhance the performance of the processor and to facilitate simultaneous operations of the application. By way of implementation, methods, program codes, program instructions and the like described herein may be implemented in one or more thread. The thread may spawn other threads that may have assigned priorities associated with them; the processor may execute these threads based on priority or any other order based on instructions provided in the program code. The processor may include memory that stores methods, codes, instructions and programs as described herein and elsewhere. The processor may access a storage medium through an interface that may store methods, codes, and instructions as described herein and elsewhere. The storage medium associated with the processor for storing methods, programs, codes, program instructions or other type of instructions capable of being executed by the computing or processing device may include but may not be limited to one or more of a CD-ROM, DVD, memory, hard disk, flash drive, RAM, ROM, cache and the like.

A processor may include one or more cores that may enhance speed and performance of a multiprocessor. In embodiments, the process may be a dual core processor, quad core processors, other chip-level multiprocessor and the like that combine two or more independent cores (called a die).

The methods and systems described herein may be deployed in part or in whole through a machine that executes computer software on a server, client, firewall, gateway, hub, router, or other such computer and/or networking hardware. The software program may be associated with a server that may include a file server, print server, domain server, internet server, intranet server and other variants such as secondary server, host server, distributed server and the like. The server may include one or more of memories, processors, computer readable media, storage media, ports (physical and virtual), communication devices, and interfaces capable of accessing other servers, clients, machines, and devices through a wired or a wireless medium, and the like. The methods, programs, or codes as described herein and elsewhere may be executed by the server. In addition, other devices required for execution of methods as described in this application may be considered as a part of the infrastructure associated with the server.

The server may provide an interface to other devices including, without limitation, clients, other servers, printers, database servers, print servers, file servers, communication servers, distributed servers and the like. Additionally, this coupling and/or connection may facilitate remote execution of program across the network. The networking of some or all of these devices may facilitate parallel processing of a program or method at one or more location without deviating from the scope of the invention. In addition, any of the devices attached to the server through an interface may include at least one storage medium capable of storing methods, programs, code and/or instructions. A central repository may provide program instructions to be executed on different devices. In this implementation, the remote repository may act as a storage medium for program code, instructions, and programs.

The software program may be associated with a client that may include a file client, print client, domain client, internet client, intranet client and other variants such as secondary client, host client, distributed client and the like. The client may include one or more of memories, processors, computer readable media, storage media, ports (physical and virtual), communication devices, and interfaces capable of accessing other clients, servers, machines, and devices through a wired or a wireless medium, and the like. The methods, programs, or codes as described herein and elsewhere may be executed by the client. In addition, other devices required for execution of methods as described in this application may be considered as a part of the infrastructure associated with the client.

The client may provide an interface to other devices including, without limitation, servers, other clients, printers, database servers, print servers, file servers, communication servers, distributed servers and the like. Additionally, this coupling and/or connection may facilitate remote execution of program across the network. The networking of some or all of these devices may facilitate parallel processing of a program or method at one or more location without deviating from the scope of the invention. In addition, any of the devices attached to the client through an interface may include at least one storage medium capable of storing methods, programs, applications, code and/or instructions. A central repository may provide program instructions to be executed on different devices. In this implementation, the remote repository may act as a storage medium for program code, instructions, and programs.

The methods and systems described herein may be deployed in part or in whole through network infrastructures. The network infrastructure may include elements such as computing devices, servers, routers, hubs, firewalls, clients, personal computers, communication devices, routing devices and other active and passive devices, modules and/or components as known in the art. The computing and/or non-computing device(s) associated with the network infrastructure may include, apart from other components, a storage medium such as flash memory, buffer, stack, RAM, ROM and the like. The processes, methods, program codes, instructions described herein and elsewhere may be executed by one or more of the network infrastructural elements.

The methods, program codes, and instructions described herein and elsewhere may be implemented on a cellular network having multiple cells. The cellular network may either be frequency division multiple access (FDMA) network or code division multiple access (CDMA) network. The cellular network may include mobile devices, cell sites, base stations, repeaters, antennas, towers, and the like. The cell network may be a GSM, GPRS, 3G, EVDO, mesh, or other networks types.

The methods, programs codes, and instructions described herein and elsewhere may be implemented on or through mobile devices. The mobile devices may include navigation devices, cell phones, mobile phones, mobile personal digital assistants, laptops, palmtops, netbooks, pagers, electronic books readers, music players and the like. These devices may include, apart from other components, a storage medium such as a flash memory, buffer, RAM, ROM and one or more computing devices. The computing devices associated with mobile devices may be enabled to execute program codes, methods, and instructions stored thereon. Alternatively, the mobile devices may be configured to execute instructions in collaboration with other devices. The mobile devices may communicate with base stations interfaced with servers and configured to execute program codes. The mobile devices may communicate on a peer-to-peer network, mesh network, or other communications network. The program code may be stored on the storage medium associated with the server and executed by a computing device embedded within the server. The base station may include a computing device and a storage medium. The storage device may store program codes and instructions executed by the computing devices associated with the base station.

The computer software, program codes, and/or instructions may be stored and/or accessed on machine readable media that may include: computer components, devices, and recording media that retain digital data used for computing for some interval of time; semiconductor storage known as random access memory (RAM); mass storage typically for more permanent storage, such as optical discs, forms of magnetic storage like hard disks, tapes, drums, cards and other types; processor registers, cache memory, volatile memory, non-volatile memory; optical storage such as CD, DVD; removable media such as flash memory (e.g. USB sticks or keys), floppy disks, magnetic tape, paper tape, punch cards, standalone RAM disks, Zip drives, removable mass storage, off-line, and the like; other computer memory such as dynamic memory, static memory, read/write storage, mutable storage, read only, random access, sequential access, location addressable, file addressable, content addressable, network attached storage, storage area network, bar codes, magnetic ink, and the like.

The methods and systems described herein may transform physical and/or or intangible items from one state to another. The methods and systems described herein may also transform data representing physical and/or intangible items from one state to another.

The elements described and depicted herein, including in flow charts and block diagrams throughout the figures, imply logical boundaries between the elements. However, according to software or hardware engineering practices, the depicted elements and the functions thereof may be implemented on machines through computer executable media having a processor capable of executing program instructions stored thereon as a monolithic software structure, as standalone software modules, or as modules that employ external routines, code, services, and so forth, or any combination of these, and all such implementations may be within the scope of the present disclosure. Examples of such machines may include, but may not be limited to, personal digital assistants, laptops, personal computers, mobile phones, other handheld computing devices, medical equipment, wired or wireless communication devices, transducers, chips, calculators, satellites, tablet PCs, electronic books, gadgets, electronic devices, devices having artificial intelligence, computing devices, networking equipments, servers, routers and the like. Furthermore, the elements depicted in the flow chart and block diagrams or any other logical component may be implemented on a machine capable of executing program instructions. Thus, while the foregoing drawings and descriptions set forth functional aspects of the disclosed systems, no particular arrangement of software for implementing these functional aspects should be inferred from these descriptions unless explicitly stated or otherwise clear from the context. Similarly, it will be appreciated that the various steps identified and described above may be varied, and that the order of steps may be adapted to particular applications of the techniques disclosed herein. All such variations and modifications are intended to fall within the scope of this disclosure. As such, the depiction and/or description of an order for various steps should not be understood to require a particular order of execution for those steps, unless required by a particular application, or explicitly stated or otherwise clear from the context.

The methods and/or processes described above, and steps thereof, may be realized in hardware, software or any combination of hardware and software suitable for a particular application. The hardware may include a general-purpose computer and/or dedicated computing device or specific computing device or particular aspect or component of a specific computing device. The processes may be realized in one or more microprocessors, microcontrollers, embedded microcontrollers, programmable digital signal processors or other programmable device, along with internal and/or external memory. The processes may also, or instead, be embodied in an application specific integrated circuit, a programmable gate array, programmable array logic, or any other device or combination of devices that may be configured to process electronic signals. It will further be appreciated that one or more of the processes may be realized as a computer executable code capable of being executed on a machine-readable medium.

The computer executable code may be created using a structured programming language such as C, an object oriented programming language such as C++, or any other high-level or low-level programming language (including assembly languages, hardware description languages, and database programming languages and technologies) that may be stored, compiled or interpreted to run on one of the above devices, as well as heterogeneous combinations of processors, processor architectures, or combinations of different hardware and software, or any other machine capable of executing program instructions.

Thus, in one aspect, each method described above and combinations thereof may be embodied in computer executable code that, when executing on one or more computing devices, performs the steps thereof. In another aspect, the methods may be embodied in systems that perform the steps thereof, and may be distributed across devices in a number of ways, or all of the functionality may be integrated into a dedicated, standalone device or other hardware. In another aspect, the means for performing the steps associated with the processes described above may include any of the hardware and/or software described above. All such permutations and combinations are intended to fall within the scope of the present disclosure.

While the invention has been disclosed in connection with the preferred embodiments shown and described in detail, various modifications and improvements thereon will become readily apparent to those skilled in the art. Accordingly, the spirit and scope of the present invention is not to be limited by the foregoing examples, but is to be understood in the broadest sense allowable by law.

All documents referenced herein are hereby incorporated by reference.

Claims

What is claimed is:

1. A computer-implemented method for presenting information about a resource to a user, the method performed by one or more server computers and comprising:

receiving a plurality of electronic messages during a session between a first computing device of a first user and a second computing device of a second user;

computing a message embedding for each message of the plurality of electronic messages with a first neural network, wherein each message embedding represents a corresponding message in a vector space;

computing a first context vector by sequentially processing the message embeddings for the plurality of electronic messages, wherein the processing is performed using a second neural network;

quantizing the first context vector to obtain a first context hash vector;

selecting a first resource from a data store using the first context hash vector and a hash vector for the first resource, wherein (i) the data store comprises a plurality of resources, (ii) each resource of the plurality of resources is associated with a hash vector, (iii) selecting the first resource comprises computing a distance between the first context hash vector and the hash vector for the first resource; and

transmitting, during the session, information about the first resource to the first computing device to allow the first user to access the first resource.

2. The computer-implemented method ofclaim 1, wherein the first user is a customer service representative and the second user is a customer.

3. The computer-implemented method ofclaim 1, wherein the method further comprises:

receiving a subsequent message between the first user and the second user;

computing a subsequent message embedding for the subsequent message;

computing a second context vector using the first context vector and the subsequent message embedding of the subsequent message;

quantizing the second context vector to obtain a second context hash vector;

selecting a second resource from the data store using the second context hash vector and a hash vector for the second resource; and

transmitting, during the session, information about the second resource to the first computing device.

4. The computer-implemented method ofclaim 1, wherein the second neural network is a recurrent neural network or a convolution neural network.

5. The computer-implemented method ofclaim 1, wherein each element of the first context hash vector comprises a boolean value.

6. The computer-implemented method ofclaim 1, wherein the first resource comprises text of a message, a document, an image, or a URL.

7. The computer-implemented method ofclaim 1, wherein the first context hash vector is equal to the hash vector for the first resource.

8. The computer-implemented method ofclaim 1, wherein the distance is a Hamming distance.

9. A system for presenting information about a resource to a user, the system comprising:

at least one server computer comprising at least one processor and at least one memory,

the at least one server computer configured to:

receive, a plurality of electronic messages during a session between a first computing device of a first user and a second computing device of a second user;

compute, a semantic representation of each message of the plurality of electronic messages;

compute, a first context vector by processing the semantic representations for the plurality of electronic messages;

quantize, the first context vector to obtain a first context hash vector;

select a first resource from a data store using the first context hash vector and a hash vector for the first resource, wherein the data store comprises a plurality of resources and each resource of the plurality of resources is associated with a hash vector; and

transmit, during the session, information about the first resource to the first computing device.

10. The system ofclaim 9, wherein the at least one server computer is configured to:

receive a selection of the first resource by the first user; and

cause the first resource to be transmitted to the second user.

11. The system ofclaim 9, wherein the semantic representation comprises a message embedding.

12. The system ofclaim 9, wherein the at least one server computer is configured to compute the first context vector using a recurrent neural network with long short-term memory units.

13. The system ofclaim 9, wherein the at least one server computer is configured to select the first resource using an inverted index.

14. The system ofclaim 9, wherein the at least one server computer is configured to quantize the first context vector by performing a rotation of the first context vector.

15. The system ofclaim 9, wherein the first context vector is computed using a neural network and the neural network is trained by minimizing a triplet rank loss function.

16. One or more non-transitory computer-readable media comprising computer executable instructions that, when executed, cause at least one processor to perform actions comprising:

computing a semantic representation of each message of the plurality of electronic messages;

computing a context vector by processing the semantic representations for the plurality of electronic messages;

quantizing the context vector to obtain a context hash vector;

selecting a first resource from a data store using the context hash vector and a hash vector for the first resource, wherein the data store comprises a plurality of resources and each resource of the plurality of resources is associated with a hash vector; and

transmitting, during the session, information about the first resource to the first computing device.

17. The one or more non-transitory computer-readable media ofclaim 16, wherein each element of the context hash vector comprises a boolean value.

18. The one or more non-transitory computer-readable media ofclaim 16, wherein selecting the first resource from the data store comprises computing a Hamming distance between the context hash vector and the hash vector for the first resource.

19. The one or more non-transitory computer-readable media ofclaim 16, wherein the semantic representation comprises a message embedding.

20. The one or more non-transitory computer-readable media ofclaim 16, wherein selecting the first resource comprises using an inverted index.