This example demonstrates how data from an external data source (e.g. a work tracking system) might be integrated with Fluid to enable more-real-time collaboration. For example, to allow collaborators to see proposed changes updating live before committing them back to the database.
Please note that the ideas explored here are experimental and under development. They are not yet recommended for broad use in production. When this changes, we will update the documents accordingly.
This example demonstrates a scenario in which the "source of truth" of the customer data lives in a service that is external to the Fluid service. Customers can then:
- Import the external data into a Fluid collaboration session.
- Export data from a Fluid collaboration session back to the source of truth.
- Sync updates between Fluid and the source of truth in as close to real-time as the scenario allows.
In this case, the Fluid collaboration session serves as a "drafting surface" in which clients collaborate to create a draft of the data and then send the saved data back to the source of truth for long term storage.
In order to accomplish the goals above, we have split up the responsibilities into a few different pieces:
Many services that would hold the "source of truth" data offer explicit commit style interfaces (e.g. via REST call or similar) which are not well suited to rapid updates. However, they often expose third-party integration via REST APIS for querying and manipulating data, as well as webhooks for watching updates to the data.
This repo contains a service that mocks the external "source of truth" data server. This mock service offers a REST API collection and webhook interfaces in ./src/mock-external-data-service
. The API requests served by this "external" service are the following:
- POST
/register-for-webhook?externalTaskListId=XXX
. Required body parameters:url
(string). Registers the sender's URL to receive notifications when the external task-list data changes. Currently, the Customer Service registers its/external-data-webhook
endpoint here to be called when data changes. - GET
/fetch-tasks/:externalTaskListId
: Fetches the task list from the external data store. Called by the Fluid client. - POST
/set-tasks/:externalTaskListId
: Updates external data store with new tasks list (complete override). Called by the Fluid client.
Find the details of the API in the External Data Service README
Next we need a customer service that functions as the intermediary between the External Data Service and the Fluid Service. This server is responsible for authenticating to the external service on the customer's behalf. It registers to the External Data Service webhooks and listens for incoming changes. It also acts as a translation layer, translating to and from the External Data Service expected format and the Fluid Service's expected format.
In this example, the Customer Service contains the following endpoints:
1.POST /register-session-url
. Required body parameters: containerUrl
(string), externalTaskListId
(string). Creates an entry in the Customer Service of the mapping between the container and the external resource id. It then calls the External Service's /register-for-webhook
endpoint to call it's own /external-data-webhook?externalTaskListId=XXX
endpoint (details below), in order to notify the Fluid Service containers subscribed to that externalTaskListId of the corresponding change.
- POST
/external-data-webhook
. Required querystring parameters:externalTaskListId
(string). This gets called by the External Data Service when there's been a change to the data and causes the Customer Service to in turn call the/broadcast-signal
endpoint in the Fluid Service. Note that this is a route established by the Customer Service to be used exclusively as a subscription endpoint for the external data service's webhook and should be considered a private implementation detail to the service. Customers may choose to implement this differently.
Find the details of the API in the Customer Service README.
The broadcast-signal
endpoint is new and still under construction. We will update it here once it is available to use.
- POST
/broadcast-signal
. Required body parameters:containerUrl
(string),externalTaskListId
(string),taskData
(ITaskData). This endpoint is called by the Customer Service when it needs the Fluid Service to notify the Fluid clients that there has been a change to the upstream data. The body must contain thecontainerUrl
, from which we can extract a few pieces relevant to the Fluid service: thesocketStreamUrl
, thecontainerId
(sometimes known as thedocumentId
), and thetenantId
. The Fluid service will broadcast a signal of typeSignalType.RuntimeSignal
to the clients to alert them of the change. In this way, it "echoes" the webhook call from the External Data Service to the Customer Service.
This example uses the tinylicious driver to stub out what changes will be necessary in the odsp-driver. The prototype of the full signal and driver flow can be seen in this dev/external-data-prototyping
branch to main comparison.
On receiving the signal, the clients (or elected leader client) can then send a fetch call to retrieve the information and display it to screen by making a call to the external data server's GET /fetch-tasks
endpoint.
The client can then display the diff on the screen and the users can choose how to reconcile the changes.
Once the changes are reconciled, the collaboration session can continue as expected, and when the collaboration session is ready to be closed, the clients can simply Save Changes to write back to the External Data Source by making a request to the External Data Server's POST /set-tasks
endpoint.
A few useful concepts to understand in implementing conflict resolution in this app:
Task - This is the unit that can be edited and attributed to an author. It is also the level at which conflict resolution currently takes place. The task holds knowledge of the local edits as well as the external edits and can display one or both to the screen.
TaskList - This can be compared to a "board" that holds all of the tasks. It is the larger visible entity to show up in a component within the app. The app stores two types of TaskLists - a "draft" version and a "saved version". More on these below.
TaskList - This can be compared to a "board" that holds a list of related tasks; for example, the results of a particular query. It is the larger visible entity to show up in a component within the app. The app stores two types of TaskLists - a "draft" version and a "saved version". More on these below.
ITaskData - This is similar to Task except that it is the External Data Services model of the task.
ITaskListData - This is similar to TaskList except that it is the External Data Services model of the taskList.
SavedData - Data that comes in fresh from the external data source is first stored in a SharedMap known as "SavedData".
DraftData - Local collaboration state between the Fluid clients is stored in a SharedMap known as "DraftData". This is known as draft data because we are treating the Fluid collaboration session as a drafting surface.
You can run this example using the following steps:
- Enable corepack by running
corepack enable
. - Run
pnpm install
andpnpm run build:fast --nolint
from theFluidFramework
root directory.- For an even faster build, you can add the package name to the build command, like this:
pnpm run build:fast --nolint @fluid-example/app-integration-external-data
- For an even faster build, you can add the package name to the build command, like this:
- Run
pnpm start
from this directory and open http://localhost:8080 in a web browser to see the app running.
npm run test:jest
For in browser testing update ./jest-puppeteer.config.js
to:
launch: {
dumpio: true, // output browser console to cmd line
slowMo: 500,
headless: false,
},