Sending Clickstream data from client-side to Pipelines
In this tutorial, you will learn how to ingest clickstream data to a R2 bucket using Pipelines. You will send this data from the client-side, that means you will make a call to the Pipelines URL directly from the client-side JavaScript code.
For this tutorial, you will build a landing page of an e-commerce website. The page will list the products available for sale. A user can click on the view button to view the product details or click on the add to cart button to add the product to their cart. The focus of this tutorial is to show how to ingest the data to R2 using Pipelines from the client-side. Hence, the landing page will be a simple HTML page with no actual e-commerce functionality.
- Create a R2 bucket in your Cloudflare account.
- Install
Node.jsβ.
Node.js version manager
Use a Node version manager like Volta β or
nvm β to avoid permission issues and change
Node.js versions. Wrangler, discussed
later in this guide, requires a Node version of 16.17.0 or later.
You will create a new Worker project that will use Static Assets to serve the HTML file. While you can use any front-end framework, this tutorial uses plain HTML and JavaScript to keep things simple. If you are interested in learning how to build and deploy a web application on Workers with Static Assets, you can refer to the Frameworks documentation.
Create a new Worker project by running the following commands:
npm create cloudflare@latest -- e-commerce-pipelines-client-sideyarn create cloudflare@latest e-commerce-pipelines-client-sidepnpm create cloudflare@latest e-commerce-pipelines-client-sideFor setup, select the following options:
- For What would you like to start with?, choose
Hello World example. - For Which template would you like to use?, choose
Worker + Assets. - For Which language do you want to use?, choose
TypeScript. - For Do you want to use git for version control?, choose
Yes. - For Do you want to deploy your application?, choose
No(we will be making some changes before deploying).
Navigate to the e-commerce-pipelines-client-side directory:
cd e-commerce-pipelines-client-sideUsing Static Assets, you can serve the frontend of your application from your Worker. The above step creates a new Worker project with a default public/index.html file. Update the public/index.html file with the following HTML code:
Select to view the HTML code
<!DOCTYPE html><html> <head> <meta charset="utf-8" /> <title>E-commerce Store</title> <script src="https://cdn.tailwindcss.com"></script> </head> <body> <nav class="bg-gray-800 text-white p-4"> <div class="container mx-auto flex justify-between items-center"> <a href="/" class="text-xl font-bold"> E-Commerce Demo </a> <div class="space-x-4 text-gray-800"> <a href="#"> <button class="border border-input bg-white h-10 px-4 py-2 rounded-md">Cart</button> </a> <a href="#"> <button class="border border-input bg-white h-10 px-4 py-2 rounded-md">Login</button> </a> <a href="#"> <button class="border border-input bg-white h-10 px-4 py-2 rounded-md">Signup</button> </a> </div> </div> </nav> <div class="container mx-auto px-4 py-8"> <h1 class="text-3xl font-bold mb-6">Our Products</h1> <div class="grid grid-cols-1 md:grid-cols-2 lg:grid-cols-3 gap-6" id="products"> <!-- This section repeats for each product -->
<!-- End of product section --> </div> </div>
<script> // demo products const products = [ { id: 1, name: 'Smartphone X', desc: 'Latest model with advanced features', cost: 799, }, { id: 2, name: 'Laptop Pro', desc: 'High-performance laptop for professionals', cost: 1299, }, { id: 3, name: 'Wireless Earbuds', desc: 'True wireless earbuds with noise cancellation', cost: 149, }, { id: 4, name: 'Smart Watch', desc: 'Fitness tracker and smartwatch combo', cost: 199, }, { id: 5, name: '4K TV', desc: 'Ultra HD smart TV with HDR', cost: 599, }, { id: 6, name: 'Gaming Console', desc: 'Next-gen gaming system', cost: 499, }, ];
// function to render products function renderProducts() { console.log('Rendering products...'); const productContainer = document.getElementById('products'); productContainer.innerHTML = ''; // Clear existing content products.forEach((product) => { const productElement = document.createElement('div'); productElement.classList.add('rounded-lg', 'border', 'bg-card', 'text-card-foreground', 'shadow-sm'); productElement.innerHTML = ` <div class="flex flex-col space-y-1.5 p-6"> <h2 class="text-2xl font-semibold leading-none tracking-tight">${product.name}</h2> </div> <div class="p-6 pt-0"> <p>${product.desc}</p> <p class="font-bold mt-2">$${product.cost}</p> </div> <div class="flex items-center p-6 pt-0 flex justify-between"> <button class="border px-4 py-2 rounded-md" onclick="handleClick('product_view', ${product.id})" name="">View Details</button> <button class="border px-4 py-2 rounded-md" onclick="handleClick('add_to_cart', ${product.id})">Add to Cart</button> </div> `; productContainer.appendChild(productElement); }); } renderProducts();
// function to handle click events async function handleClick(action, id) { console.log(`Clicked ${action} for product with id ${id}`); } </script> </body>
</html>The above code does the following:
- Uses Tailwind CSS to style the page.
- Renders a list of products.
- Adds a button to view the details of a product.
- Adds a button to add a product to the cart.
- Contains a
handleClickfunction to handle the click events. This function logs the action and the product ID. In the next steps, you will create a pipeline and add the logic to send the click events to this pipeline.
You need to create a new pipeline and connect it to your R2 bucket.
Create a new pipeline clickstream-pipeline-client using the Wrangler CLI:
npx wrangler pipelines create clickstream-pipeline-client --r2-bucket <BUCKET_NAME> --compression noneReplace <BUCKET_NAME> with the name of your R2 bucket.
When you run the command, you will be prompted to authorize Cloudflare Workers Pipelines to create R2 API tokens on your behalf. These tokens are required by your Pipeline. Your Pipeline uses these tokens when loading data into your bucket. You can approve the request through the browser link which will open automatically.
π Authorizing R2 bucket "<BUCKET_NAME>"Opening a link in your default browser: https://oauth.pipelines.cloudflare.com/oauth/login?accountId=<ACCOUNT_ID>&bucketName=<BUCKET_NAME>&pipelineName=clickstream-pipeline-clientπ Checking access to R2 bucket "<BUCKET_NAME>"π Creating Pipeline named "clickstream-pipeline-client"β
Successfully created Pipeline "clickstream-pipeline-client" with id <PIPELINE_ID>π You can now send data to your Pipeline!
To start interacting with this Pipeline from a Worker, open your Workerβs config file and add the following binding configuration:
{ "pipelines": [ { "pipeline": "clickstream-pipeline-client", "binding": "PIPELINE" } ]}
Send data to your Pipeline's HTTP endpoint:
curl "https://<PIPELINE_ID>.pipelines.cloudflare.com" -d '[{"foo": "bar"}]'Make a note of the URL of the pipeline. You will use this URL to send the clickstream data from the client-side.
You need to send clickstream data like the timestamp, user_id, session_id, and device_info to your pipeline. You can generate this data on the client side. Add the following function in the <script> tag in your public/index.html. This function gets the device information:
function extractDeviceInfo(userAgent) { let browser = "Unknown"; let os = "Unknown"; let device = "Unknown";
// Extract browser if (userAgent.includes("Firefox")) { browser = "Firefox"; } else if (userAgent.includes("Chrome")) { browser = "Chrome"; } else if (userAgent.includes("Safari")) { browser = "Safari"; } else if (userAgent.includes("Opera") || userAgent.includes("OPR")) { browser = "Opera"; } else if (userAgent.includes("Edge")) { browser = "Edge"; } else if (userAgent.includes("MSIE") || userAgent.includes("Trident/")) { browser = "Internet Explorer"; }
// Extract OS if (userAgent.includes("Win")) { os = "Windows"; } else if (userAgent.includes("Mac")) { os = "MacOS"; } else if (userAgent.includes("Linux")) { os = "Linux"; } else if (userAgent.includes("Android")) { os = "Android"; } else if (userAgent.includes("iOS")) { os = "iOS"; }
// Extract device const mobileKeywords = [ "Android", "webOS", "iPhone", "iPad", "iPod", "BlackBerry", "Windows Phone", ]; device = mobileKeywords.some((keyword) => userAgent.includes(keyword)) ? "Mobile" : "Desktop";
return { browser, os, device };}You will send the clickstream data to the pipline from the client-side. To do that, update the handleClick function to make a POST request to the pipeline URL with the data. Replace <PIPELINE_URL> with the URL of your pipeline.
async function handleClick(action, id) { console.log(`Clicked ${action} for product with id ${id}`);
const userAgent = window.navigator.userAgent; const timestamp = new Date().toISOString(); const { browser, os, device } = extractDeviceInfo(userAgent);
const data = { timestamp, session_id: "1234567890abcdef", // For production use a unique session ID user_id: "user" + Math.floor(Math.random() * 1000), // For production use a unique user ID event_data: { event_id: Math.floor(Math.random() * 1000), event_type: action, page_url: window.location.href, timestamp, product_id: id, }, device_info: { browser, os, device, userAgent, }, referrer: document.referrer, }; try { const response = await fetch("<PIPELINE_URL>", { method: "POST", headers: { "Content-Type": "application/json", }, body: JSON.stringify([data]), }); if (!response.ok) { throw new Error("Failed to send data to pipeline"); } } catch (error) { console.error("Error sending data to pipeline:", error); }}The handleClick function does the following:
- Gets the device information using the
extractDeviceInfofunction. - Makes a
POSTrequest to the pipeline with the data. - Logs any errors that occur.
If you start the development server and open the application in the browser, you can see the handleClick function gets executed when you click on the View Details or Add to Cart button.
npm run devHowever, no data gets sent to the pipeline. Inspect the browser console to view the error message. The error message you see is for CORS β. In the next step, you will update the CORS settings to allow the client-side JavaScript to send data to the pipeline.
By default, the Pipelines endpoint does not allow cross-origin requests. You need to update the CORS settings to allow the client-side JavaScript to send data to the pipeline. To update the CORS settings, execute the following command:
npx wrangler pipelines update clickstream-pipeline-client --cors-origins http://localhost:8787Now when you run the development server and open the application in the browser, you will see the clickstream data being sent to the pipeline when you click on the View Details or Add to Cart button. You can also see the data in the R2 bucket.
You can learn more about the CORS settings in the Specifying CORS settings documentation.
To deploy the application, run the following command:
npm run deployThis will deploy the application to the Cloudflare Workers platform.
π Building list of assets...π Starting asset upload...π Found 1 new or modified static asset to upload. Proceeding with upload...+ /index.htmlUploaded 1 of 1 assetsβ¨ Success! Uploaded 1 file (2.37 sec)
Total Upload: 25.73 KiB / gzip: 6.17 KiBWorker Startup Time: 15 msUploaded e-commerce-pipelines-client-side (11.79 sec)Deployed e-commerce-pipelines-client-side triggers (7.60 sec) https://<URL>.workers.devCurrent Version ID: <VERSION_ID>Update the CORS settings to add the deployed URL. Replace <URL> with the URL provided in the output of the deploy command.
npx wrangler pipelines update clickstream-pipeline-client --cors-origins https://<URL>.workers.devYou can access the application at the deployed URL. When you click on the View Details or Add to Cart button, the clickstream data will be sent to your pipeline.
You can view the data in the R2 bucket. If you are not signed in to the Cloudflare dashboard, sign in and navigate to the R2 overview page.
Open the bucket you configured for your pipeline in Step 3. You can see the clickstream data in the Objects column.
You have successfully created a Pipeline and used it to send clickstream data from the client. Through this tutorial, you've gained hands-on experience in:
- Creating a Workers project with a static frontend
- Generating and capturing clickstream data
- Setting up a Cloudflare Pipelines to ingest data into R2
- Deploying the application to Workers
For your next steps, consider connecting your R2 bucket to MotherDuck to analyse the data. You can follow the instructions in the Analyzing Clickstream Data with MotherDuck and Cloudflare R2 tutorial to connect your R2 bucket to MotherDuck and analyse data.
You can find the source code of the application in the GitHub repository β.