HTTP Downloads
SheetJS is a JavaScript library for reading and writing data from spreadsheets.
A number of JavaScript APIs, including XMLHttpRequest
and fetch
, allow
scripts to download spreadsheets for further processing.
This demo uses various APIs and wrapper libraries to download workbooks and pass raw binary data to SheetJS libraries.
-
"Browser Demos" run entirely within the web browser. A test workbook will be downloaded and parsed in the web browser.
-
"NodeJS Demos" run in NodeJS and other server-side platforms.
This demo focuses on downloading files. Other demos cover other HTTP use cases:
- "HTTP Uploads" covers uploading files
- "HTTP Server Processing" covers HTTP servers
Third-party cloud platforms such as AWS may corrupt raw binary downloads by encoding requests and responses in UTF-8 strings.
For AWS, in the "Binary Media Types" section of the API Gateway console, the
"application/vnd.ms-excel"
type should be added to ensure that AWS Lambda
functions functions can send files to clients.
Binary Data​
Most interesting spreadsheet files are binary data that contain byte sequences that represent invalid UTF-8 characters.
APIs generally provide options to control how downloaded data is interpreted.
The arraybuffer
response type usually forces the data to be presented as an
ArrayBuffer
object which can be parsed with the SheetJS read
method1.
The following example shows the data flow using fetch
to download files:
flowchart LR
server[(Remote\nFile)]
response(Response\nobject)
subgraph SheetJS operations
ab(XLSX Data\nArrayBuffer)
wb(((SheetJS\nWorkbook)))
end
server --> |`fetch`\nGET request| response
response --> |`arrayBuffer`\n\n| ab
ab --> |`read`\n\n| wb
/* download data into an ArrayBuffer object */
const res = await fetch("https://docs.sheetjs.com/pres.numbers");
const ab = await res.arrayBuffer(); // recover data as ArrayBuffer
/* parse file */
const wb = XLSX.read(ab);
Browser Demos​
When the page is accessed, https://docs.sheetjs.com/pres.numbers will be fetched
and parsed. The old table will be replaced with a table whose contents match the
first worksheet. The SheetJS sheet_to_html
method2 creates the HTML table.
Each browser demo was tested in the following environments:
Browser | Date |
---|---|
Chrome 126 | 2024-06-19 |
Safari 17.3 | 2024-06-19 |
XMLHttpRequest​
For downloading data, the arraybuffer
response type generates an ArrayBuffer
that can be viewed as an Uint8Array
and fed to the SheetJS read
method. For
legacy browsers, the option type: "array"
should be specified:
/* set up an async GET request */
var req = new XMLHttpRequest();
req.open("GET", url, true);
req.responseType = "arraybuffer";
req.onload = function(e) {
/* parse the data when it is received */
var data = new Uint8Array(req.response);
var workbook = XLSX.read(data, {type:"array"});
/* DO SOMETHING WITH workbook HERE */
};
req.send();
Live Download demo (click to show)
fetch​
For downloading data, Response#arrayBuffer
resolves to an ArrayBuffer
that
can be converted to Uint8Array
and passed to the SheetJS read
method:
fetch(url).then(function(res) {
/* get the data as a Blob */
if(!res.ok) throw new Error("fetch failed");
return res.arrayBuffer();
}).then(function(ab) {
/* parse the data when it is received */
var data = new Uint8Array(ab);
var workbook = XLSX.read(data, {type:"array"});
/* DO SOMETHING WITH workbook HERE */
});
Live Download demo (click to show)
jQuery​
jQuery is a JavaScript library that includes helpers for
performing "Ajax" network requests. jQuery.ajax
($.ajax
) does not support
binary data out of the box3. A custom ajaxTransport
can add support.
SheetJS users have reported success with jquery.binarytransport.js
4 in IE10.
After including the main jquery.js
and jquery.binarytransport.js
scripts,
$.ajax
will support dataType: "binary"
and processData: false
.
In a GET request, the default behavior is to return a Blob
object. Passing
responseType: "arraybuffer"
returns a proper ArrayBuffer
object in IE10:
$.ajax({
type: "GET", url: "https://docs.sheetjs.com/pres.numbers",
/* suppress jQuery post-processing */
processData: false,
/* use the binary transport */
dataType: "binary",
/* pass an ArrayBuffer in the callback */
responseType: "arraybuffer",
success: function (ab) {
/* at this point, ab is an ArrayBuffer */
var wb = XLSX.read(ab);
/* do something with workbook here */
var ws = wb.Sheets[wb.SheetNames[0]];
var html = XLSX.utils.sheet_to_html(ws);
$("#out").html(html);
}
});
Wrapper Libraries​
Before fetch
shipped with browsers, there were various wrapper libraries to
simplify XMLHttpRequest
. Due to limitations with fetch
, these libraries are
still relevant.
axios​
axios
presents a Promise based interface. Setting
responseType
to arraybuffer
ensures the return type is an ArrayBuffer. The
data
property of the result can be passed to the SheetJS read
method:
async function workbook_dl_axios(url) {
const res = await axios(url, {responseType:'arraybuffer'});
const workbook = XLSX.read(res.data);
return workbook;
}
Live Download demo (click to show)
superagent​
superagent
is a network request library
with a "Fluent Interface". Calling the responseType
method with
"arraybuffer"
will ensure the final response object is an ArrayBuffer
:
/* set up an async GET request with superagent */
superagent
.get(url)
.responseType('arraybuffer')
.end(function(err, res) {
/* parse the data when it is received */
var data = new Uint8Array(res.body);
var workbook = XLSX.read(data, {type:"array"});
/* DO SOMETHING WITH workbook HERE */
});
Live Download demo (click to show)
NodeJS Demos​
These examples show how to download data in NodeJS.
HTTPS GET​
The https
module provides a low-level get
method for HTTPS GET requests:
var https = require("https"), XLSX = require("xlsx");
https.get('https://docs.sheetjs.com/pres.xlsx', function(res) {
var bufs = [];
res.on('data', function(chunk) { bufs.push(chunk); });
res.on('end', function() {
var buf = Buffer.concat(bufs);
var wb = XLSX.read(buf);
/* print the first worksheet to console */
var ws = wb.Sheets[wb.SheetNames[0]];
console.log(XLSX.utils.sheet_to_csv(ws));
});
});
This demo was tested in the following environments:
NodeJS | Date | Workarounds |
---|---|---|
0.10.48 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
0.12.18 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
4.9.1 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
6.17.1 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
8.17.0 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
10.24.1 | 2024-06-21 | |
12.22.12 | 2024-06-21 | |
14.21.3 | 2024-06-21 | |
16.20.2 | 2024-06-21 | |
18.20.3 | 2024-06-21 | |
20.15.0 | 2024-06-21 | |
22.3.0 | 2024-06-21 |
The NODE_TLS_REJECT_UNAUTHORIZED
workaround sets the value to '0'
:
process.env.NODE_TLS_REJECT_UNAUTHORIZED = '0';
Complete Example (click to show)
fetch​
Experimental support for fetch
was introduced in NodeJS 16.15.0
. It will be
considered stable in NodeJS LTS version 22
.
The fetch
implementation has the same return types as the browser version:
async function parse_from_url(url) {
const res = await fetch(url);
if(!res.ok) throw new Error("fetch failed");
const ab = await res.arrayBuffer();
const workbook = XLSX.read(ab);
return workbook;
}
This demo was tested in the following environments:
NodeJS | Date |
---|---|
18.20.3 | 2024-06-21 |
20.15.0 | 2024-06-21 |
22.3.0 | 2024-06-21 |
Complete Example (click to show)
Wrapper Libraries​
The latest releases of NodeJS support fetch
natively. Before fetch
support
was added to the platform, third party modules wrapped the native APIs.
request​
request
has been deprecated and should only be used in legacy deployments.
Setting the option encoding: null
passes raw buffers:
var XLSX = require('xlsx'), request = require('request');
var url = 'https://docs.sheetjs.com/pres.xlsx';
/* call `request` with the option `encoding: null` */
request(url, {encoding: null}, function(err, res, data) {
if(err || res.statusCode !== 200) return;
/* if the request was successful, parse the data */
var wb = XLSX.read(data);
/* print the first worksheet to console */
var ws = wb.Sheets[wb.SheetNames[0]];
console.log(XLSX.utils.sheet_to_csv(ws));
});
This demo was tested in the following environments:
NodeJS | Date | Workarounds |
---|---|---|
0.10.48 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
0.12.18 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
4.9.1 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
6.17.1 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
8.17.0 | 2024-06-21 | NODE_TLS_REJECT_UNAUTHORIZED |
10.24.1 | 2024-06-21 | |
12.22.12 | 2024-06-21 | |
14.21.3 | 2024-06-21 | |
16.20.2 | 2024-06-21 | |
18.20.3 | 2024-06-21 | |
20.15.0 | 2024-06-21 | |
22.3.0 | 2024-06-21 |
The NODE_TLS_REJECT_UNAUTHORIZED
workaround sets the value to '0'
:
process.env.NODE_TLS_REJECT_UNAUTHORIZED = '0';
Complete Example (click to show)
axios​
When the responseType
is "arraybuffer"
, axios
actually captures the data
in a NodeJS Buffer. The SheetJS read
method handles NodeJS Buffer objects:
const XLSX = require("xlsx"), axios = require("axios");
async function workbook_dl_axios(url) {
const res = await axios(url, {responseType:'arraybuffer'});
/* at this point, res.data is a Buffer */
const workbook = XLSX.read(res.data);
return workbook;
}
This demo was tested in the following environments:
NodeJS | Axios | Date |
---|---|---|
10.24.1 | 0.28.1 | 2024-06-21 |
12.22.12 | 1.7.2 | 2024-06-21 |
14.21.3 | 1.7.2 | 2024-06-21 |
16.20.2 | 1.7.2 | 2024-06-21 |
18.20.3 | 1.7.2 | 2024-06-21 |
20.15.0 | 1.7.2 | 2024-06-21 |
22.3.0 | 1.7.2 | 2024-06-21 |
Complete Example (click to show)
Other Platforms​
Other demos show network operations in special platforms:
- React Native "Fetching Remote Data"
- NativeScript "Fetching Remote Files"
- AngularJS "Remote Files"
- Dojo Toolkit "Parsing Remote Files"
Footnotes​
-
See
dataType
injQuery.ajax
in the official jQuery documentation. ↩ -
See the official
jquery.binarytransport.js
repo for more details. ↩