Sharepoint OneDrive

info

High-Level Information: The Sharepoint OneDrive integration allows you to extract files from your Sharepoint folder and load them into your destination.

Key Details:

You can sync data from 1 Sharepoint site per connection.
Files are merged into a single data stream during the extraction process.
Supported file types include delimited text files such as CSV, TSV, and similar formats.
All data types within the files are automatically converted into string.

Source Setup Guide

in the Edit Source form to the left and follow the authentication flow (OAuth) in Sharepoint's website to grant Extract the required permissions.
Confirm you can see your email and profile picture, and that the source is Connected.
Paste the URL of the OneDrive folder from which you want us to extract the files.
Specify the file pattern that will be used to identify the files for processing.
Indicate the table name you want to use in your destination.
Click "Save"

Connection Setup Guide

Once you conneted Google Drive to a destination, you will also need to configure:

Connection Pull Schedule: Determines how frequently data is extracted from the source.
Backfill (Days): Specifies how far back we should search for updated files.
Schema Migration Policy: Controls how Extract will handle schema changes from the source.

Connector Information

info

File Name Partitioning: Data will be extracted based on file name partitioning, meaning only files modified within the backfill period will be updated.

The Cursor: Tracks the last modified file from the previous run and updates all files that were modified since then.

Additional Fields in the Destination:

internal_file_timestamp: Indicates when the connection run occurred.
internal_file_name: Identifies the file the record originated from.
internal_last_modified: Specifies when the file was last updated in the source.

Files Consistency Inconsistency in the structure of the files might surface problems loading the data

Supported File Formats:

Json
XML
CSV
Gsheet

We support the following json file extensions: .json, .jsonl, .ndjson

Important: The schema will be based on the first record, so ensure all objects have the same schema.

Json objects delimited by lines (any format is acceptable):

{"name": "John", "age": 30}
{"name": "Jane", "age": 25}

{
    "name": "John",
    "age": 30
}
{
    "name": "Jane",
    "age": 25
}

Json array (any format is acceptable):

[
    {"name": "John", "age": 30},
    {"name": "Jane", "age": 25}
]

[
    {
        "name": "John",
        "age": 30
    },
    {
        "name": "Jane",
        "age": 25
    }
]

[{"name": "John","age": 30},{"name": "Jane","age": 25}]

All of the above examples will be inserted at the destination table as:

name	age
John	30
Jane	25

Important: Regardless of the format, we always extract the objects at the top level. Therefore, the objects corresponding to the records should be the top-level objects in the file (either under an array or with no parent object). For example, the following file will not be processed correctly, as we will assume the schema has a single field named `records`:

{
    "records": [
        {"name": "John", "age": 30},
        {"name": "Jane", "age": 25}
    ]
}

This will be inserted at the destination table as:

records
`[{"name": "John", "age": 30}, {"name": "Jane", "age": 25}]`

XML files have a single root tag that contains all the records.
XML can have a complex schema with nested objects and tags, making it less suitable for tabular data (in contrast to JSONL or CSV files). Therefore, we treat each record under the root tag as a single column in the destination table (named 'records'), where the XML representation of the record is stored as a string.

Expected Format:
The records should be enclosed in a single root tag ('items' in this case).
We'll extract each record under the root tag.

<items>
    <item_record>
        <name>John</name>
        <age>30</age>
    </item_record>
    <item_record>
        <name>Jane</name>
        <age>25</age>
    </item_record>
</items>

The above records we'll be inserted at the destination table as:

records
<item_record><name>John</name><age>30</age></item_record>
<item_record><name>Jane</name><age>25</age></item_record>

The following example is invalid because the records are not enclosed in a single root tag:

<root>
    <records>
        <record>
            <name>John</name>
            <age>30</age>
        </record>
        <record>
            <name>Jane</name>
            <age>25</age>
        </record>
    </records>
</root>

Supported Compressions We support the following compression formats:

Gzip
BZip2
Zstd

We automatically detect the compression. Whether you want to have the compression format in the file name or not (e.g. file.csv.bz2 or simply file.csv) is up to you.

Source Setup Guide​

Connection Setup Guide​

Connector Information​

Source Setup Guide

Connection Setup Guide

Connector Information