This article describes how to ingest a file using Python3.
The main steps in the code involve using your Tenant API Key (for security measures this should be a variable, and not hardcoded!) along with the name of the file you want to upload (this may be a variable, in case you want to use the same script to periodically upload data incrementally) to generate a Signed URL.
You then use the generated Signed URL to upload the file to the Peak Data lake.
Prerequisites: Tenant API Key. |
Sample Code
##### # Sending Peak a file via SignedURL using python3 # Please note this is only a quick and simple code example # Other file types/formats etc. may vary # No responsibility is taken by the author for the running of the code #### import requests import os import sys # make sure your tenant api key is set as a variable # please do not hard code sensitive information in code! peak_tenant_api_key = os.environ["peak_tenant_api_key"] file_name = "sample_20181112.csv" ###### ## First call - Generating the SignedURL ###### # filename in the payload may be a variable # use the same script to send incremental data updates payload = '{"fileName": "' + file_name + '"}' headers = {'Content-Type': 'application/json', 'Authorization': peak_tenant_api_key} url = 'https://api.peak.ai/file/signedurl/client' # make a post request print('Generating URL...') first_response = requests.post(url, data=payload, headers=headers) if first_response.status_code == 200: print('Status code: ' + str(first_response.status_code) + ' - Success!') generated_url=first_response.text else: print('Status code: ' + str(first_response.status_code) + ' - Something went wrong!') sys.exit(1) ###### ## Second call - Uploading the file ###### # This time the url will be the signedURL we generated # so we use the result of the first call # remove the leading and training double quotes url = generated_url.strip('"') headers = {'Content-Type': 'application/json'} # read in the file for upload payload=open(file_name, 'rb') print('Uploading file...') # This time we use a put request to upload the file put_response = requests.put(url, data=payload, headers=headers) if put_response.status_code == 200: print('Status code: ' + str(put_response.status_code) + ' - Success!') else: print('Status code: ' + str(put_response.status_code) + ' - Something went wrong!') sys.exit(1)
Next Steps:
Once the data has been ingested into the Peak data lake, a feed can be created and scheduled.