Lina Xu Posted April 3, 2019 Posted April 3, 2019 Hi I am looking for data profiling tool on AWS marketplace and found some information on Clarity. My questions are Q1. can it connect different data sources such as Amazon S3, files (flat, excel, csv, HDFS Parquet, xml, json) , db2, oracle, teradata, snowflake, PostgreSQL Q2. What the report looks like in a excel file or read online (on screen) Q3. Run data profiling on sample data only or full data If full data, is there limitation on the storage size Q4. How much technical skills are needed from end users (SQL, ETL, ...) Thanks, Lina Xu
Bruno Guimarães Posted May 19, 2020 Posted May 19, 2020 Hello!!! A1: Yes,TIBCO Clarity supports uploading raw data from disparate sources in a wide variety of data formats: File formats:CSV, TSV, *SV, TXT, XLS, XLSX, JSON, XML, and the compressed formats (.zip, .gz, .bz2, .7z) Cloud storage: Box, Dropbox, Google Drive, Amazon S3 Database: Oracle, Microsoft SQL Server, MySQL, PostgreSQL, Amazon Redshift and DB2(Enterprise edition only) Data management software: TIBCO MDM, TIBCO ActiveSpaces BI tools: TIBCO Spotfire (Synchronizing) Marketing tools: Salesforce, Marketo OData TIBCO Data Virtualization(Enterprise edition only) Web URL Clipboard Still more to come, like MongoDB, DynamoDB, Microsoft Dynamics CRM, OData and so on A2: It looks like an Excel view, but also to read online reports A3: You can select to run against whole dataset A4: Clarity was designed to people that don't have SQL Skills, is basically point and click, but also, if you want it's possible to use SQL query.
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now