MyDataWork is a workspace for data analysts. It catalogs the files and cloud assets behind your work, visualizes how they connect, tracks use cases and business outcomes, and helps you communicate the value your data work delivers.
Dashboard is the first tab on every workspace. It surfaces six panels of workspace health at a glance — designed to be read in seconds before you dive into the rest of the app.
The dashboard reflects what you see on the Assets tab — your own assets plus any shared to the team. Admins can see the full workspace catalog on the Setup → Workspace view if needed for oversight.
Demo / real toggle — when your workspace has demo data loaded, a small segmented toggle in the upper-right lets you switch the dashboard between the demo view and your real-data view. The toggle is dashboard-only: it doesn’t change what’s shown in Assets, Use Cases, Suggestions, or the Assessment. When you remove demo data, the toggle disappears and the dashboard renders from your real data only. If your workspace has neither real assets nor demo data, the dashboard shows a "No real assets connected yet" empty state with shortcuts to Setup or to load demo data.
An AI-powered review of your workspace, runnable on demand from Setup → Asset Estate Assessment. Surfaces connections worth documenting, use case opportunities, and health observations — the kind of analysis a senior analyst would write up after auditing your estate. First run is free on every plan — one free assessment per email, forever. You can re-run the assessment any time afterward; subsequent runs use AI credits like other AI features.
Install the Windows Connector once. It runs silently in the background, scanning the folders you choose and keeping your workspace current automatically.
Best for: Excel, SQL, Python, Power BI, Tableau, Alteryx, CSV files on your PC or synced cloud drives (OneDrive, Google Drive, Dropbox).
Connect directly to cloud platforms using an API token. No installation required — works on any device.
Supported: GitHub repositories, dbt Cloud projects, Databricks workspaces, Snowflake databases.
The MyDataWork Connector is a small background process that scans your chosen folders and sends file metadata to your workspace. It starts automatically when you log in to Windows.
Connect cloud platforms directly from Setup → Cloud Sources. No installation required. Each source type needs an API token or credentials.
| Source | What's indexed | Credentials needed |
|---|---|---|
| GitHub | Data files (.sql, .py, .ipynb, .xlsx, .csv) across repositories | Personal Access Token with repo read access |
| dbt Cloud | Models, sources, and exposures from dbt projects | dbt Cloud API token and account ID |
| Databricks | Notebooks and jobs from your workspace | Workspace URL and personal access token |
| Snowflake | Tables and views from your databases | Account, username, password, database, schema |
Cloud source assets appear in the Assets tab alongside your local file assets, with a badge showing their source (GitHub, dbt, Databricks, or Snowflake).
When you connect Snowflake or Databricks to MyDataWork, you’re connecting more than raw tables. You’re cataloging the work products of every tool that writes into those warehouses — which, in the modern data stack, is most of them.
This is one of MyDataWork’s core design choices: integrate with the warehouse, not the dozen tools that use it. When your stack evolves, MyDataWork follows automatically. If it lands in your warehouse, it lands in MyDataWork.
These tools persist their outputs as tables, views, materializations, or model artifacts in Snowflake or Databricks. When you connect your warehouse, MyDataWork catalogs those objects automatically — schema, columns, lineage, freshness — with no tool-specific integration needed.
Transformation
Analytics and BI builders
ML and data science
Data movement (the landing zone for your warehouse)
These tools query your warehouse but persist their own state — dashboards, semantic models, extracts — in their own cloud. MyDataWork catalogs the source-of-truth tables they depend on, so you can see what data feeds each dashboard and what would break if a column changes. The dashboards themselves live in those tools’ own admin consoles.
These tools store their data in their own cloud, not in your warehouse. To catalog them in MyDataWork, replicate their data into Snowflake or Databricks first — typically via Fivetran, Airbyte, or a similar service. Examples:
MyDataWork also catalogs assets from these sources directly:
A team uses Snowflake as their warehouse, dbt to transform data, Sigma to build analytics, DataRobot for ML predictions, and Tableau for executive dashboards. They also use Smartsheet for project tracking.
When MyDataWork connects to their Snowflake:
The modern data stack consolidates around the warehouse. By cataloging what lives in your warehouse, MyDataWork covers the work products of essentially every modern data tool — without you waiting for tool-specific connectors, and without losing coverage as your team adopts new tools.
| Question | Answer |
|---|---|
| Does MyDataWork support [my specific tool]? | MyDataWork integrates with your data warehouses (Snowflake, Databricks) and with local files via the desktop Connector. Most modern data tools (dbt, Sigma, Hex, Dataiku, DataRobot, Alteryx, Fivetran, Matillion, Coalesce, Prophecy, Datameer, Airbyte) write their work products as tables and views into your warehouse, and those outputs are cataloged automatically when you connect your warehouse — no tool-specific integration needed. BI tools (Tableau, Power BI, Looker, ThoughtSpot) query the warehouse rather than writing to it; MyDataWork catalogs the source-of-truth tables those dashboards depend on. Tools whose data lives in their own backend (Smartsheet, Airtable, Salesforce, HubSpot, etc.) require replication to your warehouse first via Fivetran or Airbyte. See What gets cataloged through your cloud connections for the full breakdown. |
| What if my team adopts a new tool you’ve never heard of? | If the new tool persists its outputs to Snowflake or Databricks, MyDataWork catalogs those outputs automatically — no waiting for a tool-specific connector. This is the design choice: MyDataWork integrates with the warehouse, not the dozen tools that use it. If the new tool stores data only in its own backend, replicate it to your warehouse via Fivetran or Airbyte and the resulting tables become cataloged like any other warehouse object. |
To remove a cloud source, click Remove next to the source in Setup → Cloud Sources. A confirmation dialog gives you a choice:
For GitHub connectors configured with multiple repositories, “Also remove all assets” applies to all repos in that connector.
From metadata alone — without ever reading file contents — MyDataWork can tell you which Excel models feed which Power BI dashboards, which SQL scripts populate which reports, and how your entire data landscape is connected.
Assets in the Assets tab may show status badges indicating their current state:
Hover over a Source unreachable badge to see the specific error type: path not found, permission denied, timeout, or OS error.
| Type | Extensions | What's extracted |
|---|---|---|
| Excel | .xlsx, .xlsm | Sheet names |
| CSV | .csv | Column names |
| SQL | .sql | Table references (FROM / JOIN) |
| Python notebooks | .py, .ipynb | Embedded SQL tables, imports |
| Alteryx | .yxmd | Connection hints |
| Tableau | .twb, .twbx | Connection hints |
| Power BI | .pbix, .pbit | File name and path |
| ThoughtSpot | .tml | Object kind (worksheet / model / answer / liveboard), tables and connections referenced |
| Dataiku | .zip (DSS project export) | Project name, recipe count, dataset count |
| Looker | .lkml (.view.lkml, .model.lkml, .dashboard.lkml) | Object kind, dimensions, measures, connection, sql_table_name references |
| GitHub | All data file types | File path, size, last modified |
| dbt Cloud | Models, sources | Model name, project, materialization |
| Databricks | Notebooks, jobs | Notebook path, language, last modified |
| Snowflake | Tables, views | Table name, schema, row count, last altered |
ThoughtSpot, Dataiku, and Looker integrations support file-based metadata extraction. Cloud-native instances of these tools require export or local file access.
MyDataWork includes several AI-powered features that require AI to be enabled in your workspace. Go to Setup → AI Assist to enable AI. Once enabled, the following features become available:
All AI features share a daily credit quota. Solo users receive 5 AI credits per day. Team plan users receive 20 per day. Credits reset daily. The Assistant draws from the same quota.
The Leverage tab is your AI-powered analysis hub. It offers four modes, each requiring AI to be enabled:
The Suggestions tab includes a proactive observer that scans your workspace on demand and surfaces six categories of state worth your attention. Click Analyze and the agent runs every rule eligible for your plan, producing suggestions with deep-links to the relevant asset or use case.
The six rules group into three categories:
Each suggestion can be dismissed with a reason (not relevant, already handled, revisit later) and reacted to with 👍/👎 feedback. Auto-resolve: when a suggestion's condition no longer applies (you assign the stakeholder, link the asset, restore the removed asset), it's automatically marked resolved on the next run.
Your first Analyze run is complimentary. Subsequent runs draw from your daily AI credit pool, with a ceiling of up to 8 credits per run (shown above the Analyze button before you click). In practice runs usually consume far fewer credits because most rules don't find candidates.
Each use case has a Notes URL field on the Overview tab. Paste a Google Doc, Notion page, Confluence URL, or any HTTPS link where you keep notes for this use case. When set, an "Open notes ↑↗" button appears in the panel and a small ↑↗ indicator shows next to the use case in the list view.
Team workspaces are private by default for every role — including admins. Each member’s assets and use cases are visible only to that member; the team admin sees the same private-by-default view of their own work. The asset list defaults to a merged view showing your own assets plus any asset another team member has explicitly shared to the team.
The Assets tab has a toggle — All (default, your own + shared), Mine (only your own), and Shared (only team-shared) — so you can focus on what’s relevant. Admins additionally see a Workspace tab that exposes the full workspace catalog when needed for onboarding, audit, or oversight tasks. Use of the Workspace view is explicit, not the default. The Use Cases tab has an All / Mine toggle (plus the admin-only Workspace tab); use-case sharing is on the roadmap and a Shared option will appear there when it ships.
Export your entire workspace as a JSON file for backup or migration. Go to Setup → Data portability.
The Connector sends only file metadata to your workspace — never file contents. Your files stay on your machine. Cloud source connectors read only the asset catalog information exposed by each platform's API — never the underlying data.
MyDataWork never reads, copies, stores, or transmits the contents of your files. Only names, paths, types, sizes, and modification dates are collected. All AI features — including the Asset Estate Assessment, Workspace Agent, Leverage analyses, and the in-app assistant — send only asset metadata (names, paths, tool types, topic tags, lineage edges) and use case text (titles, objectives, descriptions, value figures, stakeholder names) to the AI model. File contents are never sent.
Demo data is fully synthetic. Loading demo from the wizard, the Setup tab’s “Load demo data” button, or the Dashboard’s empty-state button creates fabricated workspace data (assets, use cases, lineage) for exploration. It carries no real customer information and can be removed any time from Setup → Connect your data with no impact on the rest of your workspace.
MyDataWork includes lightweight statistical checks designed to help you present use case values you can stand behind in stakeholder conversations.
This approach is inspired by robust statistics, a field concerned with producing reliable estimates even when data contains outliers or errors.
MyDataWork can push use case summaries to Jira as new issues or comments on existing issues. This is useful when your organization uses Jira for project tracking and you need to keep it updated without maintaining two separate records.
https://yourcompany.atlassian.net) and your API token. For Jira Cloud, use email:api_token format. For Jira Data Center, use a Personal Access Token.MyDataWork works in mobile browsers with no app download needed. Once your desktop is set up, you can manage your portfolio from any device.
| Question | Answer |
|---|---|
| What plans are available? | MyDataWork offers a free Explorer plan and four paid plans. The Explorer plan is a 90-day free trial with no credit card required — 1 seat, 30 user-added assets, 3 AI credits per day with a 60-credit total cap. The paid plans are Solo Monthly ($20/month, 1 seat, 300 assets, 5 credits/day), Solo Annual ($192/year, same caps as Solo Monthly), Team Starter ($900/year, 2-5 seats, 2,000 assets, 20 credits/day), and Team Growth ($1,440/year, 6-10 seats, same caps as Team Starter). Paid plans include a 14-day free trial. Visit mydatawork.com or go to Account → View plans & billing for current pricing and plan details. |
| How do I upgrade? | Go to Account → View plans & billing, or visit /app/billing. Click Start free trial on any plan to begin. |
| How do I cancel? | Go to Account → View plans & billing → Manage billing. Monthly Solo plans continue until end of period. Annual plans are non-refundable. Canceling is different from deleting — see below. |
| What happens to my data if I cancel? | Your workspace data is retained for 30 days after cancellation. You can resubscribe during this period and resume where you left off. Contact contact@mydatawork.com to export during this period. This is different from account deletion, which removes all data immediately. |
| What’s the difference between canceling and deleting? | Canceling your subscription stops future charges but keeps your data for 30 days — you can resubscribe and pick up where you left off. Deleting your account permanently removes your account, workspace, and all data immediately with no recovery option. If you just want to stop paying, cancel your subscription — don’t delete your account. |
| Annual plan refunds on deletion? | Annual plans are non-refundable. If you delete your account mid-cycle, your subscription is canceled immediately and any remaining prepaid time is forfeited. |
| How do I add team members? | As the workspace admin, go to Setup → Members. Enter the team member's name, email, and a starter password. Share credentials with them directly. They should change their password on first login. |
| Problem | Solution |
|---|---|
| Setup file does not open on Windows | Right-click the downloaded .exe and choose Run as administrator. If Windows shows a security warning, click More info then Run anyway. |
| No files appear after setup | Wait 30–60 seconds and refresh. The first scan can take up to a minute for large folders. Make sure you selected the correct top-level folder during setup. |
| 'Another process is running' error | The Connector is already running from a previous install. The new installer will stop it automatically — click Next to continue. |
| Open file does not work | The Connector must be running in the background. Check Task Manager. If not found, go to Setup and reconnect your folders. |
| I need to add or change a folder | Go to Setup → click Connect folders. You can update your folder selection at any time without reinstalling the Connector. |
| "Connect folders" shows an error / the Connector is serving another workspace | A single Connector installation can serve multiple workspaces. Go to Setup → Connect Data, find the right-side Connector panel, and click Add this workspace to Connector. The Connector adds this workspace to its config and starts scanning here too — no reinstall needed. Once added, try Connect folders again. |
| I forgot my password | On the login page, click Forgot password? and enter your email. A reset link will be sent — it expires after 60 minutes. |
| I need to upgrade or manage my subscription | Go to Account → View plans & billing to start a free trial, upgrade, or manage your subscription. |
| Cloud source not syncing | Check that your API token has the correct permissions (repo read for GitHub, API access for dbt/Databricks). Click Scan now in Setup → Cloud Sources to trigger a fresh sync. |
| AI features not working | Go to Setup → AI Assist and confirm AI is enabled (status shows "enabled" in teal). If disabled, click Enable AI. Also confirm your daily quota has not been reached — usage is shown in the same section. |
| Automation candidates or marketplace recommendations not generating | Confirm AI is enabled in Setup → AI Assist. For marketplace recommendations, also confirm you have selected at least one cloud provider in Setup → Cloud Providers. |
| I shared an automation opportunity but want to share it again | Previously generated opportunities are saved in the archive section of the Leverage tab under "Identify automation candidates." Expand any past opportunity and use the Share button to send it again. |
| External dependencies not showing in lineage | Enable the "Show external dependencies" toggle above the lineage visualization in the Lineage tab or above the lineage preview in the Assets tab. The two toggles are independent — enabling one does not affect the other. Both default to off. |
| Ghost nodes not appearing in exported lineage diagrams | In the Portfolio tab export section, check the "Include external dependencies in lineage" checkbox before clicking Export now. This option defaults to off to keep standard exports clean. |
| Migration Assist returns no results or an error | Confirm AI is enabled in Setup → AI Assist and that your daily quota has not been reached. If analyzing a use case, ensure at least one asset is linked to that use case — Migration Assist needs asset tool type information to generate recommendations. |
| 'Connector not found' on mobile | This is expected. Connectors only run on Windows desktop. You can still view all previously discovered assets, manage use cases, and export from mobile. Set up the Connector on a desktop first. |
| Small text or buttons on phone | Use landscape mode or a tablet for a better experience. MyDataWork is optimized for touch interfaces but detailed views work best in landscape. |
| How do I delete my account? | Go to Account settings and click “Delete my account” at the bottom. The deletion dialog includes an optional feedback form (reason and comments), then requires a confirmation checkbox. Your account and all workspace data will be deleted immediately and permanently — this cannot be undone. Active subscriptions are canceled automatically. Annual plans are non-refundable and remaining prepaid time is forfeited. Team admins must transfer ownership or remove all members first. Super-admin accounts cannot be self-deleted. After deletion you are redirected to the login page with a confirmation message. |
| Send to Jira button is grayed out | Jira has not been configured yet. Go to Setup → Integrations, enter your Jira instance URL and API token, and click Connect. Once connected the button becomes active on all use case Overview tabs. |
| Jira push fails or shows an error | Check that your Jira API token has permission to create issues and add comments in the target project. For Jira Cloud use email:api_token format. For Jira Data Center use a Personal Access Token. Verify your instance URL includes https:// and has no trailing slash. |
| I don't see the value quality checks or Portfolio distribution section | These features require a minimum number of use cases with non-zero values to activate — at least 4 for the per-field outlier hints and at least 3 for the Portfolio value distribution summary. With fewer use cases the checks run silently to avoid false positives. Add more use cases with estimated and realized values to see these features. |
| ‘Rate limit exceeded’ message | Some actions have daily or hourly limits to ensure system reliability: lineage rebuilds (5 per day), Jira pushes (20 per hour), exports (10 per day). Wait for the limit to reset or contact support if you need higher limits. |
The Windows Connector is required only for local file folders. Cloud sources (GitHub, dbt, Databricks, Snowflake) do not require the Connector.
Download Connector for WindowsStill stuck? Email contact@mydatawork.com. We aim to respond within 2 business days.