← Back to app

Help — MyDataWork

MyDataWork is a workspace for data analysts. It catalogs the files and cloud assets behind your work, visualizes how they connect, tracks use cases and business outcomes, and helps you communicate the value your data work delivers.

Getting Started

New to MyDataWork? Read the Getting Started Guide (PDF) →

Asset Health Dashboard

Dashboard is the first tab on every workspace. It surfaces six panels of workspace health at a glance — designed to be read in seconds before you dive into the rest of the app.

The dashboard reflects what you see on the Assets tab — your own assets plus any shared to the team. Admins can see the full workspace catalog on the Setup → Workspace view if needed for oversight.

Demo / real toggle — when your workspace has demo data loaded, a small segmented toggle in the upper-right lets you switch the dashboard between the demo view and your real-data view. The toggle is dashboard-only: it doesn’t change what’s shown in Assets, Use Cases, Suggestions, or the Assessment. When you remove demo data, the toggle disappears and the dashboard renders from your real data only. If your workspace has neither real assets nor demo data, the dashboard shows a "No real assets connected yet" empty state with shortcuts to Setup or to load demo data.

Asset Estate Assessment

An AI-powered review of your workspace, runnable on demand from Setup → Asset Estate Assessment. Surfaces connections worth documenting, use case opportunities, and health observations — the kind of analysis a senior analyst would write up after auditing your estate. First run is free on every plan — one free assessment per email, forever. You can re-run the assessment any time afterward; subsequent runs use AI credits like other AI features.

Two Ways to Connect Your Assets

📁 Local File Folders

Install the Windows Connector once. It runs silently in the background, scanning the folders you choose and keeping your workspace current automatically.

Best for: Excel, SQL, Python, Power BI, Tableau, Alteryx, CSV files on your PC or synced cloud drives (OneDrive, Google Drive, Dropbox).

☁️ Cloud Sources

Connect directly to cloud platforms using an API token. No installation required — works on any device.

Supported: GitHub repositories, dbt Cloud projects, Databricks workspaces, Snowflake databases.

The Windows Connector

The MyDataWork Connector is a small background process that scans your chosen folders and sends file metadata to your workspace. It starts automatically when you log in to Windows.

Cloud Sources

Connecting Snowflake or Databricks catalogs much more than raw tables — it catalogs the work products of every tool that writes into your warehouse (dbt, Sigma, Hex, Dataiku, DataRobot, and more). See What gets cataloged through your cloud connections ↓ for the full picture.

Connect cloud platforms directly from Setup → Cloud Sources. No installation required. Each source type needs an API token or credentials.

SourceWhat's indexedCredentials needed
GitHubData files (.sql, .py, .ipynb, .xlsx, .csv) across repositoriesPersonal Access Token with repo read access
dbt CloudModels, sources, and exposures from dbt projectsdbt Cloud API token and account ID
DatabricksNotebooks and jobs from your workspaceWorkspace URL and personal access token
SnowflakeTables and views from your databasesAccount, username, password, database, schema
Cloud Sources is included on all plans. Connection behavior may vary by platform — contact us at contact@mydatawork.com if you encounter issues.

Cloud source assets appear in the Assets tab alongside your local file assets, with a badge showing their source (GitHub, dbt, Databricks, or Snowflake).

What gets cataloged through your cloud connections

When you connect Snowflake or Databricks to MyDataWork, you’re connecting more than raw tables. You’re cataloging the work products of every tool that writes into those warehouses — which, in the modern data stack, is most of them.

This is one of MyDataWork’s core design choices: integrate with the warehouse, not the dozen tools that use it. When your stack evolves, MyDataWork follows automatically. If it lands in your warehouse, it lands in MyDataWork.

Tools whose work products are cataloged via your warehouse connection

These tools persist their outputs as tables, views, materializations, or model artifacts in Snowflake or Databricks. When you connect your warehouse, MyDataWork catalogs those objects automatically — schema, columns, lineage, freshness — with no tool-specific integration needed.

Transformation

Analytics and BI builders

ML and data science

Data movement (the landing zone for your warehouse)

Tools that read from the warehouse rather than write to it

These tools query your warehouse but persist their own state — dashboards, semantic models, extracts — in their own cloud. MyDataWork catalogs the source-of-truth tables they depend on, so you can see what data feeds each dashboard and what would break if a column changes. The dashboards themselves live in those tools’ own admin consoles.

Tools whose data lives in their own backend

These tools store their data in their own cloud, not in your warehouse. To catalog them in MyDataWork, replicate their data into Snowflake or Databricks first — typically via Fivetran, Airbyte, or a similar service. Examples:

Direct connectors not dependent on a warehouse

MyDataWork also catalogs assets from these sources directly:

A practical example

A team uses Snowflake as their warehouse, dbt to transform data, Sigma to build analytics, DataRobot for ML predictions, and Tableau for executive dashboards. They also use Smartsheet for project tracking.

When MyDataWork connects to their Snowflake:

The strategic point

The modern data stack consolidates around the warehouse. By cataloging what lives in your warehouse, MyDataWork covers the work products of essentially every modern data tool — without you waiting for tool-specific connectors, and without losing coverage as your team adopts new tools.

Frequently asked questions about supported tools

QuestionAnswer
Does MyDataWork support [my specific tool]?MyDataWork integrates with your data warehouses (Snowflake, Databricks) and with local files via the desktop Connector. Most modern data tools (dbt, Sigma, Hex, Dataiku, DataRobot, Alteryx, Fivetran, Matillion, Coalesce, Prophecy, Datameer, Airbyte) write their work products as tables and views into your warehouse, and those outputs are cataloged automatically when you connect your warehouse — no tool-specific integration needed. BI tools (Tableau, Power BI, Looker, ThoughtSpot) query the warehouse rather than writing to it; MyDataWork catalogs the source-of-truth tables those dashboards depend on. Tools whose data lives in their own backend (Smartsheet, Airtable, Salesforce, HubSpot, etc.) require replication to your warehouse first via Fivetran or Airbyte. See What gets cataloged through your cloud connections for the full breakdown.
What if my team adopts a new tool you’ve never heard of?If the new tool persists its outputs to Snowflake or Databricks, MyDataWork catalogs those outputs automatically — no waiting for a tool-specific connector. This is the design choice: MyDataWork integrates with the warehouse, not the dozen tools that use it. If the new tool stores data only in its own backend, replicate it to your warehouse via Fivetran or Airbyte and the resulting tables become cataloged like any other warehouse object.

Managing Cloud Sources

To remove a cloud source, click Remove next to the source in Setup → Cloud Sources. A confirmation dialog gives you a choice:

For GitHub connectors configured with multiple repositories, “Also remove all assets” applies to all repos in that connector.

What Metadata Tells You

From metadata alone — without ever reading file contents — MyDataWork can tell you which Excel models feed which Power BI dashboards, which SQL scripts populate which reports, and how your entire data landscape is connected.

Asset Status Badges

Assets in the Assets tab may show status badges indicating their current state:

Hover over a Source unreachable badge to see the specific error type: path not found, permission denied, timeout, or OS error.

Supported File Types

TypeExtensionsWhat's extracted
Excel.xlsx, .xlsmSheet names
CSV.csvColumn names
SQL.sqlTable references (FROM / JOIN)
Python notebooks.py, .ipynbEmbedded SQL tables, imports
Alteryx.yxmdConnection hints
Tableau.twb, .twbxConnection hints
Power BI.pbix, .pbitFile name and path
ThoughtSpot.tmlObject kind (worksheet / model / answer / liveboard), tables and connections referenced
Dataiku.zip (DSS project export)Project name, recipe count, dataset count
Looker.lkml (.view.lkml, .model.lkml, .dashboard.lkml)Object kind, dimensions, measures, connection, sql_table_name references
GitHubAll data file typesFile path, size, last modified
dbt CloudModels, sourcesModel name, project, materialization
DatabricksNotebooks, jobsNotebook path, language, last modified
SnowflakeTables, viewsTable name, schema, row count, last altered

ThoughtSpot, Dataiku, and Looker integrations support file-based metadata extraction. Cloud-native instances of these tools require export or local file access.

AI-Powered Features

MyDataWork includes several AI-powered features that require AI to be enabled in your workspace. Go to Setup → AI Assist to enable AI. Once enabled, the following features become available:

All AI features share a daily credit quota. Solo users receive 5 AI credits per day. Team plan users receive 20 per day. Credits reset daily. The Assistant draws from the same quota.

Note: AI features will not function if AI is disabled in Setup, even if your plan includes AI access. The app will show a clear notice if AI needs to be enabled before a feature can be used.

Leverage Tab

The Leverage tab is your AI-powered analysis hub. It offers four modes, each requiring AI to be enabled:

Workspace Agent

The Suggestions tab includes a proactive observer that scans your workspace on demand and surfaces six categories of state worth your attention. Click Analyze and the agent runs every rule eligible for your plan, producing suggestions with deep-links to the relevant asset or use case.

The six rules group into three categories:

Each suggestion can be dismissed with a reason (not relevant, already handled, revisit later) and reacted to with 👍/👎 feedback. Auto-resolve: when a suggestion's condition no longer applies (you assign the stakeholder, link the asset, restore the removed asset), it's automatically marked resolved on the next run.

Your first Analyze run is complimentary. Subsequent runs draw from your daily AI credit pool, with a ceiling of up to 8 credits per run (shown above the Analyze button before you click). In practice runs usually consume far fewer credits because most rules don't find candidates.

Use case notes URL

Each use case has a Notes URL field on the Overview tab. Paste a Google Doc, Notion page, Confluence URL, or any HTTPS link where you keep notes for this use case. When set, an "Open notes ↑↗" button appears in the panel and a small ↑↗ indicator shows next to the use case in the list view.

Team Workspaces

Team Collaboration

Team workspaces are private by default for every role — including admins. Each member’s assets and use cases are visible only to that member; the team admin sees the same private-by-default view of their own work. The asset list defaults to a merged view showing your own assets plus any asset another team member has explicitly shared to the team.

The Assets tab has a toggle — All (default, your own + shared), Mine (only your own), and Shared (only team-shared) — so you can focus on what’s relevant. Admins additionally see a Workspace tab that exposes the full workspace catalog when needed for onboarding, audit, or oversight tasks. Use of the Workspace view is explicit, not the default. The Use Cases tab has an All / Mine toggle (plus the admin-only Workspace tab); use-case sharing is on the roadmap and a Shared option will appear there when it ships.

Data Portability

Export your entire workspace as a JSON file for backup or migration. Go to Setup → Data portability.

Privacy

The Connector sends only file metadata to your workspace — never file contents. Your files stay on your machine. Cloud source connectors read only the asset catalog information exposed by each platform's API — never the underlying data.

MyDataWork never reads, copies, stores, or transmits the contents of your files. Only names, paths, types, sizes, and modification dates are collected. All AI features — including the Asset Estate Assessment, Workspace Agent, Leverage analyses, and the in-app assistant — send only asset metadata (names, paths, tool types, topic tags, lineage edges) and use case text (titles, objectives, descriptions, value figures, stakeholder names) to the AI model. File contents are never sent.

Demo data is fully synthetic. Loading demo from the wizard, the Setup tab’s “Load demo data” button, or the Dashboard’s empty-state button creates fabricated workspace data (assets, use cases, lineage) for exploration. It carries no real customer information and can be removed any time from Setup → Connect your data with no impact on the rest of your workspace.

Value quality and robust statistics

MyDataWork includes lightweight statistical checks designed to help you present use case values you can stand behind in stakeholder conversations.

This approach is inspired by robust statistics, a field concerned with producing reliable estimates even when data contains outliers or errors.

Jira integration

MyDataWork can push use case summaries to Jira as new issues or comments on existing issues. This is useful when your organization uses Jira for project tracking and you need to keep it updated without maintaining two separate records.

Mobile access

MyDataWork works in mobile browsers with no app download needed. Once your desktop is set up, you can manage your portfolio from any device.

Plans & Subscriptions

QuestionAnswer
What plans are available?MyDataWork offers a free Explorer plan and four paid plans. The Explorer plan is a 90-day free trial with no credit card required — 1 seat, 30 user-added assets, 3 AI credits per day with a 60-credit total cap. The paid plans are Solo Monthly ($20/month, 1 seat, 300 assets, 5 credits/day), Solo Annual ($192/year, same caps as Solo Monthly), Team Starter ($900/year, 2-5 seats, 2,000 assets, 20 credits/day), and Team Growth ($1,440/year, 6-10 seats, same caps as Team Starter). Paid plans include a 14-day free trial. Visit mydatawork.com or go to Account → View plans & billing for current pricing and plan details.
How do I upgrade?Go to Account → View plans & billing, or visit /app/billing. Click Start free trial on any plan to begin.
How do I cancel?Go to Account → View plans & billing → Manage billing. Monthly Solo plans continue until end of period. Annual plans are non-refundable. Canceling is different from deleting — see below.
What happens to my data if I cancel?Your workspace data is retained for 30 days after cancellation. You can resubscribe during this period and resume where you left off. Contact contact@mydatawork.com to export during this period. This is different from account deletion, which removes all data immediately.
What’s the difference between canceling and deleting?Canceling your subscription stops future charges but keeps your data for 30 days — you can resubscribe and pick up where you left off. Deleting your account permanently removes your account, workspace, and all data immediately with no recovery option. If you just want to stop paying, cancel your subscription — don’t delete your account.
Annual plan refunds on deletion?Annual plans are non-refundable. If you delete your account mid-cycle, your subscription is canceled immediately and any remaining prepaid time is forfeited.
How do I add team members?As the workspace admin, go to Setup → Members. Enter the team member's name, email, and a starter password. Share credentials with them directly. They should change their password on first login.

Troubleshooting

ProblemSolution
Setup file does not open on WindowsRight-click the downloaded .exe and choose Run as administrator. If Windows shows a security warning, click More info then Run anyway.
No files appear after setupWait 30–60 seconds and refresh. The first scan can take up to a minute for large folders. Make sure you selected the correct top-level folder during setup.
'Another process is running' errorThe Connector is already running from a previous install. The new installer will stop it automatically — click Next to continue.
Open file does not workThe Connector must be running in the background. Check Task Manager. If not found, go to Setup and reconnect your folders.
I need to add or change a folderGo to Setup → click Connect folders. You can update your folder selection at any time without reinstalling the Connector.
"Connect folders" shows an error / the Connector is serving another workspaceA single Connector installation can serve multiple workspaces. Go to Setup → Connect Data, find the right-side Connector panel, and click Add this workspace to Connector. The Connector adds this workspace to its config and starts scanning here too — no reinstall needed. Once added, try Connect folders again.
I forgot my passwordOn the login page, click Forgot password? and enter your email. A reset link will be sent — it expires after 60 minutes.
I need to upgrade or manage my subscriptionGo to Account → View plans & billing to start a free trial, upgrade, or manage your subscription.
Cloud source not syncingCheck that your API token has the correct permissions (repo read for GitHub, API access for dbt/Databricks). Click Scan now in Setup → Cloud Sources to trigger a fresh sync.
AI features not workingGo to Setup → AI Assist and confirm AI is enabled (status shows "enabled" in teal). If disabled, click Enable AI. Also confirm your daily quota has not been reached — usage is shown in the same section.
Automation candidates or marketplace recommendations not generatingConfirm AI is enabled in Setup → AI Assist. For marketplace recommendations, also confirm you have selected at least one cloud provider in Setup → Cloud Providers.
I shared an automation opportunity but want to share it againPreviously generated opportunities are saved in the archive section of the Leverage tab under "Identify automation candidates." Expand any past opportunity and use the Share button to send it again.
External dependencies not showing in lineageEnable the "Show external dependencies" toggle above the lineage visualization in the Lineage tab or above the lineage preview in the Assets tab. The two toggles are independent — enabling one does not affect the other. Both default to off.
Ghost nodes not appearing in exported lineage diagramsIn the Portfolio tab export section, check the "Include external dependencies in lineage" checkbox before clicking Export now. This option defaults to off to keep standard exports clean.
Migration Assist returns no results or an errorConfirm AI is enabled in Setup → AI Assist and that your daily quota has not been reached. If analyzing a use case, ensure at least one asset is linked to that use case — Migration Assist needs asset tool type information to generate recommendations.
'Connector not found' on mobileThis is expected. Connectors only run on Windows desktop. You can still view all previously discovered assets, manage use cases, and export from mobile. Set up the Connector on a desktop first.
Small text or buttons on phoneUse landscape mode or a tablet for a better experience. MyDataWork is optimized for touch interfaces but detailed views work best in landscape.
How do I delete my account?Go to Account settings and click “Delete my account” at the bottom. The deletion dialog includes an optional feedback form (reason and comments), then requires a confirmation checkbox. Your account and all workspace data will be deleted immediately and permanently — this cannot be undone. Active subscriptions are canceled automatically. Annual plans are non-refundable and remaining prepaid time is forfeited. Team admins must transfer ownership or remove all members first. Super-admin accounts cannot be self-deleted. After deletion you are redirected to the login page with a confirmation message.
Send to Jira button is grayed outJira has not been configured yet. Go to Setup → Integrations, enter your Jira instance URL and API token, and click Connect. Once connected the button becomes active on all use case Overview tabs.
Jira push fails or shows an errorCheck that your Jira API token has permission to create issues and add comments in the target project. For Jira Cloud use email:api_token format. For Jira Data Center use a Personal Access Token. Verify your instance URL includes https:// and has no trailing slash.
I don't see the value quality checks or Portfolio distribution sectionThese features require a minimum number of use cases with non-zero values to activate — at least 4 for the per-field outlier hints and at least 3 for the Portfolio value distribution summary. With fewer use cases the checks run silently to avoid false positives. Add more use cases with estimated and realized values to see these features.
‘Rate limit exceeded’ messageSome actions have daily or hourly limits to ensure system reliability: lineage rebuilds (5 per day), Jira pushes (20 per hour), exports (10 per day). Wait for the limit to reset or contact support if you need higher limits.

Download the Connector

The Windows Connector is required only for local file folders. Cloud sources (GitHub, dbt, Databricks, Snowflake) do not require the Connector.

Download Connector for Windows

Still stuck? Email contact@mydatawork.com. We aim to respond within 2 business days.