This provides a seamless integration between Entity Framework Core and DuckDB, allowing you to use DuckDB as your database while maintaining the familiar EF Core API.
- Simple API: Just use
UseDuckDb()similar toUseSqlite()orUseSqlServer() - Automatic SQL Translation: Converts EF Core generated SQL to DuckDB-compatible SQL
- Full EF Core Support: Works with LINQ queries, navigation properties, and all EF Core features
- Performance: Leverages DuckDB's columnar storage for fast analytical queries on Parquet files with significant performance improvements over SQLite
- Developer-Friendly API: The extension replaces fragile, manual setups with a clean, fluent API. This reduces code complexity, improves maintainability, and accelerates development.
The performance improvements are significant when using DuckDB over SQLite for analytical queries:
| Database | OData Query Performance |
|---|---|
| SQLite | 7.61s - 8.83s |
| DuckDB | 800ms - 1s |
Performance Gain: ~8-10x faster 🚀
DuckDB's columnar storage and vectorized execution engine provide substantial performance benefits for analytical workloads, making it an excellent choice for data analysis and reporting scenarios.
Ensure you have the necessary NuGet packages:
<PackageReference Include="Microsoft.EntityFrameworkCore" Version="8.0.6" />
<PackageReference Include="Microsoft.EntityFrameworkCore.Sqlite" Version="8.0.6" />
<PackageReference Include="DuckDB.NET.Data.Full" Version="1.2.1" />using EnergyExemplar.EntityFrameworkCore.DuckDb;
using Microsoft.EntityFrameworkCore;
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=C:/data/mydb.duckdb";
opts.MemoryLimitGB = 4; // Optional: set DuckDB memory limit to 4GB
opts.FileSearchPath = "C:/data/"; // Optional: set file_search_path for relative views
})
.Options;
using var context = new MyDbContext(options);
var data = await context.MyEntities.Where(e => e.Name.StartsWith("Gen")).ToListAsync();// In your Program.cs or Startup.cs
services.AddDbContext<MyDbContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = builder.Configuration["DuckDb:ConnectionString"];
opts.MemoryLimitGB = 8;
opts.FileSearchPath = builder.Configuration["DuckDb:FileSearchPath"];
})
);var duckDbOptions = new DuckDbConnectionOptions
{
ConnectionString = "DataSource=C:/data/mydb.duckdb",
MemoryLimitGB = 16,
FileSearchPath = "C:/data/"
};
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(duckDbOptions)
.Options;var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDbOnParquet("C:/data/mydata.parquet")
.Options;Tip: Change tracking is disabled by default for maximum performance. To enable it, pass
noTracking: falsetoUseDuckDbOnParquet()or setNoTracking = falseonDuckDbConnectionOptions.
Manual DbConnection management, custom interceptors, and scattered configuration.
var options = new DbContextOptionsBuilder<MyDbContext>()
.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=C:/data/mydb.duckdb";
})
.Options;- ConnectionString (required): Path to the DuckDB database file (e.g.,
DataSource=mydb.duckdb) - FileSearchPath (optional): Base directory for resolving relative paths in DuckDB views
- MemoryLimitGB (optional): Memory limit for DuckDB operations (in GB)
- NoTracking (optional): If true (the default), all queries will be executed with
AsNoTracking(). Set tofalseto enable change tracking.
using Microsoft.EntityFrameworkCore;
using Microsoft.Extensions.DependencyInjection;
using Microsoft.Extensions.Hosting;
using EnergyExemplar.EntityFrameworkCore.DuckDb;
var host = Host.CreateDefaultBuilder(args)
.ConfigureServices((context, services) =>
{
services.AddDbContext<MyDbContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = context.Configuration["DuckDb:ConnectionString"];
opts.MemoryLimitGB = 8;
opts.FileSearchPath = context.Configuration["DuckDb:FileSearchPath"];
})
);
services.AddScoped<IDataService, DataService>();
})
.Build();
await host.RunAsync();
public class DataService : IDataService
{
private readonly MyDbContext _context;
public DataService(MyDbContext context) => _context = context;
public async Task<List<MyEntity>> GetDataAsync()
=> await _context.MyEntities.Where(e => e.IsActive).ToListAsync();
}services.AddDbContext<InputContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=input.duckdb";
})
);
services.AddDbContext<OutputContext>(options =>
options.UseDuckDb(opts =>
{
opts.ConnectionString = "DataSource=output.duckdb";
})
);services.AddDbContext<MyParquetContext>(options =>
options.UseDuckDbOnParquet("C:/data/mydata.parquet")
);- Memory Issues: Set
MemoryLimitGBin the options lambda or object. - Path Issues: Ensure
FileSearchPathis set correctly for relative paths in views. - Write Operations: The integration is read-only; any write operation will throw
NotSupportedException. - Performance: DuckDB excels at analytical queries but may be slower for OLTP workloads.
- Cleaner Code: No need to manually configure interceptors and options
- Type Safety: Configuration options are strongly typed
- Consistency: Ensures all required components are properly configured
- Maintainability: Changes to DuckDB integration are centralized
- Testability: Easy to mock or substitute in tests
- The implementation uses SQLite as the base provider but intercepts and translates all SQL to DuckDB
- Query tracking is disabled by default for better performance (can be overridden via
NoTracking = false) - All DuckDB dependencies are automatically registered when using the extensions
This project is licensed under the MIT License - see the LICENSE file for details.
This project depends on the following third-party libraries:
- Microsoft.EntityFrameworkCore - Licensed under the MIT License. See Entity Framework Core repository for details.
- Microsoft.EntityFrameworkCore.Sqlite - Licensed under the MIT License. See Entity Framework Core repository for details.
- DuckDB.NET.Data.Full - Licensed under the MIT License. See DuckDB.NET repository for details.