Releases: dolthub/go-mysql-server
Q4 2024 Release
This is the periodic rollup release of features, bug fixes, and perf improvements for Q4 2024.
Interfaces are not guaranteed stable until 1.0.
Merged PRs
go-mysql-server
- 2798: cache session charset
perf: dolthub/dolt#8691 - 2796: apply table projections through
Distinct
nodes
We weren't pruning table columns when there was a distinct clause over the projections, this resulted the deserialization of every column, even if they weren't going to make it to the result. This is bad for performance, especially if the unread columns are ofTEXT
,LONGTEXT
, 'BLOB,
LONGBLOB` type as those are stored out of band, and take longer to deserialize.
fixes: dolthub/dolt#8689 - 2795: allow using function as table function
- 2794: Bump go-icu-regex
Incorporates the fix from here: - 2793: Revert byte copying optimization
- 2792: Fix the warning checks in enginetest to be more strict
Previously if an error was expected but none was produced the test would pass. - 2791: Properly cast
ENUM
s toTEXT
forCASE
andCONVERT
statements
fixes: dolthub/dolt#8598 - 2788: Fewer wire allocs in SQL() implementations
Optimizes SQL() implementations that convert interface values into type-specific byte arrays. Combination of skippingruntime.convT
checks, that unnecessarily allocated variables to the heap, redundant byte array copying, and other conversion inefficiencies.
dolt perf here: dolthub/dolt#8651goos: darwin goarch: arm64 pkg: github.com/dolthub/go-mysql-server/sql/types cpu: Apple M3 Pro │ before.txt │ after.txt │ │ sec/op │ sec/op vs base │ NumI64SQL-12 62.04n ± 2% 51.13n ± 1% -17.59% (p=0.002 n=6) Varchar10SQL-12 66.85n ± 1% 31.38n ± 2% -53.06% (p=0.002 n=6) TimespanSQL-12 62.36n ± 0% 40.31n ± 1% -35.35% (p=0.002 n=6) TimestampSQL-12 1960.0n ± 1% 255.0n ± 2% -86.99% (p=0.002 n=6) DatetimeSQL-12 1968.5n ± 0% 269.5n ± 3% -86.31% (p=0.002 n=6) EnumSQL-12 111.85n ± 1% 37.49n ± 1% -66.48% (p=0.002 n=6) SetSQL-12 175.15n ± 0% 63.55n ± 1% -63.72% (p=0.002 n=6) BitSQL-12 41.84n ± 1% 41.74n ± 1% ~ (p=0.589 n=6) DecimalSQL-12 683.8n ± 3% 281.9n ± 1% -58.77% (p=0.002 n=6) NumF64SQL-12 105.15n ± 1% 91.72n ± 0% -12.77% (p=0.002 n=6) geomean 189.2n 80.50n -57.45% │ before.txt │ after.txt │ │ B/op │ B/op vs base │ NumI64SQL-12 24.00 ± 0% 16.00 ± 0% -33.33% (p=0.002 n=6) Varchar10SQL-12 40.000 ± 0% 8.000 ± 0% -80.00% (p=0.002 n=6) TimespanSQL-12 24.00 ± 0% 16.00 ± 0% -33.33% (p=0.002 n=6) TimestampSQL-12 1520.00 ± 0% 56.00 ± 0% -96.32% (p=0.002 n=6) DatetimeSQL-12 1520.00 ± 0% 56.00 ± 0% -96.32% (p=0.002 n=6) EnumSQL-12 112.000 ± 0% 8.000 ± 0% -92.86% (p=0.002 n=6) SetSQL-12 136.00 ± 0% 16.00 ± 0% -88.24% (p=0.002 n=6) BitSQL-12 16.00 ± 0% 16.00 ± 0% ~ (p=1.000 n=6) ¹ DecimalSQL-12 439.0 ± 0% 228.0 ± 0% -48.06% (p=0.002 n=6) NumF64SQL-12 38.00 ± 0% 31.00 ± 0% -18.42% (p=0.002 n=6) geomean 108.0 24.94 -76.92% ¹ all samples are equal │ before.txt │ after.txt │ │ allocs/op │ allocs/op vs base │ NumI64SQL-12 2.000 ± 0% 1.000 ± 0% -50.00% (p=0.002 n=6) Varchar10SQL-12 2.000 ± 0% 1.000 ± 0% -50.00% (p=0.002 n=6) TimespanSQL-12 2.000 ± 0% 1.000 ± 0% -50.00% (p=0.002 n=6) TimestampSQL-12 42.000 ± 0% 3.000 ± 0% -92.86% (p=0.002 n=6) DatetimeSQL-12 42.000 ± 0% 3.000 ± 0% -92.86% (p=0.002 n=6) EnumSQL-12 3.000 ± 0% 1.000 ± 0% -66.67% (p=0.002 n=6) SetSQL-12 5.000 ± 0% 2.000 ± 0% -60.00% (p=0.002 n=6) BitSQL-12 2.000 ± 0% 2.000 ± 0% ~ (p=1.000 n=6) ¹ DecimalSQL-12 22.00 ± 0% 10.00 ± 0% -54.55% (p=0.002 n=6) NumF64SQL-12 3.000 ± 0% 2.000 ± 0% -33.33% (p=0.002 n=6) geomean 5.554 1.931 -65.24% ¹ all samples are equal
- 2787: Enable support for
caching_sha2_password
auth
This change enables users configured with thecaching_sha2_password
auth plugin to authenticate to a running SQL server. The default authentication plugin is stillmysql_native_password
, but users can opt-in tocaching_sha2_password
by creating a user and explicitly specifying the auth plugin.
Note that thecaching_sha2_password
auth plugin requires running the SQL server with a certificate so that TLS connections can be established.
Depends on: dolthub/vitess#390
Related to: dolthub/dolt#8496 - 2784: implement
EXPLAIN
andEXPLAIN PLAN
Moving our current implementation ofEXPLAIN
toEXPLAIN PLAN
, and replaceEXPLAIN
with a dummy implementation of MySQL'sEXPLAIN
Looks like this now:related: dolthub/dolt#8592tmp2/main> explain select * from t; +----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+ | id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra | +----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+ | 1 | SELECT | NULL | NULL | NULL | NULL | NULL | NULL | NULL | NULL | NULL | | +----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+ 1 row in set (0.00 sec) tmp2/main> explain plan select * from t; +------------------+ | plan | +------------------+ | Table | | ├─ name: t | | └─ columns: [i] | +------------------+ 3 rows in set (0.00 sec)
- 2782: avoid fmt.Sprintf and string alloc for time.Sql
perf here dolthub/dolt#8640 - 2781: return ok result for
select into
statements
OurSELECT ... INTO ...
statements return empty result set, which produces strange output in thedolt sql
shell.
MySQL just returns ok results, so we should too.
discovered in: #2779 - 2780: Add support for creating users with the
caching_sha2_password
auth plugin
This change enables customers to create users configured to authenticate with thecaching_sha2_password
auth plugin. The generated authentication string uses the same logic as MySQL'scaching_sha2_password
auth plugin. Users created withcaching_sha2_password
can not yet authenticate with a GMS server – the next change in this series will enable that.
Example usage:Depends on: dolthub/vitess#387CREATE USER fred@localhost identified with caching_sha2_password by 'pa$$w0rd';
Related to: dolthub/dolt#8496 - 2779: fix
AS OF
clause panic for certain expressions
We attempt to parse evalAS OF
expressions in the builder (because we assume it is going to be a literal), but Subqueries cannot be evaluated until after they have gone through the analyzer.
partially addresses: dolthub/dolt#8635 - 2776: bump mysql version
Certain tools expect a higher version of MySQL.
Currently, the latest stable version of MySQL is 8.4.4, but8.0.23
is the minimum needed to satisfy mydumper.
Additionally, this alters theversion()
method to select directly from the@@version
system variable.
related: dolthub/dolt#8592 - 2775: prevent creating and dropping
mysql
andinformation_schema
databases
fixes: dolthub/dolt#8621 - 2774: insert ignore to enum column truncates data
- 2773: Fixed error in setup found by user
- 2769: support
NO_AUTO_CREATE_USER
option insql_mode
Older MySQL 5.7 Docs; https://dev.mysql.com/doc/refman/5.7/en/sql-mode.html#sqlmode_no_auto_create_user
We already do this by default, so this is just parsing the option and does nothing. - 2767: Add support for
show slave status
This syntax is deprecated and will be removed in future MySQL versions (SHOW REPLICA STATUS
should be preferred instead). However, some tools (e.g. Dolphie, MyDumper) still rely on this deprecated syntax, so we're adding support for it to keep compatibility with those tools.
Depends on: dolthub/vitess#381 - [2766](https://gith...
Q2 2024 Release Patch
This is a patch release on the Q2 release to remove the NewDefaultServer
method, which was was panicking at query time when used.
go-mysql-server's sql interfaces are not guaranteed to settle until 1.0.
Merged PRs
go-mysql-server
- 2437: lowercase when looking up self referential foreign key columns
fixes: dolthub/dolt#7700 - 2436: Making
@@server_id
default value match MySQL - 2434: Fixing the default value for the
binlog_checksum
system variable
Small change to make the default value of the globalbinlog_checksum
system variable match MySQL's default value (i.e. "CRC32"). - 2433: NULL to nil
TheSHOW FIELDS/COLUMNS FROM <table>
query would return the string"NULL"
forDefault
column rather thannil
.
This mattered for Knex, which relied on it beingNULL
and not"NULL"
.
fixes: dolthub/dolt#7692 - 2432: support
Threads_connected
andThreads_running
status variables
This PR adds support forThreads_connected
andThreads_running
status variables.
Additionally, the local enginetest are flaking consistently in dolt ci, so those have been removed;
we have handler tests forcom_delete
,com_insert
, andcom_update
anyway.
Related: dolthub/dolt#7646 - 2431: Setting
Innodb_buffer_pool_pages_total
to 1, to avoid an issue with Datadog's collector
Datadog's metric collector errors out with a divide by zero error if theInnodb_buffer_pool_pages_total
status variable is0
; changing it to1
avoids this and allows the agent to collect metrics from Dolt. - 2430: have status variables use go routines
This PR changes Status Variables to update through go routines, to avoid slowing down query execution due to the mutexes present. - 2429: server trace time includes parsing
- 2427: support
Com_delete
,Com_insert
,Com_update
status variables
related: dolthub/dolt#7646 - 2426: use
@@session.collation_server
duringcreate database ...
This PR makes it socreate database ...
actually reads the@@session.collation_server
variable.
Additionally, this ensures that settings@@character_set_server
sets@@collation_server
and vice versa.
Interestingly, it seems like MySQL actually ignores the global scope of these system variables, and reads the session scope instead.
fixes dolthub/dolt#7651 - 2423: Adding test for preparing
time.Time
types
This PR adds tests for usingtime.Time
, some tests have to be skipped because we don't support Timespan correctly.
companion pr:dolthub/vitess#327dolthub/vitess#328
test for dolthub/dolt#7665 - 2422: Support
Questions
status variable
This PR adds logic to update status variableQuestions
.
This only works in the server context, probably doesn't throughdolt sql
cli.
dolthub/dolt#7646 - 2421: [stats] costed index scan perf
Histogram copying is expensive. Instead pass and mutate references. We have to use a different struct type to load stats from JSON in order to support histogram interface generalization.
related Dolt-side: dolthub/dolt#7666 - 2420: support case-insensitive
LIKE
for show status/variables
MySQL stores session and global variables in aperformance_schema
database, and these tables have a case-insensitive collation on the variable names.
This PR emulates that behavior by hard coding the collation the schemas forShowStatus
andShowVariables
nodes. - 2419: Bug fix: Allow JSON scalar comparison between int64 and float64
When comparing JSON values, numbers may be represented internally as an int64 or float64, but our comparison code wasn't casting an int64 to a float64 in order to compare it with a float64 value.
Fixes dolthub/dolt#7656 - 2418: fix
show create database
to actually show charset/collation
This PR fixes theSHOW CREATE DATABASE ...
statement to actually show the charset/collation that the db is under instead of always default.
Additionally, this PR parses thecharset
database option, instead of ignoring it like before.
partially fixes: dolthub/dolt#7651 - 2416: /{.github,go.mod,go.sum}: bump go version
- 2414: stubbing out status variables
This PR adds the initial implementation of Status Variables.
There are 682 status variables, and are very similar to System Variables.
Every variable is read-only (and can only be updated by the server itself), and there are session-specific variables.
MySQL Docs: https://dev.mysql.com/doc/refman/8.0/en/server-status-variable-reference.html
Related: dolthub/dolt#7646 - 2412: New interface for binlog primary callbacks
First pass on connecting the GMS layer with the Dolt layer for handling callbacks when the SQL server is acting in binlog primary mode, through the newBinlogPrimaryController
interface. This new interface pretty closely mirrors the existing callback interface for replica callbacks, theBinlogReplicaController
interface.
Related to dolthub/dolt#7512 - 2411: implement
json_search()
MySQL Docs: https://dev.mysql.com/doc/refman/8.3/en/json-search-functions.html#function_json-search - 2410: Adding system variable
innodb_autoinc_lock_mode
We currently only supportinnodb_autoinc_lock_mode = 2
, not0
or1
.
MySQL Docs:
https://dev.mysql.com/doc/refman/8.0/en/innodb-auto-increment-handling.html
related: dolthub/dolt#7634 - 2404: Improve handling of unsigned and decimal types in JSON
Fixes #2391
MySQL's JSON type differs from standard JSON in some important ways. It supports types not supported in standard JSON, such as separate types for integers and floats, an unsigned int type, and a decimal type.
Prior to this PR, we would convert values to JSON by using theencodings/json
package to marshall the value to a JSON string and then unmarshall it to a go map. This is not only slow, but it's incorrect for these additional types.
The main purpose of this PR is to add special handling for these types that allow them to be stored in JSON documents. We also avoid generating and parsing JSON in places where it's not actually necessary, and fix bugs where decimals get incorrectly converted into strings, or unsigned ints get converted into signed ints.
Finally, this fixes an issue where we send incorrect bytes for JSON-wrapped decimal values along the wire. - 2403: fix dbName not being used in the example
This PR makes sure thatdbName
in the example is actually being used, instead of having a hardcoded "mydb" increateTestDatabase
.
fixes #2402 - 2401: refactor and parse table options, support auto_increment table option
Table Options are now parsed as structs, so we can read/use some of the variables.
Character Sets, Collations, Create Table, TableSpec, etc. have been refactored.
Additionally, this PR adds support to parse and use the auto_increment table option.
TODO:CREATE TABLE ... LIKE ...
needs to preserve table opts, like commentsalter table add column ... auto_increment
does not work when there are already rows
Companion PR: dolthub/vitess#322
- 2399: fix custom insert ordering for pk
fixes #2397 - 2398: fix in-memory implementation of RenameTable to read from session
The in-memory implementation of RenameTable uses data from the BaseDatabase, instead of reading it from the session.
This is problematic when there are multiple alter statements.
Additonally, includes some small refactor so all functions are pointer receiver instead of a mix.
fixes #2396 - 2394: Bug fix: Set non-boolean system variable enum values to 'ON' or 'OFF'
We were automatically convertingON
andOFF
values to totrue
andfalse
when setting a system variable, which made it impossible to set system variables to those enum values. For example:SET @@GLOBAL.gtid_mode='ON'; Variable 'gtid_mode' can't be set to the value of 'true'
- [2393](/~https://github.com/dolthub...
Q1 2024 Release
This is our quarterly roll-up release that contains many bug fixes, features, and performance improvements. Reminder that interfaces are not guaranteed to settle until 1.0.
This release contains a breaking change for setting up an in-memory database server, e.g. in testing. The in-memory database now requires its own session implementation, and the server must be supplied with a compatible session builder to prevent panics. See the _examples
directory for an example.
Merged PRs
go-mysql-server
- 2350: [planbuilder] Limit allows unsigned ints
Parameterizing limits in stored procedures exposed a bug where we weren't accepting unsigned ints in LIMIT/OFFSET. This is only possible in prepared or stored procedure contexts afaik. - 2348: Optional integrator schema validation
Move table schema validation so that we validate the accumulated result rather than sequential alters. - 2347: implement
json_depth()
MySQL Docs:
https://dev.mysql.com/doc/refman/8.0/en/json-attribute-functions.html#function_json-depth - 2346: fix null and empty paths for json functions
- 2345: Return correct MySQL error code when inserting into nonexistent columns.
Fixes #2344
This PR also adds a test framework for testing MySQL error codes, since this doesn't appear to currently be tested. This should make adding tests for other error codes easy.
Some of the MySQL error codes that I expected us to use (such as1050: ERTableExists
) have zero usages in GMS or Dolt. We're probably returning1105: ERUnknownError
for these. - 2343: fix json bool comparison
JSON Boolean values are special; they do not behave like normal when converted/compared against other types.
fixes dolthub/dolt#7528 - 2342: Add row length guards
Tests and basic engine-side guards similar to mysql's behavior.
re: dolthub/dolt#7524 - 2339: only use
crosshashjoin
on subqueryaliases
This fixes 2 sqllogictests. - 2336: fix rangetree removal of largest node without left child
When removing a right child, a missing nil guard was causing us to improperly propagate the new max range.
This resulted in overlapping ranges.
This fixes 1 sqllogictest. - 2335: Schema pruning considers database name
Fix some inconsistencies with project and join hints. New test added for new project pruning changes. - 2334: fix char to bool conversion
This PR adds missingnil
guards in a few conversion methods.
fixes dolthub/dolt#7515 - 2333: [planbuilder] Skip post-aggregate projection when there are no aliases in target scope
re: dolthub/dolt#6982
This takes the last big chunk of CPU pressure off of TPC-C execution, which is mostly disk limited now:
- 2332: [time] shortcut for no timezone conversion
This function reads files from disk to try to convert a timezone to itself. There are other optimizations we could do to maybe try checking for the integer offset before doing syscalls.
re: dolthub/dolt#6982, ~10% TPC-C boost - 2331: fix
NOT
expression in conjunction withWHERE EXISTS(<subquery>)
The analyzer ruleunnest_exists_subqueries
was accidentally droppingNOT
expressions when hoisting subqueries fromWHERE EXISTS...
clauses.
This should fix 8 sqllogictests.
Correctness: dolthub/dolt#7510 - 2330: Fix
DISTINCT
overDECIMALS
There was another place where we were usinghashstructure
package, which does not hashdecimal.Decimal
types correctly.
Switched toxxhash
package, which is what we use everywhere else.
Reusing solution from: #2279
This will fix 1 sqllogictest. - 2329:
expression.Div
Micro Benchmarks
There are going to be changes to our division behavior that impact both its accuracy and speed.
This PR adds benchmarks to track the runtime improvements/degradations - 2327: memo/planbuilder: Off by one error, and issue importing statistics for testing
The original bug was a one-off error whereconjCollector
was checking for the wrong equivalence column. That spiraled into:- statistics were not properly being used for some tests because PRIMARY case sensitivity in
ANALYZE ... UPDATE HISTOGRAM
statements - one integration test had a weird edge case where OSX and linux compute slightly differently float values, leading to different plans locally vs CI. One option was to truncate the specificity of all cost values with a multiply/round to avoid numerical weirdness...picking the first lowest plan also seems to work.
- variety of other small bugs in costing
- update join bug where the projections after a rearrangement lost expression id/nullability info. This was exposed by the changes above
- statistics were not properly being used for some tests because PRIMARY case sensitivity in
- 2326: Various Decimal Type Fixes
This PR has a variety of fixes to have arithmetic operations (especially those involving decimals) behave more like MySQL.
The logic for theType()
method forArthmetic
andDiv
is simpler, and better tested.
When comparing Decimal results from division operations, MySQL has an internal Scale that is different than the Scale used when returning Decimal results for display.
Here's a matrix displaying the resulting scale:
(Ex:1 / 3 = 0.333333333
; scale 0 div scale 0 should return scale 9)
Additionally, this PR adds test for arithmetic operations over Datetime and Year types. There are still a some problems dealing with precision and parsing there...
Note: I believe the division optimization where float division is preferred over decimal division for internal calculations may be causing problems. More testing is needed to see if it's possible to enable this without causing inaccuracies/precision loss.
There are microbenchmarks measuring the performance of div expression, and it turns out these changes actually greatly improve the runtime.
Correctness: dolthub/dolt#7484
Fixes - 2323: SQLEXCEPTION proc handler
We originally supported one type of procedure handler,NOT FOUND
, which explicitly checked for an error when fetching from a cursor io.EOFs. The implementation for that handler would walk the entire BEGIN/END scope stack inside the Fetch call looking for a handler, execute the handler body, and then embed the scope height into a special return error. The error walked back up the callstack looking for the BEGIN/END block embedded in the error message. - 2319: Add net.Listener as server config option
Presently go-mysql-server is served by anet.Listener
backed by a real port or socket. In some environments (like testing) we want to avoid the host's networking stack entirely. This changeset adds support for the caller to provide thenet.Listener
, which gives them full control over where the sever serves.
This opens the door for some cool workflows. For example, you can run the server using a buffer-basednet.Listener
implementation (which is what happens in the test that I added).
I landed on this solution while thinking through #2314 - 2317: support
SELECT ... INTO OUTFILE/DUMPFILE ...
This adds support for MySQL'sSELECT ... INTO OUTFILE/DUMPFILE ...
feature.
It is the complement toLOAD DATA
. There is noLOCAL
option, so files created using this feature are on the server.
This PR adds a custom TestSuite for testing these files, as it needs to write, read, and delete files.
syntax: dolthub/vitess#311
fixes dolthub/dolt#7453 - 2316: Allow primary keys with
auto_increment
columns to be dropped when an appropriate index exists
Related to dolthub/dolt#7456 - 2315: Stored procedures can use params as LIMIT,OFFSET
fixes: dolthub/dolt#7458
Fixes procedure param types in the process.
Testing against mysql it doesn't seem like subqueries or regular column types are valid as LIMIT values in any case other than procedure parameters. I still need to test whether trigger columns can be used in LIMIT,OFFSET, but that seems like a separate problem. - 2312: Added the ExtendedType interface
This ...
Aug 2023 Release
This is our ~quarterly rollup release, containing many new features, bug fixes, and performance improvements. See the merged PRs for details. Interfaces are not guaranteed stable before 1.0.0.
This release includes dramatic changes to parsing and the query planner. Users running a test server are expected to be unaffected, but integrators building their own backend may need to make changes.
Merged PRs
go-mysql-server
- 1965: Add error test for ambiguous column name query
close dolthub/dolt#6395 - 1963: Fixed type hashing for Full-Text
Fixes the reopened issue: dolthub/dolt#6543
Not all types were covered by the default case. - 1961: engine: Make ReadOnly a toggleable atomic.Bool.
- 1960: Fixed Full-Text NULL handling and ALTER TABLE case
Fixes dolthub/dolt#6540 and dolthub/dolt#6541 - 1959: Bug fix: Prevent panic when reading non-existent user and system vars
Fixes dolthub/dolt#6546
We have enginetests that cover both of these queries, but because we don't run them with the full wire request/response processing code, we didn't catch these bugs during testing. Happy to add more tests here if there are suggestions, but I think the right way to test both of these is to get our existing test suite running over a SQL server connection, instead of just using the internal interfaces to the engine (i.e. dolthub/dolt#3646), which Zach started on last week. - 1958: Don't parse queries twice
Local profile foroltp_point_select
(query with smallest time spent in execution) is 5-15% speedup. Impact on queries with longer-runtime will be smaller, proportional to the fraction of time spent in analysis vs execution.
results here: dolthub/dolt#6547 (comment) - 1957: Fixed relevancy ordering for Full-Text
Fixes dolthub/dolt#6530 - 1956: Removed unused AutoIncrementGetter interface
Related to dolthub/dolt#6543.
TheAutoIncrementGetter
was moved from the editor to the table years ago, however the interface remained. I debated deleting this during my Full-Text implementation, however decided to leave it. Now, we've encountered an error with an integrator making use of the interface, so it has been removed here and the interface was moved into the integrator since it's an internal detail now. - 1955: adding catalog table function interface
- 1954: Lateral join uses prepend row on RHS
- 1952: GMS tests have to resolve defaults
Force GMS to resolve column defaults, fix bugs exposed by additional testing. Dolt enginetests pass locally. - 1951: Fixed collation display in SHOW CREATE TABLE
Originally, we didn't display the collation inSHOW CREATE TABLE
when the collation was the default collation. Now, it doesn't display it if it's the same as the table collation, which mimics MySQL's behavior. - 1950: Honor precision for datetime and timestamp, default to 0 (no fractional seconds)
- 1949: Revert "Merge pull request #1944 from dolthub/zachmu/timestamp"
This reverts commit ca69015, reversing changes made to c11b504. - 1948: Name resolution refactor
Accumulation of: - 1944: Changed datetime and timestamp types to honor precision and default to 0 digit precision
This matches the MySQL behavior.
Partial fix for dolthub/dolt#6503 - 1942: Full-Text Fixes Pt. 3
- 1941: Bug fix for JSON_ARRAY function with binary arguments
Binary args are now treated as character strings
Also added a testing path to ScriptTests that let you inject Vitess bindvars to exercise more server logic
Fixes #1855 - 1940: Add SECURITY.md.
- 1938: Create interface for indexible tables in
IndexedTableAccess
Currently, only ResolvedTables are allowed to have indexes. There exists an interface,sql.IndexAddressable
, which any node or table can implement in order to be a candidate for index-based optimization. But in practice, implementing that interface won't actually do anything because theIndexedTableAccess
struct explicitly requires a ResolvedTable.
This PR replaces theResolvedTable
field inIndexedTableAccess
with a new interface tentatively calledTableNode
, although a more specific name would probably be better.
In order for a node to be used for index-based optimization, it must implement this interface, and the table returned by theUnderlyingTable
method must implementsql.IndexAddressable
- 1937: Remove do-nothing logic from
pushdown.go
This code is for an optimization that "pushes" filters deeper into the tree so that they're adjacent to the tables they modify.
I've simplified the logic in two ways:- Removed handling of
IndexedTableAccess
nodes. This analysis pass used to run after these nodes were generated, but now runs before, so this code path will never be hit. Removing this logic makes it easier to make future changes toIndexedTableAccess
- Removed the
withTable
subfunction, which never actually does anything, because the only table it will attempt to assign to a ResolvedTable is the table already on the node. This was likely leftover from a previous cleanup.
- Removed handling of
- 1935: Allow timestamps when encoding json
As reported on discord, Nautobot, through DJango, puts time stamp data into a json object. This fails because:This change enables the encoding of a time stamp into a string.db> select JSON_OBJECT("a", Now()); unsupported type: time.Time
- 1933: Fixed case sensitivity bugs in various statements
- 1930: leave aliases in projection
We convertexpression.Alias
intoexpression.GetField
with the name replaced in the top-level projection.
When aliasing two different columns with the same name, we fail to properly distinguish them and end up rewriting the GetField indexes for both columns to be the same; this leads to incorrect results.
A simple fix appears to be simply allowing the top-level projection to remain as anexpression.Alias
.
This fix does not work for prepared statements in the old name resolution path.
fix for: dolthub/dolt#6455 - 1929: Add support for json_contains_path()
- 1928: Generated stored columns prototype
Looking for feedback. The approach is a little bit wonky: it uses the same code paths as column defaults since they behave so similarly, but this has some weird consequences:- For generated columns, we fill in the column.DefaultValue field with the Generated expression
- In various places, we now have to consider whether to use either column.DefaultValue, or column.Generated
Overall I think I'm favorable on this approach, but keeping the two values more separate might be better for maintainability, not sure. Thoughts?
- 1926: Use EvaluateCondition for conditions in join statements
Fixes dolthub/dolt#6412
Use the given EvaluateCondition util function to determine whether a join condition is satisfied, which accounts for truthy integer values as mentioned in the linked issue. - 1925: Fix load data check constraint indexing bug
fix show tests - 1924: Makes several new kinds of alter table statements involving auto_increment columns work correctly
Also: makes several kinds of ALTER TABLE statements with multiple clauses more lenient in their error checking than MySQL. These statements will now succeed instead of being rejected.
Fixes dolthub/dolt#6218 - 1922: Render enum/set variables as strings after they are set
Fixes https:...
Q3 2023 Release
This is the quarterly roll-up release, containing many new features and bug fixes.
Interfaces will not be stable until 1.0.
Merged PRs
go-mysql-server
- 1861: chore: remove refs to deprecated io/ioutil
- 1860: chore: unnecessary use of fmt.Sprintf
- 1859: chore: use copy(to, from) instead of a loop
- 1856: Support IPV6 loopback address for looking up user credentials
Map "::1" and "127.0.0.1" to localhost when looking up users.
There don't appear to be tests for this code path. TBD if I'll add some.
Related to: dolthub/dolt#6239 - 1854: Prevent loops in stored procedures from returning multiple result sets
The query in dolthub/dolt#6230 was causing rows from many result sets to be returned from a stored procedure. We already have code that limitsBEGIN/END
blocks to return the last SELECTed result set; this PR extends that logic to loop constructs as well.
Fixes: dolthub/dolt#6230
Dolt CI Checks: dolthub/dolt#6245 - 1853: chore: slice replace loop
- 1852: Alter stored procedure execution to deal with statements that commit transactions
This change adds checks to begin a new transaction whenever there isn't one during stored procedure execution. This lets things likedolt_commit()
execute correctly in stored procedures. - 1851:
memo.Literal
has different type than lookup
This panics on dolt:The PutField function expects the value to match the tuple descriptor exactly, and will panic if it does not.CREATE TABLE tab2(pk INTEGER PRIMARY KEY, col0 INTEGER, col1 FLOAT, col2 TEXT, col3 INTEGER, col4 FLOAT, col5 TEXT); CREATE UNIQUE INDEX idx_tab2_0 ON tab2 (col1 DESC,col4 DESC); CREATE INDEX idx_tab2_1 ON tab2 (col1,col0); CREATE INDEX idx_tab2_2 ON tab2 (col4,col0); CREATE INDEX idx_tab2_3 ON tab2 (col3 DESC); INSERT INTO tab2 VALUES(0,344,171.98,'nwowg',833,149.54,'wjiif'); INSERT INTO tab2 VALUES(1,353,589.18,'femmh',44,621.85,'qedct'); SELECT pk FROM tab2 WHERE ((((((col0 IN (SELECT col3 FROM tab2 WHERE ((col1 = 672.71)) AND col4 IN (SELECT col1 FROM tab2 WHERE ((col4 > 169.88 OR col0 > 939 AND ((col3 > 578))))) AND col0 >= 377) AND col4 >= 817.87 AND (col4 > 597.59)) OR col4 >= 434.59 AND ((col4 < 158.43)))))) AND col0 < 303) OR ((col0 > 549)) AND (col4 BETWEEN 816.92 AND 983.96) OR (col3 BETWEEN 421 AND 96);
The section of code in memo that creates a new range uses the type from the expression, but in other places it uses the index column expression types.
An alternative solution would be to have some logic in dolt to convert to the correspondingsql.Type
based off theval.Enc
- 1848:
IntDiv.Type()
should always return eitheruint64
orint64
Previously, ourIntDiv.convertLeftRight()
usedIntDiv.Type()
to determine the larger type betweenIntDiv.Left.Type()
andIntDiv.Right.Type()
to avoid precision loss when doing internal calculations. Now, that logic is moved fromIntDiv.Type()
toIntDiv.convertLeftRight()
, andIntDiv.Type()
can only returnuint64
orint64
.
This should fix the sql correctness regression from #1834 - 1847: Fix TargetSchema.Resolved() to check targetSchema column default expressions
A coupleSchemaTarget
implementations weren't checking if thetargetSchema
was resolved as part of theResolved()
method. Added tests, audited the other implementations, and simplified the logic to use a new method onSchema
to check that column default expressions are resolved.
Fixes: dolthub/dolt#6206
Dolt CI Run: dolthub/dolt#6213 - 1846: update
information_schema.processlist
to correctly display status of processes and databases
We used to hardcode"Query"
, now we referenceprocess.Command
Additionally, we now get the database from the current session and use that variable.
fix for: dolthub/dolt#6023 - 1844: fix panic for group by binary type
We made a bad type assertion forsql.StringType
.
Additionally, this fixes a issue whereUnaryExpressions
withGetFields
would incorrectly throw a functional dependency error withONLY_FULL_GROUP_BY
enabled.
Fix for second part of: dolthub/dolt#6179 - 1843: Improvements to
CAST
andCONVERT
functions
This PR adds support for casting/converting toFLOAT
andDOUBLE
types with theCAST
andCONVERT
functions. It also adds support for length (aka precision) and scale type constraints (e.g.CAST(1.2345 AS DECIMAL(3,2))
).
Parser support forDOUBLE
andFLOAT
withCAST
andCONVERT
: dolthub/vitess#249
Fixes: dolthub/dolt#5835 - 1841: adding
version
andversion_comment
values
@@version
now returns8.0.11
@@version_comment
now returns "Dolt"; in mysql, this appears to be dependent on OS / method of install- Some people get
MySQL Community Server - GPL
- Others get
Homebrew
Fix for first part of: dolthub/dolt#6179
- Some people get
- 1840: deduplicate (hash) intuple for and queries
This PR was originally supposed to fix it: original fix: #1677, butAND
statements weren't covered.
fix for: dolthub/dolt#6189 - 1839: Slow degenerate semi join, hoist select opt
This enables recursive subquery decorrelations, and adds a hash join execution option for semi joins that is equivalent to cached subquery existence checks. - 1838: resolve aliases in subqueries in function arguments
The rulereorderProjection
also replaces subqueries with getfields in projections when they are used by subqueries, but it did not check for function expressions.
This meant that aliases in subqueries as arguments to functions threw a"x" could not be found
error.
This PR just has the section ofreorderProjection
that is supposed to find deferredColumns also look at the arguments of functions recursively (because we can nest functions).
Additionally, there was another schema type bug:MySQL returns an Integer type for if statement, and if either argument is a String, it always returns a String.tmp> select 0 as foo, if((select foo), 123, 456); +-----+----------------------------+ | foo | if((select foo), 123, 456) | +-----+----------------------------+ | 0 | 127 | +-----+----------------------------+ 1 row in set (0.00 sec)
fix for: dolthub/dolt#6174 - 1836: update cached table count in prepared statements
Prepared statements were caching table counts. We need to update the table count when finalizing prepared statements to bring table count up to date with any intermediate edits. - 1834: fix expected schema for
sum(literal)
The code path we take when print rows to shell is different than spooling from server.
In the sql case, we ignore the schema we get from analysis.
In the server case, we actually read the schema, and ensure that the rows are of that type.
When doingsum(literal)
, we use the type of the literal. In this issue, the literal was1
, so anINT8
, which caps out at127
.
sum()
is always supposed to return a float64, so I made a change to do that.
I checked by starting mysql with--column-type-info
option, and it does appear that any columns coming fromsum()
has aDECIMAL
type.
Fix for: dolthub/dolt#6120 - 1830: Use SO_REUSEADDR and SO_REUSEPORT options when creating the sql server on Unix
This prevents a transient error we've been seeing where the server sometimes fails to start, and the OS claimsport already in use
, even though we've already confirmed that the port is not in use prior to runningdolt sql-server
. - 1829: plan.TableCountLookup short circuits count()
In many cases it is unnecessary to read an entire table to report count(*). We can use the RowCount() interface to jump to the answer. - 1828: Consolidated collation maps
Main file to check is thegenerate/main.go
file. After running the updated generation program, these are the consolidated files:common_utf8mb4_es_0900_ai_ci_Weights: [utf8mb4_es_0900_ai_ci_Weights, utf8mb4_es_trad_0900_ai_ci_Weights] common_utf8mb4_es_0900_as_cs_Weights: [utf8mb4_es_0900_as_cs_Weights, utf8mb4_es_trad_0900_as_cs_Weights] common_utf_croatian_ci_Weights: [utf16_croatian_ci_Weights, utf32_croatian_ci_Weights, utf8mb3_croatian_ci_Weights, utf8mb4_croatian_ci_Weights] common_utf_czech_ci_Weights: [u...
Q2 2023 Release
This is a periodic rollup release of ongoing development work. Reminder that core interfaces are not guaranteed to be stable until 1.0.
Merged PRs
go-mysql-server
- 1719: Added serving tray and bowtie
- 1717: Added a mascot image
- 1716: Revert "allow renaming views with
RENAME TABLE
statement (#1712)"
This reverts commit dac7262 from PR #1712
The PR above added aViewDatabase
implementation forPrivilegedDatabase
, which is causing some Dolt cluster integration tests to fail. Temporarily pulling this commit out so we can get other GMS changes through and can debug the Dolt test failures separately. - 1715: implement
find_in_set
MySQL docs: https://dev.mysql.com/doc/refman/8.0/en/string-functions.html#function_find-in-set
Note: Collations priority isn't exactly the same as MySQL
fix for: dolthub/dolt#5659 - 1713: Changing
MaxTextResponseByteLength()
to respectcharacter_set_results
TheType.MaxTextResponseByteLength()
function was not honoring thecharacter_set_results
system variable, which resulted in Dolt sending different response length metadata from MySQL when the value ofcharacter_set_results
is not the same as the type's charset.
This change adds a*sql.Context
parameter to theMaxTextResponseByteLength(ctx)
function, so that the correct value can be pulled out of the session when the response metadata is sent through the handler.
Fixes: dolthub/dolt#5735
Related Dolt change: dolthub/dolt#5752 - 1712: allow renaming views with
RENAME TABLE
statement- Added renaming of views with
RENAME TABLE ... TO ...
statement - Added
ViewDatabase
implementation forPrivilegedDatabase
TODO:ALTER TABLE ... RENAME ...
should fail for renaming of views. Currently,vitess
parses both the statements into the same node, which makesGMS
parser not be able to detect the difference.
Should return error:ERROR 1347 (HY000): 'mydb.myview' is not BASE TABLE
- Added renaming of views with
- 1711: update stored procedure timestamps to be accurate
Currently when we load non-built-in stored procedures, we re-analyze the sql string which also re-creates all the metadata. This means that theCreated At
andModified At
fields can re-set every time the procedure is loaded in, instead of showing the correct timestamp. This change updates the loaded in procedure with the correct timestamps.
fixes: dolthub/dolt#3081 - 1710: Detect invalid uses of * and window functions in queries.
This PR is the GMS side of the fix for dolthub/dolt#5656.
Preventing panics from invalid window functions is easy: replace the panic with returning a new kind of error.
The invalid *s were trickier. I added an additional analysis rule that runs immediately after resolving function names. It checks for any uses of "*" in a subexpression (so, not just "SELECT *") that aren't used inside a COUNT, COUNTDISTINCT, or JSONARRAY function.
It's possible that there's other places where *s are allowed that we need to account for. It's also possible that there may be some other disallowed uses of * that will pass this and still cause a panic. - 1708: Allow uppercase collations
Fixes dolthub/dolt#5699
We were comparing the raw strings, which failed when they were anything but lowercase. Now we just force all strings to lowercase. - 1707: fix JSON map key ordering
fix for: dolthub/dolt#5711 - 1704: fix group by over empty result sets
fix for: dolthub/dolt#5683 - 1702: sql/plan: track column renames on alter/modify column
fixes dolthub/dolt#5689 - 1701: support
EVENT
storage handling
This PR implements:CREATE EVENT
statementDROP EVENT
statementSHOW EVENTS
statementSHOW CREATE EVENT
statement.
This PR handles events storage only. The event execution logic is not implemented yet. No events will be run and/or dropped if the execution time is passed or dropped if 'ends' time is passed and 'not preserve' is defined. Created events will be only stored in the database and can be queried to see details usingSHOW
statements.
- 1700: OrderedDistinct passes join tree row during exec
Re: dolthub/dolt#5700
OrderedDistinct dropped a parent row while executing a join tree, which caused a field index error in the child.
The query below has a slightly different plan, but the OrderedDistinct in the right half of a join tree, with a joinSEMI_JOIN(scalarSubq0, scalarSubq1)
as a child is the issue.Project ├─ columns: [SUM(xy.x):0!null as SUM(x)] └─ GroupBy ├─ select: SUM(xy.x:0!null) ├─ group: └─ Project ├─ columns: [xy.x:0!null, xy.y:1] └─ HashJoin ├─ Eq │ ├─ xy.x:0!null │ └─ scalarSubq0.u:2!null ├─ Filter │ ├─ Eq │ │ ├─ xy.x:0!null │ │ └─ 2 (tinyint) │ └─ Table │ ├─ name: xy │ └─ columns: [x y] └─ HashLookup ├─ source: TUPLE(xy.x:0!null) ├─ target: TUPLE(scalarSubq0.u:0!null) └─ CachedResults └─ OrderedDistinct └─ Project ├─ columns: [scalarSubq0.u:0!null] └─ SemiJoin ├─ Eq │ ├─ scalarSubq0.u:2!null │ └─ scalarSubq1.a:4!null ├─ TableAlias(scalarSubq0) │ └─ Table │ ├─ name: uv │ └─ columns: [u v] └─ Filter ├─ Eq │ ├─ scalarSubq1.a:0!null │ └─ 2 (tinyint) └─ TableAlias(scalarSubq1) └─ Table ├─ name: ab └─ columns: [a]
- 1696: resolve recursive CTE in scalar subquery
Re: dolthub/dolt#5631 - 1695: Fix value out of range bug
re: dolthub/dolt#5642 - 1693: update separator parsing in group_concat
Updates separator parsing for group_concat to use newSeparator
struct. This change is needed to allow''
as a separator.
fixes: dolthub/dolt#5570
related: dolthub/vitess#230 - 1692: don't push
Filter
belowLimit
When there's a query with a filter over a subquery with a limit, we incorrectly push filters down to the subquery.
Example:
Thisis not equivalent toselect * from (select * from t limit 1) t where i > 1;
Fix for: dolthub/dolt#5568select * from (select * from t where i > 1) t limit 1;
- 1690: Use ordinals to force stable TopN heap sort
Re: dolthub/dolt#5640
... ORDER BY <expr> LIMIT 1
and... ORDER BY <expr> LIMIT 1 OFFSET 1
both use TopN in a way that surprises users expecting stable output orders.
For example, the second query returning the first row is within the SQL spec but not user friendly:This PR adds ordinals to rows in the TopN heap to differentiate insertion order in sort-groupings.-- setup create table xy (x int primary key, y int); insert into xy values (1,0),(2,0),(3,0),(4,0); -- expect first row select * from xy order by y asc limit 1; +---+------+ | x | y | +---+------+ | 0 | 1 | +---+------+ -- expect second row select * from xy order by y asc limit 1 offset 1; +---+------+ | x | y | +---+------+ | 0 | 1 | +---+------+
- 1689: Make merge_join pushdown safe
Index lookups were being pushed into merge join:In rare cases like above, this pushes an unsafe index into merge join that is not sorted monotonically on the join condition. A merge join index that violates join condition monotonicity will miss matches. In even rarer cases, the out of order value for the join condition included NULLs, causing this error: dolthub/dolt#5633.tmp1> explain select /*+ merge_join(xyz,uv) join_order(xy,uv) */ * from xyz join uv on x = u where y = 1; +--------------------------------+ | plan | +--------------------------------+ | MergeJoin | | ├─ cmp: (xyz.x = uv.u) | | ├─ IndexedTableAccess(xyz) | | │ ├─ index: [xyz.y] | | │ ├─ filters: [{[1, 1]}] | | │ └─ columns: [x y z] | | └─ IndexedTableAccess(uv) | | ├─ index: [uv.u] | | ├─ filters: [{[NULL, ∞)}] | | └─ columns: [u v] | +--------------------------------+
- 1688: Support for recursive CTEs with no union
Ex:Re: dolthub/dolt#5657with recursive a as (select 1) select * from a union select * from a;
- [1687](https://...
Nov 2022 Release
This is a periodic rollup release. It contains many features, bug fixes, and improvements.
The core API has still not stabilized and will not be guaranteed until 1.0.
Merged PRs
go-mysql-server
- 1364: Updated and expanded engine examples
TheREADME.md
had an updated example, but the actual_example/main.go
file did not. I've expanded the example file a bit to include information on setting up users, and slightly simplified theREADME.md
example. I've also added tests for everything, so that if anything breaks, we'll know we need to update both the example file and theREADME.md
portion. - 1363: Convert errors during
ComPrepare
toSQLError
We were already converting errors inComStmtExecute
,ComMultiQuery
, andComQuery
toSQLError
so that the correct error codes would be sent to clients. This change adds that support toComPrepare
, too.
Added a unit test for that case and took the opportunity to simplify the interface forCastSQLError
a little bit.
This change helps get Prisma support a little further along (dolthub/dolt#4511), but it doesn't look like it fully resolves everything Prisma needs to work with Dolt. - 1361: Fixed collation check on foreign key columns
- 1358: fix example package
close #1357
This fixes runtime panic raised by example app in/_example
.
I checked SQL client can obtain response in my local machine.~/go-mysql-server/_example$ go build ~/go-mysql-server/_example$ ./_example
$ mysql --host=127.0.0.1 --port=3306 --database=mydb -u root Reading table information for completion of table and column names You can turn off this feature to get a quicker startup with -A Welcome to the MySQL monitor. Commands end with ; or \g. Your MySQL connection id is 1 Server version: 5.7.9-Vitess Copyright (c) 2000, 2022, Oracle and/or its affiliates. Oracle is a registered trademark of Oracle Corporation and/or its affiliates. Other names may be trademarks of their respective owners. Type 'help;' or '\h' for help. Type '\c' to clear the current input statement. mysql> select * from mytable; +----------+-------------------+-------------------------------+---------------------+ | name | email | phone_numbers | created_at | +----------+-------------------+-------------------------------+---------------------+ | Evil Bob | evilbob@gmail.com | ["555-666-555","666-666-666"] | 2018-04-18 09:41:13 | | Jane Doe | jane@doe.com | [] | 2018-04-18 09:41:13 | | John Doe | john@doe.com | ["555-555-555"] | 2018-04-18 09:41:13 | | John Doe | johnalt@doe.com | [] | 2018-04-18 09:41:13 | +----------+-------------------+-------------------------------+---------------------+ 4 rows in set (0.00 sec)
- 1356: add tests for sql type Zero() functions
Implementers of GMS might expect similar values to be returned by Convert() and Zero(). For decimal and enum implementations this was not the case and has been fixed. - 1355: Allow any select statement for
CREATE TABLE AS SELECT ...
Also fixes a semantics bug in the schema produced by some such statements. - 1354: Bug fix for pushdownSort handling missing cols qualified with a table name
The Dolt bump for my GMS change to fix an alias issue in sort node pushdown triggered an error with matching missing column names now that we can include qualified column names.
This PR adds a repro for that case to GMS and fixes the issue by ensuring we create aUnresolvedQualifiedColumn
when the missing column is qualified with a table name. I've run Dolt tests locally and confirmed there shouldn't be any other test failures in the next bump. - 1353: Fix panic for
show keys from information_schema.columns
Does this by removing the special handling ofinformation_schema.columns
as a separate node type, treats it just like any otherResolvedTable.
In the process, effectively rewrote how we handle column default values by 1) moving most logic to happen in theOnceBefore
batch, rather than default rules, and 2) splitting it up into multiple passes that each have a single purpose. I found in the process of 1) that the previous rules had a lot of side effects and unintended ordering constraints, so introduced new rules and tweaked others to eliminate those. - 1352: Update sort node to use alias reference when it will resolve missing columns
Fixes: dolthub/dolt#3016
Other changes:- Refactors the existing OrderBy/GroupBy tests into a ScriptTest.
- Introduces a new interface,
sql.Projector
, that unitesGroupBy
,Window
, andProject
if a caller just needs to get the projected expressions.
- 1351: Add support for database collations
This allows setting the database collation, such that all newly created tables within a database (that do not explicitly set their collation) will inherit the database collation.
Builds on dolthub/vitess#199 - 1350: Subquery row iter fix field indexes
Recent changes to subquery scope visibility use the scope to communicate column definition availability; i.e., we do not pass the scope into subqueries we have determined to not depend on the outer scope, marking the same scope as cacheable. This analysis change needs a corresponding runtime change to indicate whether the scope row is expected at execution time. - 1349: Skip process tracking when prepreparing queries
When we prepare a statement, theQueryProcess
node we create is bound to the current context's PID. This is not the same PID as the context that will execute a statement created from that template, which results in ProcessList metadata not being properly cleaned up after a query has finished processing.
Fixes: dolthub/dolt#4601
I couldn't find a great way to test this in the GMS package, but I'm working on a test in dolt that I'll link to shortly. - 1348: fix visibility for on duplicate key update
Currently, we treatplan.InsertInto.Source
independently fromDestination
, and is not considered one ofInsertInto
's children. It is evaluated much later in the analysis process in the ruleresolveInsertRules
in a similar way as subqueries (we recurse the analyzer on it). This is problematic if we want to reference tables fromSource
.
In this PR, I resolve the tables forInsertInto.Source
and added extra logic to correctly index those tables' columns.
There is a special case foron duplicate key update <expr>
in that the LHS of the expr can only seeInsert.Destination
while the RHS can seeInsert.Destination
andInsert.Source
.
Partial fix for: dolthub/dolt#4562
Note: This does not work for CTEs
This is only kind of a fix for the issue. The right way to fix this is probably to completely resolveInsertInto.Source
before doing anything else, but I wasn't able to get that working yet. - 1343: allow adding new primary key to table with > 1 row iff it has auto_increment
fix for: dolthub/dolt#4581
tests in dolt becausememory.Table
doesn't implementRewriteableTable
dolthub/dolt#4593 - 1341: fix comparison for geometry types
fix for: dolthub/dolt#3451
Maybe all geometry comparisons should just default to their EWKB formats; pretty confident this is what MySQL does. - 1339: Update DateAdd/DateSub to return correct types.
This change fixes dolthub/dolt#4376
Previous implementation was hard-coded to return sql.Date, but now we are following the MySQL standard and return a type based on the inputs.
The tests in the repo are verifying that the correct data is returned, but we're not testing the actual SQL data that is received, so none of our tests are catching this case yet. We should open a new work item to start testing the actual SQL that's being returned by dolt. For this bug, testing was performed using a local version of dolt with these changes.
The failing query from the original bug is now working:SELECT NOW(), DATE_ADD(NOW(), INTERVAL 14 DAY), dolt_version(); +----------------------------+----------------------------------+----------------+ | NOW() | DATE_ADD(NOW(), INTERVAL 14 DAY) | dolt_version() | +----------------------------+----------------------------------+----------------+ | 2022-10-18 18:53:24.406345 | 2022-11-01 18:53:24.406345 | 0.50.4 | +----------------------------+----------------------------------+----------------+ 1 row in set (0.02 sec)
- 1337: add support for
GeometryCollection
pt. 3
Part 10 of fix for: dolthub/dolt#3638
Changes:- these functions no...
June 2022 Release
This is a periodic rollup release. It contains many features, bug fixes, and improvements.
The core API has still not stabilized and will not be guaranteed until 1.0.
Merged PRs
go-mysql-server
- 1056: Allow
REGEXP
args to be converted
fix for #1055 - 1054: Fix prep AS OF
Prepared AS OFs errored when the asof target was not a bindvar. All of our previous tests treated ASOF also as a bindvar.
companion PR: dolthub/dolt#3592 - 1053: Added type wire tests
This enforces that we're returning the correct data over the wire (which may differ from the storage values). Majority of the fixes made have been moved to the parent PR (#1044) so that this one is not quite as large. They're all fairly straightforward, hence I don't think they require review.
The expected test output has been validated against MySQL (using the shim). In addition, this also enforces that the results are coming from theType.SQL()
function by running the output of asql.RowIter
(gathered from the directly querying the engine) through theType.SQL()
function and ensuring it matches. - 1052: enginetest: Parameterize query plan tests
- 1051: Better join commutativity count
We excluded joins on the basis of join factors in the logical join tree,
rather than distinct join subtrees subject to commutativity. The
difference is that we have to permute n! to optimize the search space
of all valid commutative trees, versus a k^n (k = table rows) execution
runtime. 12! blocks analysis, k^12 is steep but can be OK depending on
cardinality and indexes. We impose no limits on k^n joins. - 1047: sql/parse: Test round-tripping
sql.Type
as string - 1046: Introduced OrderedIndex interface to deal with indexes that don't return ordered results
- 1044: Type value changes & row type assertions
This PR has two primary goals:- All types now pass their values around in a form most suitable to that type.
- Enforce that the aforementioned types are always passed to integrators, such that integrators do not need to do type validation on their end.
To elaborate on these points, some types already passed around values that were sensible and a best fit for that type, such asMEDIUMINT
returning anint32
. Other types, such asDECIMAL
, passed around strings, which necessitated conversions for integrators to be able to properly persist the values. Not only that, there is currently no guarantee that a row's values each have their best fit type (aBIGINT
can work with anint8
, but it should be able to always expectint64
). To make these guarantees, I'm adding a check at all GMS-integrator junctions that pass asql.Row
and verifying that the value types are exactly what that column expects.
This may have the side effect of changing the output of aSELECT
statement for integrators. As aSELECT
statement simply returns asql.RowIter
, additional logic will be needed to convert all values to their canonical MySQL representation. This can easily be achieved by passing all values throughType.SQL()
before display.
- 1043: enginetest: Validating Enginetest Harness
- 1042: exclude procedure aliases from showing information_schema.routines table
Not show procedure aliases in information_schema.routines table
Dolt PR tests this change - 1041: cleanup enginetest
Splits upenginetest/enginetest.go
intoinitialization.go
andevaluation.go
. The intent is to leave only test banks inenginetest.go
This is a pure refactor, no logical changes were made. - 1038: Added a few foreign key tests
- 1037: Parallelize IndexedTableAccess on DoltDiff tables
- 1033: fix drop current database with case insensitive name
- 1032: Bug fixes for ModifyColumn related to primary keys
- 1029: fix some column values of information_schema.columns table
Fixescolumn_key
,character_maximum_length
,data_type
andcolumn_type
column values ofinformation_schema.columns
table - 1027: Moved responsibility for rewriting tables on add / drop primary key into engine
- 1026: More tests converted to new format
- 1025: adds
mod()
function
fix for: dolthub/dolt#3423 - 1022: Rewrite table for drop column
- 1021: Allow SHOW CREATE PROCEDURE for external procedures + display a fake CREATE PROCEDURE
A note regarding the fakeCREATE PROCEDURE
. It seems that Java's JDBC checks that the procedure creation statement is a valid statement, regardless of whether the statement remotely matches that of a stored procedure. So rather than returning a random statement, we return a "valid"CREATE PROCEDURE
statement.
Fixes dolthub/dolt#3428 and dolthub/dolt#3424 - 1020: example memory.NewTable add "PrimaryKey:true" flag.
example memory.NewTable add "PrimaryKey:true" flag. - 1019: UUID function is no longer based off current time
fix for: dolthub/dolt#3323 - 1018: support
SRID
for spatial type column definition
Added SRID value syntax functionality for column definition.
Added tests cover CREATE TABLE, ALTER TABLE ADD/MODIFY COLUMN, and INSERT statements.
Fixes dolthub/dolt#3425 - 1015: Thread safe procedure cache
- 1014: Added field for connected db to logger
- 1013: Moving transaction initialization before query analysis
Moves transaction creation ahead of query analysis, so that queries can be executed with the latest committed state from other transactions.
Fixes: dolthub/dolt#3402 - 1011: Add FK Checks for Insert on Duplicate
- 1009: sql/analyzer: Fix reresolveTables to not drop IndexedTableAccess and DeferredAsOfTable nodes on its transform.
enginetest: Add a test to assert IndexedTableAccess in a prepared statement
behaves correctly when querying a table that has been modified since the
prepare. - 1008: Bug fix for rewriting table during modify column
- 1007: Refactor
grant_tables
tomysql_db
Mostly renaming variables and files to better reflect what they are now.
Added flatbuffer files for MySQL DB. - 1005: Allowing "DEFAULT NULL" for blob column definitions
Fixes: dolthub/dolt#3388 - 1004: Support for rewriting tables on column modifications
- 1003: Replace statements that order by primary key, to use indexed table access
- 1002: New interface to rewrite table on certain schema change operations
Interface isn't quite final but this can be checked in. Complementary dolt changes are done and tested but want to get feedback on this approach first. - 1000: Regression test case for dolthub/dolt#3247
dolthub/dolt#3247 reported a panic that our test cases didn't cover. Another commit (324e43b) already fixed the panic, so this PR just adds a quick test to ensure we don't regress with the same bug. - 997: adding
COLLATE
toSHOW CREATE TABLE
Fix for: dolthub/dolt#3351
Also fix for: dolthub/dolt#3352 - 995:
CheckpointHarness
and analyzer mutexes - 992: Allow unresolved tables and procedures in trigger body in CREATE TRIGGER
In order to allow non-existent tables and non-exitent procedures in trigger body in CREATE TRIGGER statement, we no longer run analyzer on trigger body, instead all validation checks are performed on a single rule,validateCreateTrigger
.
Added tests for cases of di...
October 2021 Release
This is a normally scheduled quarterly release of the library with many improvements and feature additions.
APIs are not guaranteed to settle until 1.0.
Merged PRs
go-mysql-server
- 594: sql/plan: Fix join iterators to always Close() their secondary RowIter. Fix Subquery to Dispose its subquery.
- 593: Make some update queries determinate
- 591: Introduced a ViewProvider extension
As part of this, got rid of the IndexRegistry and ViewRegistry on the context, put them on the Session instead. These changes significantly simplify the process of constructing an engine and running queries. - 590: Add dolt discord to readme
- 588: Made it possible to use variables in AS OF expressions
As part of this, also pulled resolving variables out of the resolve_columns step into its own rule. - 583: Update INNER JOIN Alpha
- 582: sql/plan: exchange.go: Make Exchange's RowIter wait for all goroutines to shutdown cleanly.
If we do not block on shutting down the goroutines that are handling the
partitions, we cannot guarantee that we will not race with later uses of the
sql.Session, for example.
This converts the implementation to x/sync/errgroup and restructures things
quite a bit. - 581: Skip fk validation on CREATE TABLE when fks are disabled
- 579: Fixed panic on using an alias of a subquery expression in a where clause (now just an error)
- 578: enginetest: Add BrokenQueries tests for projections in group by nodes not handling subquery expressions well.
- 577: sql/analyzer: validation_rules: Disable validateSubqueryColumns, which was enabled in the latest releases but is still broken in some cases.
- 576: Add Read Only transaction functionality
- 575: Fixed bug in indexed joins
When a query had two copies of the same table, and it could use an index to join them, it was non-deterministic which of the tables' (identical) indexes would be used. This choice doesn't matter for some implementations or even most queries, but in Dolt, if a query involves the same table at two different revisions (because of AS OF), it was arbitrary which table's index got returned. Because the dolt index implementation gets its data from its parent table, if it chose the wrong index it got the wrong data.
This change restricts the choice of index to a single table name.
Need tests demonstrating the bug, but those will have to live in Dolt for the time being. - 573: Add several features that unblock mysql workbench
This pr implements- Show STATUS
- Set CHARACTER SET
- 572: Add the EXISTS Operator for Select Where Filters
Adds the exists operator as described here: https://dev.mysql.com/doc/refman/8.0/en/exists-and-not-exists-subqueries.html - 568: sql/analyzer: indexed_joins: Keep searching for usable indexes when there are NOT(? = ?) and NOT(? <=> ?) clauses in a join conjunction.
The indexed join logic is perfectly happy to go forward with a join plan if it
doesn't find a perfect index, or even a usable index at all, for one or more
table factors in the join. This logic is probably left over from when that was
not the case, but for now we make it a little more liberal to cover some cases
we need to cover for a customer. - 567: sql/plan,analyzer: Fix HashLookup for cases where there is a schema prefix not visible to the direct join parent.
This adds aTransformUpCtx
function that passes along aSchemaPrefix
in theTransformContext
if the schema prefix is scrutable from resolved children at that point in the analysis.apply_hash_lookup
makes use of this to make the transformed expressions in theHashLookup
lookup node refer to the right place, even whenJoinNode.Left().Schema()
doesn't have the whole prefix.
This schema prefix is probably useful in other places and I am exploring rationalizing certain places where the analyzer makes use of the schema by using it or something like it.
In the mean time, converted some of the more obscure transform variants (UpWithParent, UpWithSelector) to use TransformUpCtx as well. Held off on moving TransformUp to TransformUpCtx.
Very open to suggestions on names for TransformUpCtx. - 566: sql/analyzer: Make SubqueryAlias nodes always cacheable.
- 565: sql: Removing ctx parameter from Expression.WithChildren.
Also removes it from FunctionFn types, TransformExpression... functions, and Aggregation.NewBuffer.
We think thisctx
parameter might have been added a few months ago as part of some optimization work which never made it across the line. Instead of threading a*sql.Context
everywhere, if we have need of a*sql.Context
during analysis or to precompute or materialize a certain result in the future, I think I'm going to advocate for a specific optional interface that the analyzer is aware of. It could then pass the context through at a specific analyzer phase and the function/expression node would have the opportunity to get ready to do what it needs to. - 564: sql/plan: Defer returning error from IndexedInSubqueryFilter.RowIter until the Next() call.
Fixes some interactions between INSERT IGNORE INTO and expressions which will
fail on evaluation.
This might be a credible strategy everywhere we .Eval within RowIter, for
example, in indexed_table_access. - 563: Added an extension point for custom function providers
As part of this, embarked on a major refactor:- Extracted interfaces for sql.Catalog and sql.ProcessList
- Moved existing sql.Catalog to analyzer package
- Moved ProcessList and MemoryManager out of Catalog
- Changed Analyzer and Engine to take a DatabaseProvider instead of a Catalog
- 560: Update some analyzer rules and expression behavior to deal with tuples in a more principled way.
Subquery expression nodes can now return tuples.
InSubquery expression nodes can work with tuples as expected.
An analyzer validation step now returns operand errors in more cases, expecting
almost all expressions to return one column, but special casing certain
operators and functions which support tuples.
Added some TODO tests for cases where our tuple comparisons are still not
behaving as we want them to. In particular, null safe vs. non-null safe
comparisons and type coercion of tuple subtypes in comparisons still need work. - 559: sql/analyzer: aliases.go: Make sure we add the right table alias when traversing a DecoratedNode.
- 557: sql/analyzer/optimization_rules.go: moveJoinConditionsToFilter: Fix small inaccuracy where computed topJoin could be wrong.
This could result in the optimization pass adding the same Filter node to
multiple places in the join tree. - 556: sql/plan/indexed_table_access.go: Change static index lookup nodes to not keep returning their unused key expressions.
These needs take key expressions, but they do not evaluate them. They are not
part of the evaluation tree, but they cause some problems with things like
FixFieldIndexes and prune_columns, where all GetField expressions in the plan
tree are expected to resolve to fields that are actually in scope and
resolvable at the node that is being evaluated.
This fixes a particular evaluation bug where a subquery expression in a join
condition gets moved to the filter above the joins. If the moved subquery
expression made use of a static index table lookup, the moved expression would
fail to rewrite its field indexes appropriately and the query would return
incorrect results. - 555: Fix handling of secure_file_priv for LOAD DATA
- 554: Added locks, todo to mutable session state
- 553: Wrote proper number conversion logic
Previously we were relying on thecast
library, which had a few issues with how we expect SQL numbers to function. One such issue was how number strings were handled (such as interpreting"01000"
as a binary number rather than in decimal). There have been more issues in the past that have gone undocumented, but this should fix them (or allow for an easy fix since it's all custom logic now). - 552: Fixed actual default collation representative
- 551: Check Error when building Session
- 549: Mutable DatabaseProvider refactor
- 548: Fix Resolve Defaults
- [547](https:/...
June 2021 Release
This is a development release, including several new features and performance improvements.
Merged PRs
go-mysql-server
- 445: Allow limits and offsets to use value args (? in prepared statements)
This fixes #439 - 441: Test fixes for savepoints
- 440: /{.github,go}: bump min go version to 1.15
- 437: /.github/workflows/format.yml: enable autoformatting
- 435: /.github/workflows/bump-dependency.yaml: remove working dir, doesnt exist
- 434: Rewrote transaction tests to be much more succinct, and added several more
- 433: Fixed non-deterministic test query
- 432: Repro test case for ErrFieldMissing bug
- 431: Fixed foreign key errors in NTC test setup
- 430: Fixed bug in field indexes related to pushdown of indexes in subqueries
- 429: Added statement boundaries
- 427: Initial percent_rank fn
- percent_rank function implemented, which is basically the step function quantile of column-sorted rows within a partition.
- moved helpers to
window/window.go
- created
WindowExpression
interface to facilitate helper function re-use (there are multiple ways of doing that, extra interface isn't super necessary but made sense to me at the time)
todo: - bats with great expectation specific query
- 426: added ErrIndexOutOfBounds test
- 425: Bug fixes for date comparisons
This fixes dolthub/dolt#1572 - 424: Transaction support
This change:- Moves a bunch of transaction and other session management code out of Handler into Engine
- Introduces a new set of tests around transactions
- Fixes the error message for duplicate key violations
- Removes AsyncNode and related code
- Eliminates duplicate query parsing
- 423: Vinai/distinct expressions
This pr does the following- Fix incorrect behavior with AVG
- Enable the use of DISTINCT within aggregation eg. SUM(DISTINCT )
- Enables the use of DISTINCT on aggregation that is grouped
- 422: Fix for hanging joins when the secondary table is empty using an in-memory strategy
This fixes #222 - 421: fix bug in join search
- 420: Update sqlogic test harness and bump sqllogic dependency
This pr enables users of gms to run and parse the sqllogic harness against all of the sqlogictests (with filtering). It also bump the sqllogic dependency - 419: [WIP]: LastIncrementId and name resolution collision fix
- 418: Add ISNULL(expr) and compatible with Navicat Premium
When I use Navicat Premium 15.0.28 to design tables.
Get error:1105 - function: 'isnull, maybe you mean ifnull?' not found
After I implementISNULL(expr)
.
Get error:1105 - column "PARTITION_DESCRIPTION" could not be found in any table in scope
This RP is fixed this problem. - 417: set autocommit status flag
- 416: ReleaseSavepoint fix
- 415: Rollback and savepoint support
- 414: Vinai/show create table check constraints
This pr allow for the printing of check constraints during "SHOW CREATE TABLE" - 410: Commit transactions without beginning them, to support commit without a selected database
- 409: Add the partitions table
Adds the empty partition table - 404: sql/analyzer: Push filters under SubqueryAliases which they apply to.
This allows optimizations to be applied to the filter clauses within the SubqueryAlias node, including pushdown and index lookup. - 403: sql/analyzer: Split analysis of subqueries so that once-after rules are applied uniformly.
This makes is so that rules through default-rules get applied as part of
analyzing the subquery. Later, early in the once-after phase, union and
subquery queries get their own once-after processing.
This allows us to keep the node tree in a state where we can apply
transformations and optimizations across Opaque node boundaries, and come back
for further processing later. - 402: Added _binary support & REGEXP_LIKE
RemovedREGEXP_MATCHES
as it does not exist in MySQL. It is a PostgreSQL function.
5.7 documentation: https://dev.mysql.com/doc/search/?d=12&p=1&q=regexp_matches
8.0 documentation: https://dev.mysql.com/doc/search/?d=201&p=1&q=regexp_matches
PostgreSQL: https://www.postgresql.org/docs/9.6/functions-matching.html
It was added before we forked the project, so I removed it. Not sure why it was ever added to begin with, as no MySQL client will ever expect it nor know how to interact with the results. - 401: Vinai/json contains
Adds the json_contains function - 400: Support for transactions. Start transaction and Commit implemented so far, as well as @@autocommit handling
Needs tests, which are harder to write than standard engine tests - 398: Fixed view test script
- 397: Changed default handling & delay fk table resolution
- 396: /server: Unwrap
netutil.ConnWithTimeouts
in pollForClosedConnection - 395: Refactor nil check handling
nil responses won't throw check constraint errors anymore - 394: Fixed bug that only allowed some select statements as view definitions
- 392: Fixed REPLACE logic to now match MySQL
Previously we thought thatREPLACE
was aDELETE
thenINSERT
, but this is not the actual order that MySQL takes. According to the MySQL documentation (https://dev.mysql.com/doc/refman/8.0/en/replace.html):Special emphasis on the While, as you have to keep trying until you either get a different error, or it succeeds. So this has now been duplicated.MySQL uses the following algorithm for REPLACE (and LOAD DATA ... REPLACE): Try to insert the new row into the table While the insertion fails because a duplicate-key error occurs for a primary key or unique index: Delete from the table the conflicting row that has the duplicate key value Try again to insert the new row into the table
- 391: Install a format checker script and workflow
- 390: Adds the key column usage table
https://dev.mysql.com/doc/refman/8.0/en/information-schema-key-column-usage-table.html - 389: Vinai/constraints table
Created new pr due to past problems - 388: Added savepoint no-ops
- 387: server/handler.go: Use exposed mysql.Conn.Conn instead of trying to track it through Accept.
- 385: fix deadlock with exchange node go routines
This fixes deadlocks in exchange node go routines which can happen when the exchange node is closed before all the data is read as occurs with a query that includes a limit clause. This also can occur if the context is canceled. These deadlocked go routines will retain references and prevent garbage collection. - 382: Fixed system variables
- 381: Vinai/insert ignore into
This PR adds support for INSERT IGNORE INTO. It is currently missing functionality for ignoring typer errors. After scoping the type errors component of this it looked to a lot of work to capture all of them with the correct MySQL notation
I also noticed a lack of partition support in MySQL terms as well so I ignored that. - 380: add existing row to unique constraint failure
- 379: sql/plan/if_else.go: Handle conditional expressions evaluating to null as false.
- 378: sql/{plan,analyzer}: Add hash lookup functionality for equality condition joins on already cached results.
- 377: sql/plan/join.go: Improve correctness and tes...