infiniflow / infinity
The AI-native database built for LLM applications, providing incredibly fast full-text and vector search
Home Page: https://infiniflow.org
License: Apache License 2.0
A short, clear and concise description of what the bug is.
Steps to reproduce the behavior. Bonus points if those are only SQL queries.
Branch: main
Parent Issue
Line 5 in e3ee494
You cannot set the generator in CMake; it is a read-only variable. It is specified by the -G option to CMake and, once picked, cannot be changed. This could be changed to a fatal error if the generator is not Ninja.
My stack is all C# and Azure. I don't want to use any Python code or interop.
A .net API please?
I use Azure RAG now.
Massive c# community.
No response
The current full-text index is based on the iresearch library, which is tightly bound to document-oriented data models and does not support real-time indexing.
We need a new full-text index implementation built from scratch, so that it works more smoothly with infinity, with higher performance and real-time indexing.
Exception occurred during concurrent operation
SizeT thread_num = 16;
SizeT total_times = 2 * 10 * 1000;
Branch: main
In the current interface of the catalog module, many functions have multiple return values. However, instead of using a tuple or pair as the return value, we currently place the outputs in the function parameters and obtain them by reference.
Use a tuple as the return value of such functions.
No response
No response
No response
Nano benchmark source code needs to be removed from the git history to reduce the overall repository size.
Currently, index creation is recorded as a logical log. The index file needs to be rebuilt when replaying the log, resulting in slow replay.
Instead, write the path of the index file flushed to disk into the WAL file.
No response
No response
Unnecessary data copy from `ColumnBuffer` to `ColumnVector`.
Read from file directly into `ColumnVector`.
Remove `ColumnBuffer`.
`GetColumnVector` in `BlockColumnEntry` loads the column of an entry from disk. The lifetime of the returned column vector's data is managed by the buffer_manager. The `Varchar` type uses `FixHeapManager` to allocate and read/load chunks; one chunk is mapped to one outline file on disk.
No response
No response
No response
Branch: main
docker image id 1f1ebe620523
Hardware: MacBook Pro, Intel Core i7
OS type: macOS Ventura 13.6.1
Others: Docker Desktop for macOS, Version 4.24.0 (122432)
# librae @ mbpl in ~/work/repo/infinity on git:main o [21:18:48]
$ docker images
REPOSITORY TAG IMAGE ID CREATED SIZE
infiniflow/infinity latest 1f1ebe620523 5 days ago 122MB
nodered/node-red latest aad8a8d13b50 3 months ago 549MB
# librae @ mbpl in ~/work/repo/infinity on git:main o [21:23:13]
$ docker run -d --name infinity -v /tmp/infinity/:/tmp/infinity --network=host infiniflow/infinity bash ./opt/bin/infinity
eb9bf7949bab2474fca51e3852f0ad77d38f2e49bf6fedf5cdda97af0cee80db
# librae @ mbpl in ~/work/repo/infinity on git:main o [21:25:18]
$ docker ps -a
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
eb9bf7949bab infiniflow/infinity "bash ./opt/bin/infi…" 8 seconds ago Exited (126) 7 seconds ago infinity
# librae @ mbpl in ~/work/repo/infinity on git:main o [21:25:25]
$ docker logs infinity
./opt/bin/infinity: ./opt/bin/infinity: cannot execute binary file
Expect the docker container to run successfully.
docker run -d --name infinity -v /tmp/infinity/:/tmp/infinity --network=host infiniflow/infinity bash ./opt/bin/infinity
Additional information
No response
The secondary index is used for numeric filtering. It is composed of two parts:
The mechanism of range filtering with the secondary index is as follows:
No response
No response
No response
No response
The current task-scheduling strategy is to round-robin **all** tasks in a `PlanFragment`.
For a task that depends on other tasks, plain round-robin simply schedules it on a random (next) CPU.
For example, assume a completely serial fragment of length 16 with no parallel tasks.
The current strategy will schedule the 16 tasks across 16 different CPU cores.
The problems are:
1. Some cores are allocated a not-yet-ready task, which is re-checked every time the CPU is runnable.
2. The context-switch cost is large.
The scheduler could allocate tasks that have a dependency relation to the same CPU and preserve their order.
Schedule a task only when it is runnable.
No response
No response
No response
The BOOL type should be stored bit-packed, similar to `std::bitset`.
No response
No response
No response
1. Each import creates a new segment and imports data into new blocks in that segment. Blocks that are not filled may waste disk space.
2. Compaction also removes deleted rows to save disk space.
3. The index is created at segment granularity; small segments degrade index performance.
4. Index rebuild is not addressed by this issue.
A background task scans the table periodically; if segments can be merged, it merges them.
No response
No response
No response
Allow construction of the KNN index (HNSW) in parallel.
There was no unified error message and error code before. For software errors, perhaps let infinity crash and provide a backtrace. For recoverable errors, we need an error code and error message returned to the client.
Provide a unified error code and error message to return to the client.
success
0000 ok
auth error
2001 password is wrong
2002 insufficient privilege
syntax error or access rule violation
3001 invalid username
3002 invalid password
3003 invalid db/schema name
3004 invalid table name
3005 invalid column name
3006 invalid index name
3007 invalid column definition
3008 invalid table definition
3009 invalid index definition
3010 data type mismatch
3011 name too long
3012 reserved name
3013 syntax error
3014 invalid parameter value
3015 duplicate user
3016 duplicate database
3017 duplicate table
3018 duplicate index name
3019 duplicate index
3020 no such user
3021 database does not exist
3022 table does not exist
3023 index does not exist
3024 column does not exist
3025 aggregate can't be in where clause
3026 column name in select list must appear in group by or aggregate function.
3027 no such system variable
3028 set invalid value to system variable
3029 system variable is read-only
txn error
4001 txn rollback
4002 txn conflict
insufficient resources or exceed limits
5001 disk_full
5002 out of memory
5003 too many connections
5004 configuration limit exceeded
5005 query is too complex
operation intervention
6006 query_canceled
6007 not supported
system error
7001 io_error
7002 duplicated file
7003 config file error
7004 lock file exists
7005 catalog is corrupted
7006 data corrupted
7007 index corrupted
7008 file not found
7009 dir not found
No response
No response
Parent Issue
SELECT a , b FROM test_table_star where a =4
Steps to reproduce the behavior. Bonus points if those are only SQL queries.
SELECT a , b FROM test_table_star where a =4;
Branch: main
No response
There are a lot of forward declarations of classes that are actually defined in other modules, which is incorrect.
For instance, here `TableCollectionEntry` is declared in the module logical_fusion, which contradicts the fact that it is actually defined in the module table_collection_entry.
No response
...
No response
Infinity needs the min/max value of each column in a segment/block. With this information and the condition expression, infinity can filter out some segments/blocks before the table scan.
Currently, I suppose this information will be co-located with the segment/block information in the catalog.
Feature Request
Supports SQL LIMIT clauses
e.g. select * from t1 limit 3 offset 1
COPY NATION FROM 'test/sql/copy/nation.csv' WITH ( DELIMITER ',' );
crash
Steps to reproduce the behavior. Bonus points if those are only SQL queries.
nation.csv
1,2,
3,4,
Branch: main
Restart the server after running the function.
error message:
"terminate called after throwing an instance of 'infinity::StorageException@infinity_exception'
what(): Storage Error: index_def_meta should have at least one entry @src/storage/meta/entry/table_collection_entry.cpp:410"
Branch: main
The current task model is synchronous; IO operations block the task.
Refactor tasks to allow suspend and resume when IO happens.
TODO
No response
No response
Blocking occurs when multiple threads create a Database.
Branch: main
OS: Ubuntu
Statements:
CREATE TABLE mytable (
id INTEGER PRIMARY KEY,
name VARCHAR(50),
age INTEGER
);
INSERT INTO mytable (id, name, age) VALUES (1, 'John', 30);
INSERT INTO mytable (id, name, age) VALUES (2, 'Jane', 25);
SELECT * FROM mytable;
Error Message:
Executor Error: Not value expression. @src/executor/operator/physical_insert.cpp:25
Branch: main
kould-21j0
description: Computer
width: 64 bits
capabilities: smp vsyscall32
*-core
description: Motherboard
physical id: 0
*-memory
description: System memory
physical id: 0
size: 28GiB
*-cpu
product: AMD Ryzen 7 7735H with Radeon Graphics
vendor: Advanced Micro Devices [AMD]
physical id: 1
bus info: cpu@0
version: 25.68.1
size: 2311MHz
capacity: 4828MHz
width: 64 bits
Distributor ID: Ubuntu
Description: Ubuntu 23.04
Release: 23.04
Codename: lunar
Imported 9000 rows, but in fact only 808 are present; this can be reproduced repeatedly.
After importing 9000 rows, select * from table
should display 9000 rows.
kould=> CREATE TABLE test_limit (c1 int, c2 int);
OK
----
(0 rows)
kould=> COPY test_limit FROM '/home/kould/CLionProjects/infinity-k/test/data/csv/test_limit.csv' WITH ( DELIMITER ',' );
IMPORT 9000 Rows
kould=> select * from test_limit;
Tips: Use the csv file attached below
SELECT a + 1, b FROM test_table_star
Steps to reproduce the behavior. Bonus points if those are only SQL queries.
SELECT a + 1, b FROM test_table_star
Branch: main
I created a table with a Varchar field and inserted a string into that field, after which an exception occurred.
Tips: src/function/cast/varchar_cast.h:47
create table t7 (a int primary key, z varchar(298) unique null);
insert into t7 (a, z) values (1, 'k');
Branch: main
Parent Issue
CREATE TABLE mytable (
id INTEGER PRIMARY KEY,
name VARCHAR(50),
age INTEGER
);
INSERT INTO mytable (id, name, age) VALUES (1, 'John', 30);
INSERT INTO mytable (id, name, age) VALUES (2, 'Jane', 25);
The system crashes on a SQL syntax error, e.g.
show * from t1
(where t1 is a table name)
or when pressing Tab on the keyboard while the statement has a syntax error.
Branch: main
No response
After this commit:
commit c5d004a
Author: shen yushi [email protected]
Date: Fri Dec 22 16:30:19 2023 +0800
Try to fix CI bug. Add more log. (#351)
* Fix bug: add lock in `BufferObj` when close file. Add extra log for ci debug.
* Remove lock and add log.
When I run the slt test from scratch, everything is OK. Then I shut down the server and restart it. The following crash information is given:
[23:51:37.194] [120875] [info] Load base catalog1 from: /tmp/infinity/data/catalog/META_550.delta.json
[23:51:37.196] [120875] [info] Load delta catalog1 from: /tmp/infinity/data/catalog/META_1072.delta.json
[23:51:37.197] [120875] [info] Load delta catalog1 from: /tmp/infinity/data/catalog/META_1108.delta.json
terminate called after throwing an instance of 'infinity::StorageException@infinity_exception'
what(): Storage Error: SegmentEntry::MergeFrom requires min_row_ts_ match @src/storage/meta/entry/segment_entry.cpp:46
No response
1. Clean data directory.
2. Start infinity server.
3. Run slt test.
4. After all cases passed, shutdown the server.
5. Start infinity server again, which will trigger the fault.
No response
DATE data type is not functioning
Support DATE data type
No response
No response
No response
SizeT thread_num = 1;
SizeT total_times = 2 * 10 * 1000;
Branch: main
Distributor ID: Ubuntu
Description: Ubuntu 22.04.3 LTS
Release: 22.04
Codename: jammy
https://github.com/infiniflow/infinity/blob/main/docs/build_from_source.md
Once I have git, I can use git clone, so I don't need to install git again.
sudo only works for echo
wget -O - https://apt.llvm.org/llvm-snapshot.gpg.key | sudo gpg --dearmor -o /usr/share/keyrings/llvm-archive-keyring.gpg
echo "deb [signed-by=/usr/share/keyrings/llvm-archive-keyring.gpg] https://apt.llvm.org/jammy/ llvm-toolchain-jammy-17 main" | sudo tee /etc/apt/sources.list.d/llvm17.list
sudo apt update
sudo apt install clang-17 clang-tools-17
Installing clang-17 but using clang-18
There are dependencies on lz4 and boost, but they are not installed.
No response
Build from source on Ubuntu 22.04
No response
Supports SQL ORDER BY clauses
e.g. select * from t1 order by c1
No response
Treat ORDER BY + LIMIT as a Top-N operation.
No response
No response
No response
What is the feature?
Supports aggregate operations.
How to implement the feature?
The in-memory index is based on a lock-free B-tree for both the dictionary and postings. When dumped to disk, it is compressed according to the posting format.
No response
The default dimension of `VarcharInfo` should not be 0.
src/planner/logical_planner.cpp LogicalPlanner::BuildInsertValue
create table t3 (a int primary key, z varchar unique null);
insert into t3 (a, z) values (1, 'k');
Branch: main
SELECT test_table_star.* FROM test;
Steps to reproduce the behavior. Bonus points if those are only SQL queries.
Branch: main
i5-12500, 16c, 16GB, Ubuntu 22.04
As the title says, the system crashes when using 16 threads to run query_benchmark. Running query_benchmark with 1 thread takes about 3 s, which took 2.2~2.3 s before.
No crash and no performance downgrade.
1. Checkout d4af653975c9ce4642142d9276f3904a07ade8ac (before Add new scheduler #395):
Single-thread performance OK and no crash on the multi-threaded query benchmark.
2. Checkout ada746cfa22f37ead2edcb8dfe857a3371951736 (after Add new scheduler #395):
Single-thread performance OK, but crashes on the multi-threaded query benchmark.
3. Checkout 0d199792e228e904bb5deacf1fa8edc577a0ca74 (after Add lock when set fragment task status. #401):
Single-thread performance downgraded and crashes on the multi-threaded query benchmark.
No response