UPGRAGDE NOTICE If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
If your Blobstore version is v1.1.0 or before which built with cubefs-blobstore (https://github.com/cubefs/cubefs-blobstore) , please refer to UPGRADE to v3.2.0 following these steps #1556.
meta\client
: add quota of children dentries for directory (#1763, @wuchunhuan )master
: add qps limiter for individual API on master (#1766, @wuchunhuan )client
: add audit log for fuse client (#1764, @wuchunhuan )objectnode\sdk
: objectnode supports docking erasure-code subsytem(blobstore) (#1765, @wuchunhuan)master\datanode
: Add speed control of dp decommission for disk or datanode (#1778, @bboyCH4)
master
: master support leader switching by external trigger (#1772, @leonrayang )metanode\master
: metanode support follower read (#1775, @leonrayang)master
: Add metrics for meta-partition consistency detection (#1776, @leonrayang)master
: add API statistics log for master (#1767, @wuchunhuan)datanode
: Datanode startup should not be influnced by missing data partitions (#1773, @leonrayang)master
: volume's capacity must be bigger than 120% used size (#1723, @Victor1319)metanode
: metanode support config memRatio of the machine. (#1725, @Victor1319)datanode
: datanode support config diskPathPrefix to manage disk path list. (#1727, @Victor1319)metanode
: reduce extent alloc memory when metanode snapshot (#1728, @Victor1319)metanode
: random write may cause metanode memory grow (#1729, @Victor1319)datanode
: bad disk space is calculated in datanode's total space (#1601, @Victor1319)datanode
: no premission disk can't be detected (#1602, @Victor1319)master\cli
: add display of unavailable replica for cfs-cli datapartition check (#1771 ,@true1064)datanode
: datanode should compute datapartition used size right after loading extents from disk (#1782 ,@true1064)client
:enhance support nfs access operation (#1798, @leonrayang )master
: add some metircs (#1785, @liubingxing @litao)master\datanode\client\cli
: fix the typos and format code (#1787, @liubingxing @litao)datanode
: add lack datapartitions monitor (#1789, @guojunhao)master
: create data partition exclude decommissioned disk (#1790 , @guojunhao)datanode
: Optimize the qos for random write (#1791, @litao)metanode
: speed up metanode startup (#1792, @liubingxing)cli
: add more info to cli/fsck cmd (#1793, @liubingxing @litao)
master
: master snapshot recover not reset local rocksdb info (#1522, @wuchunhuan )master
:Memory cost too fast during restart in case of data partition‘s count is magnity (#1774 , @leonrayang)client
: Readonly dp can still accept write request from client (#1779, @bboyCH4)metanode
: Metanode should not establish connection to blobstore for cold volume (#1781, @bboyCH4)client
: blockcache service may be oom when the client caches many large files concurrently (#1783, @zhangtianjiong)datanode
: too may delete requests may cause a lot of tcp connections. (#1724, @Victor1319)master
: when master creating data partition, should use vol.dataPartitionSize as datapartition.total (#1777 ,@true1064)master
: In addition to all replica status is readwrtie, still need to consider volume used space before check set datapartition status as readwrite (#1780 ,@true1064)master
: master auto creates dataparitions when volume goes from full to not full (#1784 ,@true1064)unit test
: Fix the pipeline cubefs-ci error problem (#1795, @baijiaruo)metanode\client
:do rename on the file which is soft link not worked well (#1797 , @leonrayang)meta\master
: fix some bug to improve cluster stability (#1786, @liubingxing @litao)
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
If your Blobstore version is v1.1.0 or before which built with cubefs-blobstore (https://github.com/cubefs/cubefs-blobstore) , please refer to UPGRADE to v3.2.0 following these steps #1556.
- Merge data subsystem blobstore(ec) to main branch.
blobstore
: Merge some service modules #1563
blobstore
: ClusterMgr support simple key-value persistent storage #1566blobstore
: Remove blobstore's dependency on monogdb #1493blobstore
: Remove blobstore's dependency on consul #1507blobstore
: Support audit log filtering #1506
blobstore
: Blobnode use the same erasure encoder and buff-pool libraries as Access #1498blobstore
: Separate flow control for repair and migration traffic #1508blobstore
: Retry http body under rpc request #1567
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
master
: add leaderSize param in create partition req #1456
master
: support decommission broken replica for 2-replica no leader dp #1456datanode
: in case disk full, repair size is also thought as used size #1456raft
: raftstore.monitor can be optimized #1543
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
metanode
: Show metapartition id correctly #1554metanode
: remove redundant memcopy when reading raft snapshot #1552
master
: update metanode or datanode id need drop old one before #1555metanode
: add metric mpDentryCount #1553datanode
: fallocate return interrupt err try again #1551
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
master\client\datanode
: :Provide QoS services to enhance multi-tenant isolation #1447client
: Caching acceleration #1446master\datanode
: Support two replicas data storage #1389master
: Add vol label in cfs_metanode_mpInodeCount #1482client
: Support config posixAcl for volmue #1485master
: Add datapartition count limit for datanode. #1481
datanode
: TinyDeleteRecord file will be very big when one replica is down #1433client
: EnablePosixACL is not vaild in centos system #1393blockcache
:Blockcache memory overflow and be killed #1422yum source
: Yum install script fails due to missing packages #1410unit test
: S3test random path busy to produce data and timeout #1515master
: Zone name check not strict enough while crossZone enabled #1479datanode\master
: Avoid writing operation failed because of disk full #1519
master
: Enable volume update from 3 to 2 replics #1516
UPGRAGDE NOTICE If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
master
: support to delete data replica with force #1258
master
: revert the deletion process in the tiny extent to avoid deletion failure caused by different logical and physical sizes #1439master
: add metapartition replica check to fix replicas status still writeable while metanode unavalible #1440
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
This is a significant release based on master branch with a lot of code mainly to support multi layer data layer including blobstore (Erasure-Code) storage, blobstore supports erasure coding storage method(https://github.com/cubeFS/cubefs-blobstore) to support low frequency volume and reduce cost .
Before this we released v3.0.0-beta based on release v2.4.0 (https://github.com/cubeFS/cubefs/releases/tag/v3.0.0-beta) , which make volume can be create as standard(3 replica) or low frequency access type(Erasure-Code), but not support multi data layer.
Release formal version release-v3.0.0 which based on the newest master commit, below is a list of the main features.
- BlobStore subsystem which supports erasure coding storage. #212
- multicache layer, 1st level:client local read cache; 2nd level:hot pool replica; 3nd level:code pool based on erasure code. #1291
- single data replica supported. releated project: https://github.com/oppo-bigdata/shuttle used for remote shuffle. #1227
- client hot online restart(experimental).
- optimization reduce extents fragmentation. #1318
- manage third part library with go mod.
Please refer to the documentation for details,the content already updated.
- English version: https://cubefs.io/docs/master/overview/introduction.html
- Chinese version: https://cubefs.io/zh/docs/master/overview/introduction.html
log
: support write ebs log to file and update ebs log level #212client\datanode\master
: support one replica of data storage #1227master
: support get master's raft status info #1227master
: support preload data from blobstore to cubefs #212client
: client local read cache #212client
: Time-consuming statistics of client sdk interfaceclient
: client support blobstore #212client
: support ebs index #212master
: support different type dataparitition(normal dp,cache dp,preload dp) #212master
: support mulit volume type(replcias volume,ec volume) #212project
: enable go mod and update vendor #1343datanode
: intruduce metrics degrade level to datanode #1438master
: not do ttl & updateSize for hot vol #212compile
: change go mod version from 1.14 to 1.16 and consistent with compile #1343compile
: enable go mod and update vendor #1343project
: change from chubaofs to cubefs #1343
master
: single replica create volume should set followeread with true #1227raft
: raft snapshot still running while peer remove happened which lead to the snapshot gorutine cann't end. #1227master
: Refactor finding partitins to be offlined #1189master
: Avoid race syncing node's info when setting rdonly #1189client
: follow recover handlers when closing open handler #1316master
: remove 5 GB left limit for disk to create datapartition #1189server
: if server recive kill-signal before being running-state, server will can't be killed graceful #1261
master
: not do ttl & updateSize for hot vol #212master
: support disk, datanode, metanode decommissionclient
: add cli-command for cold volume #212master
: ec-volume can't be deleted if size not equal 0 #212client
: add cfs-cli request parameters #212client
: add scan cli #212master
: add volume type to newVolCreateCmd #212master
: not support force delete cold volume #212datanode
: add cfg diskRdonlySpace for datanode #1353datanode
: use os.ReadDir instead of ioutil.ReadDir #1438datanode
: improve performance of write. data allocated didn't put back to buffer pool #1438datanode
: reduce datanode cpu usage #1438datanode
: remove unnecessary json unmarshal when getting local extents #1438datanode
: reduce datanode cpu usage by eliminate file seeks #1438client
: mitigate the pain of extents fragmentation #1318
If your ChubaoFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
- this release mainly used to optimize datanode cpu usage
datanode
feat: intruduce metrics degrade level to datanode [#1234]
datanode
remove 5 GB left limit for disk to create datapartition[#1345]fuse client
follow recover handlers when closing open handler[#1344]
datanode
improve performance of write. data allocated didn't put back to buffer pool [#1346]datanode
remove unnecessary json unmarshal when getting local extents [#1346]
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
fuse client
:support auto push data to push gatewaymaster
:reduce flow from interface of "client/partiton" that follower support it and will not redirect to leaderraft
:the receive buffer channel size of raft support to be configurablemetanode
:implement the content summarymaster
:domain for cross zone
fuse client
:fix: currAddr of stream conn is emptyfuse client
:fix: update extent cache no matter whether eh is dirty or notmetanode
:when kill metanode(mean while mp stoped firstly) and if apply snapshot happen at the same time, snapshot will be block, causeing metanode can't be killedmetanode
:meta node migration be recorded in badmetapartitions concurrently without lock which lead to mp id missmetanode
:return file nodes for statfsmetanode
:meta not use warn when consulMeta not setmetanode
:makes AppendExtentKeyWithCheck request idempotentfuse client
:push addr shadowed if export port is not setobjectnode
:readdir gets insufficient dentries if prefix larger than markerraft
:remove redundant memcopy when reading raft snapshotraft
start tcp listen before starting raftltp test
:broken test case ftest01,update and unlock ltp test casesltp test
:fix ltptest ci bugdocker
:finstall killall command in the docker imagecli
:fix cli tool typo
master
: support disk, datanode, metanode decommission and assign target nodemaster
:return response cache directly, not copy againmaster
:add vol usedRatio warn log when usage > 90%master
:add master metrics vol_meta_count to export vol dp/mp/inode/dentrymaster
:del replica with force if decommission hangmetanode
:add log for loading meta partitionsfuse client
:increase client retry times to avoid mp raft election timeout.grafna
: change grafana volume used size rate from rate to derivgrafna
:support push monitor data to gatewayobjectnode
: add subdir authorized check for objectnodeobjectnode
: tracking objectnode modification for cfs-serverraft
:check nil when get leader term inforaft
: check size when read data from heartbeat portraft
: add getRaftStatus for metanodecli
:update gitlab-cicli
:Enhancement: add ci check for ltptest resultcli
:enhance: upload docker_data when ci tests finishcli
:Create and Update ci.ymlstyle
:rename cfs to cbfs & optimize the libsdk moduledocker
:change: use ghcr instead of dockerhubbuild
: add version information and rules for building fsckbuild
:change: update build status badge
1. Purpose In the cross zone scenario, the reliability need to be improved. Compared with the 2.5 version before, the number of copysets in probability can be reduced. The key point is to use fault domains to group nodesets between multiple zones.
-
Reliable papers https://www.usenix.org/conference/atc13/technical-sessions/presentation/cidon
-
Chinese can refer to https://zhuanlan.zhihu.com/p/28417779
2. configuration
1) Master
-
Config file:master.json Enable faultDomain set item "faultDomain": true
-
Zone count to build domain faultDomainGrpBatchCnt,default count:3,can also set 2 or 1
If zone is unavaliable caused by network partition interruption,create nodeset group according to usable zone Set “faultDomainBuildAsPossible” true, default is false
The distribution of nodesets under the number of different faultDomainGrpBatchCnt 3 zone(1 nodeset per zone) 2 zone(2 nodesets zone,1 nodeset zone,Take the size of the space as the weight, and build 2 nodeset with the larger space remaining) 1 zone(3 nodeset in 1 zone)
-
Ratio 1) The use space threshold of the non-fault domain(origin zone) After the upgrade, the zone used by the previous volume can be expanded, or operated and maintained in the previous way, or the space of the fault domain can be used, but the original space usage ratio needs to reach a threshold, that is, the current configuration item The proportion of the overall space used by meta or data Default:0.90 UpdateInterface: AdminUpdateZoneExcludeRatio = "/admin/updateZoneExcludeRatio"
2) the use space threshold of the nodeset group in the domain Nodeset group will not be used in dp or mp allocation default:0.75 Update interface: AdminUpdateDataUseRatio = "/admin/updateDomainDataRatio"
2) Datanode && metanode
After the fault domain is enabled, a minimum configuration of the fault domain is constructed under the default configuration: Each zone contains 1 datanode and 1 metanode, and the zone name needs to be specified in the configuration file There are 3 datanodes and 3 metanodes in 3 zones
For example, three datanodes (metanode) are configured separately: "zoneName": "z1", "zoneName": "z2", "zoneName": "z3", Start after configuration, the master will build a nodeset for z1, z2, and z3, and component a nodesetgrp
3. Note 1) After the fault domain is enabled, all devices in the new zone will join the fault domain 2) The created volume will preferentially select the resources of the original zone 3) Need add configuration items to use domain resources when creating a new volume according to the table below. By default, the original zone resources are used first if it’s avaliable
| Cluster:faultDomain | Vol:crossZone | Vol:defaultPriority | Rules for volume to use domain | h| ------ | ------ | ------ |------ | | N | N/A | N/A | Do not support domain | | Y | N | N/A | Write origin resources first before fault domain until origin reach threshold | | Y | Y | N | Write fault domain only | | Y | Y | Y | Write origin resources first before fault domain until origin reach threshold |
Note: the fault domain is designed for cross zone by default. The fault domain of a single zone is considered as a special case of cross zone, and the options are consistent
example : curl "http://10.177.200.119:17010/admin/createVol?name=vol_cross5&capacity=1000&owner=cfs&crossZone=true&defaultPriority=true"|jq .
1. Purpose In order to query the content summary information of a directory efficiently, e.g. total file size, total files and total directories, v2.5 stores such information as the parent directory’s xattr.
The parent directory stores the files, directories and total file size of the current directory. Then only need to make recursive of the sub directories, and accumulate the information stored by the directories to query the content summary information of a directory.
2. Configuration Client config file: fuse.json 1) Enable XAttr ”enableXattr”:”true” 2) Enable Summary ”enableSummary”:”true” Both of xattr and summay have to be set if you want to mount a volume to the local disk. Set summary is enough if you want to access the volume via libsdk.so.
3. How to use There are two different ways to get the content summary of a directory. 1) Fuse mount getfattr -n DirStat yourDirPath getfattr can be installed by: yum install attr or apt install attr 2) libsdk.so cfs_getsummary (libsdk/libsdk.go)
4. Note 1)The incremental files’ summary information will be held by their parent directories. But the old files will not. Use cfs_refreshsummary (libsdk/libsdk.go) interface to rebuild the content summary information. 2)The files, directories and total file size are updated asynchronously in the background. Users are not aware of these operations, but it does increase the requests to meta servers (usually doubled). You are recommended to evaluate the impact to your cluster before using this feature.
If your CubeFS version is v2.3.x or before, please refer to the UPGRADE NOTICE in v2.4.0 for upgrading steps. And also please make sure that your fuse client or objectnode version is equal to or older than the servers, i.e. master, metanode and datanode. In another word, newer versioned client can not be used in a cluster with older versioned servers.
meta&object
introduce ReadDirLimit interface to retrieve partial results #1234fuse client
use ReadDirLimit in fuse client #1244
sdk
makes AppendExtentKeyWithCheck request idempotent #1224meta
start tcp listen before starting raft 1256raft
remove redundant memcopy when reading raft snapshot to avoid snapshot hanging 1264object
handling range read request in a behavior compatible with S3 #1286 #1298
sdk
add version in the http requests issued to master to collect client info 1262sdk
mitigate the pain of extents fragmentation 1282
If your CubeFS version is v2.3.x or before, and need to upgrade to v2.4.x , please follow the following upgrade steps:
-
first of all, firewall open two more port (17710 & 17810) in your machine to support tcp multiplexing;
17710 is calculated by 17210, the port listened by metanode, add 500 17810 is calculated by 17310, the port listened by datanode, add 500
-
then add item '"enableSmuxConnPool"=false' in your datanode's config file, upgrade all datanodes; detail steps as follow:
a. using new dataNode bin file replace the old version b. add '"enableSmuxConnPool"=false' in your datanode's config file c. restart datanode process d. update all the datanodes in the custer
-
update '"enableSmuxConnPool"=true' in your datanode's config file, restart all datanode again;
notes: before set enableSmuxConnPool as true, you must finsh step 2.
-
upgrade all metanode bin file and restart meta process;
-
upgrade all master bin file and restart master process;
-
upgrade all client bin file and restart client process;
The key point of upgrading from previous versions to v2.4.0 is to make sure ALL datanodes are upgraded to the new version with "enableSmuxConnPool" set to "false" before upgrading other components, such as metanode, master and client. Then datanode can be configured to enable smux conn pool, i.e. "enableSmuxConnPool" set to "true".
dataNode
smux support #1124metaNode
smux support #1124datanode
metanode
support TickInterval configurable #1117
master
remove token-based volume read-write permission control #1119
datanode
fix build repair task panic #1124metanode
too many open/close stream when delete limit #1124master
do split on metanode partion if inode count larger than maxinodecnt #1124datanode
dp.raftPartition nil pointer #1124datanode
fix datanode repair log print error #1124master
partition be split can be write if not full and less then on step #1124metanode
new a gauge instance every time to avoid a gauge instance modified concurrently #1124master
fix conflict for histogram data when report data concurrently #1124datanode
fix bug when packet opCode=OpExtentRepairRead and network is unstable #1124
client
export init should be after init log in fuse client #1124util
flush log before os.exit #1124datanode
format datanode log when delete #1124metanode
update log level to info if mp is not exist #1124
Note that when upgrading from versions prior to v2.3.0
to version v2.3.0
or beyond, servers(i.e. resource manager, metanode and datanode) must be upgraded before client(i.e. fuse client and objectnode) due to #1098.
client
: support multiple subdir permissions for an individual user. #1051sdk
: introducing libsdk so applications can embed the use of cubefs to a single binary and no additional process is required at runtime. #1082
datanode
: improve the shut down process of datanode so the raft apply id is persisted. #1030raft
: auto fix crc-mismatch raft log crc. #1039- improve stability of the whole system. #1098
objectnode
: fix key encoding issue of S3 ListObjects API. #953datanode
: fix data partition decommission timeout. #972metanode
: change mtime of the directory when creating or deleting dentry. #1000datanode
: fix statfs deviation when using EXT4 as the underlying local filesystem for datanode. #1031client
: func sortHostsByDistance change dp's hosts. #1106
datanode
: Introducing data partition selector which allow users to customize the client's selection logic for data partition when reading and writing, and can switch at any time. #853
client
: Add enablePosixACL configuration to enable POSIX ACL feature. #906datanode
: Improved data recovery speed. #899
datanode
: Fix the issue that the information in the memory is not released immediately after deleting the extent, which causes the status information of the data partition to be incorrect. #938datanode
: Fix the issue that repeatedly sending requests to the failed data partitions. #937objectnode
: Fix the issue that recursive making directory concurrent with sample multi-layer path prefix. #904
master
: Fix the concurrency safe issue when removing raft member. #893sdk
: Fix the panic issue while remove data partition concurrently. #894
object
: Implemented S3 api 'DeleteBucketPolicy' for bucket deletion. #757master
: Introducing management API for volume capacity expanding and shrinking. New capacity must be set more than 20% lager than used. #764master
: Introducing management API for updating node address. #813master
: Introducing management API for checking nodes. #813metanode
: Add header and checksum verification for EXTENT_DEL files. #813cli
: Add logging for CLI tool, and added several commands to call master API. #764 #801compile
:Support direct compile or docker cross compile on Arm64 platform. #779client
: Introducing nearRead option that allow client read data priority from the nearest DataNode to improve reading performance. #810client
: Update volume follower read config from master periodicity. #837
object
: Improved compatibility for ListObjects and ListObjectsV2 interfaces. #769master
: Check whether the used space between the replicas is consistent when performing the load partition operation. #813metanode
datanode
: Optimize batch delete Extent; add autoRepairLimitRater on DataNode. #781datanode
: Introducing autoRepair option for limit data repair speed on DataNode. #842datanode
: The data repair task will skip process extent file which has been already deleted. #842
master
: When loading metadata of cluster, using the current ID instead of the loaded ID. #821metanode
: Fix the deadlock problem while MetaNode deleting dentry. #785metanode
: Fixes #760 Add inode to freeList when NLink is 0, and delete inode 7 days later. #767datanode
metanode
: When DataNode and MetaNode start, the partitions not exist in cluster view will be renamed to expiredPartition and skip loading. #824datanode
metanode
: Fixes issue#698, Raft instance delete itself by applying ConfChange raft log. #866metanode
: MetaNode may not free space. #838client
: Fix the problem that client opening file with outdated extents information. #783client
: Batch inode get mechanism is out of service. #847client
: Fix the problem that makes authorized user mount volume failure. #828datanode
: Fix several issue in automatic repair process for tiny extent. #855 #857datanode
: Fix the problem that the client still applies to create a new extent to a data partition that has no space. #867
- Updated Q&A, environment&capacity planing documentation. #801
console
: CubeFS Console is a web application, which provides management for volume, file, s3, user, node, monitoring, and alarm. The CubeFS Console makes it much easier to access services provided by CubeFS. #728
Please refer to https://cubefs.readthedocs.io/en/latest/user-guide/console.html to start the Console.metanode
: Provide compatibility verification tool for meta partition. #684object
: CORS access control. #507fuse
: Introduce fsyncOnClose mount option. Choose if closing system call shall trigger fsync. #494
Release2.1.0 did a lot of work to optimize memory usage.
- Modify the max write size to 128k; limit the rate of Forget requests. #533
- If concurrent write operations are more than 256, bufferPool is no longer used. #538
- Eliminates the unnecessary memory allocations for Inode struct by using proto InodeInfo struct directly. #545
- Use sync Pool and rate limiter to limit the memory usage. #639
- Use total count instead of rate. #642
- Uses sorted extents instead of BTree. #646
master
: Add set/get deleteBatchCount interfaces. #608master
: Delete partitions on DataNode/MetaNode concurrently in decommission action. #724master
: Added api for getting & setting parameters for batch deleting extents. #726metanode
: Filter inode candidates that will be sent to a partition in batch-inode-get to save memory. #481metanode
: Batch delete inode, unlink&evict dentry. #586datanode
: Prioritize healthy data node when sending message to data partition. #562metanode
datanode
: Check expired partition before loading. #624object
: Implemented several conditional parameters in GetObject action. #471object
: Making S3 credential compatible with earlier version of CubeFS. #508object
: Totally support presigned url; Partial support user-defined metadata; Making ETag versioned. #540object
: CopyObject across volumes; Support coping directory; Support coping xattr. #563object
: Parallel downloading. #548object
: Add 'Expires' and 'Cache-Control' in putObject/copyObject/getObject/headObject. #589object
: Modify part system metadata. #636fuse
: Decrease request channel size to save memory. #512fuse
: Let the client daemon inherit all the environment variables but not just PATH. #529fuse
: Introduces a debug interface /debug/freeosmemory to trigger garbage collection manually. #539fuse
: Add configuration MaxCPUs. #546fuse
: View client log through http proto. #552fuse
: Support modifying ModifyTime of inode by setAttr. #733deployment
: Add log map in docker-compose.yaml. #478deployment
: Introduce nginx to improve compatibility. #534test
: Add testing script for S3 APIs in python. #514 #595monitor
: Add detailed vol info in grafana dashboard. #522cli
: Some new features for CLI tool: bash completions, configuration setting, decommission, diagnose etc. #555 #695
metanode
: Accelerate deletion. #582 #600datanode
: Limit replica number of data partition to at least 3. #587datanode
: Set if auto-repair by http interface. #672metanode
datanode
fuse
: Optimizations on port checking. #543 #531metanode
datanode
: Validate creating partition request from master. #611object
: Refactor copy object function. #563fuse
: Replace process of token validation from meta wrapper to client. #498fuse
: Adjust rlimit config for client. #521
metanode
: When overwriting a file, if the inode in a dentry is updated successfully, ignore unlink and evict inode errors. #500metanode
: Fix incorrect metrics collection for volume usedGB in monitor system. #503metanode
: VolStat can correctly update when force update metaPartitions frequently. #537metanode
: The MaxInodeID of meta partition is not synchronized when recovered from snapshot. #571metanode
: Free Inodes by raft protocol. #582metanode
: Painc with deleteMarkedInodes. #597object
: Fix the parsing issue of the two preconditions of If-Match and If-None-Match. #516object
: Change the time format in list buckets API to UTC time. #525object
: Fix xml element of DeleteResult. #532object
: Empty result in list operation when run in parallel with delete. #509object
: Change from hard link to soft link in CopyObject action. #563object
: Solved parallel-safety issue; Clean up useless data on failure in upload part. #553object
: Fixed a problem in listing multipart uploads. #595object
: Solve the problem that back-end report “NotExistErr” error when uploading files with the same key in parallel. #685fuse
: Evict inode cache when dealing with forget. #523
- Update related documents of ObjectNode. #554
- Added user and CLI introduction, synchronize documentation according to the latest code. #564
- Added F&Q section. #573
- Multi-Zone replication & Create volume on a specified zone. #407 #416
- Support token authentication for readwrite-mount & readonly-mount of fuse client. #435
- A command line tool for cluster operations. #441
- Implemented user security and authorization system to improve resource access control. #441
- Implemented extend attributes (xattr) for metadata and posix-compatible file system interface (mountable client). #441
- Support folder operations in S3 APIs. #450
- Reduce blocking under concurrency. #458
- Implemented more Amazon S3-compatible object storage interfaces to improve compatibility. #441
- Make replicas of the data partition a specific option when creating volume. #377
- If meta node reaches threshold,set meta partition status to readonly. #411
- Add cluster status API. #457
- Checks only file type instead of the whole mode. #381
- Make
followerRead
a client specific option. #382 - Support command line argument for fuse client. #418
- Introduce disable
dentry-cache
to client option. #453 - Filter target meta partitions in batch iget. #472
- Yum tool for deploying CubeFS cluster. #385
- Fix the signature algorithm issues. #369 #476
- Avoid inode unlink due to net error. #402
- A map structure locked during serialization. #413
- Wait for data sync in close syscall. #419
- Fix empty result on list objects. #433
- Set lookup valid duration for newly created file. #437
- Fix
iget
error due to metapartition split. #446 - Fix mount fail when volume is full. #453
- Fix offline strategy for raft peers of
data partition
andmeta partition
. #467
- Add guide for running CubeFS by yum tools. #386
- Update FUSE client mount options. #439
- Add documentation for client token. #449
- Support building docker image that contains both cfs-server and cfs-client. #353
- Only one replica of meta partition can be taken offline at the same time. #345
- Check if server port is open before raft leader change. #348
- Solved several issue in signature algorithm version 2. #357
- Solved the issue related to copying files across folder through object storage interface. #361
- Update dashboard configuration of grafana. #347
- Unified the configuration of master address and listening port in documentation. #362
- Added benchmark data and guidelines for deploying CubeFS cluster with Helm in README file. #350
- Add a general Authentication & Authorization framework for CubeFS. commit
- Object storage interface. Add ObjectNode to provide S3-compatible APIs. commit
- Check disk path size in run-docker script. commit
- Support building CubeFS docker image that contains cfs-server and cfs-client. commit
- Add go test in docker-compose. commit
- Add authorization to master api getVol. commit
- Support building under the Darwin(Apple MacOS) and Microsoft Windows operating system environment. commit
- Set DataNode disk size in docker script to be compatible with lower version of
df
. commit - The
reservedSpace
parameter invalid when gt 30GB. commit - The mtime of parent inode does not change when create or delete a
dentry
. commit - MetaNode
opResponseLoadPartition
removes duplicate locks. commit - MetaNode leak memory on DeleteMetaPartition operator. commit
- Change configuration file of Master daemon. commit
- Remove unnecessary function in raft store. commit
- Change metaNode
loadSnapshotSign
command aboutinodeCount
anddentryCount
. commit - Rename
rack
tocell
. commit - Improve meta partition replicas verification by
MaxInode
andDentryCnt
. commit - Change parameter name in MetaNode configuration file. commit
- Using current applyID to replace snapshot applyID in MetaNode LoadMetaPartition response action. commit
- Delete data partition and meta partition synchronously. commit
- Add design and user guide document for AuthNode. commit
- Update configuration file sample in document. commit
- Add design and user guide document for object subsystem. commit
- Add IO, small file and metadata performance benchmark data to document. commit
- Datanode : support read from follower and if packet is tinyExtent,then do write it once commit
- Vol add followerRead field to support reading data from foll owner commit
- Support read from raft follower instead of just leader commit
- Support to modify whether vol supports reading data from a replica commit
- Introduce read and write iops rate limit commit
- Add metrics commit
- If vol has been marked deleted,data partitions, meta partition information reported by heartbeat will no longer be accepted commit
- Use static ip for meta and data nodes commit
- Improve debug environment using docker commit
- Support custom meta node reserved memory commit
- Data partition and meta partition must have three replicas except reducing replicas to 2 commit
- Adjust demo config parameters commit
- Update grafana dashbord for disk error metric commit
- OpFollowerRead if read eof,return error commit
- Get follower read option in init commit
- Stream traverse process never gets triggered in some situation commit
- Check LoadConfigFile before starting daemon commit
- Return error from function LoadConfigFile to the caller commit
- ExtentStorage engine :autoComputeCrc compute crc error commit
- Clean up async delete process of metanode commit
- Set default port to non-system reserved port commit
- Leader change not warning on raft commit
- Remove go module files for now commit
- Clean up response of get all inodes info commit
- Master, DataNode and MetaNode Fix dp or mp offline process commit
- Use AddNodeWithPort replace AddNode,and delete AddNode API commit
- Delete reserved space on DataNode config file commit
- Refine labels of the disk error metric commit
- Optimize auto compute crc commit
- Add use cases commit
- Introduce writecache mount option. commit
- Introduce keepcache mount option. commit
- Add admin API for get all meta parititons under vol commit
- Support for truncating raft log. commit
- Dynamiclly reduce the num of replicas for vol. commit
- The specified number of replica num is supported when creating vol. commit
- Feature: daemonize server commit
- Support log module change loglevel on line. commit
- Extent_store LoadTinyDeleteFileOffset return s.baseTinyDeleteOffset. commit
- Enable async read by default. commit
- Improve log message details for clientv2. commit
- Compatible with string when get bool config. commit
- Add performance tracepoint for clientv2. commit
- Align out message buffer size with max read size. commit
- For splitting meta partition,updating meta partition and creating new meta partition are persisted within a single transaction. commit
- If metanode used memory is full,then the partition must set to readonly. commit
- Set report time to now after creating data partition. commit
- Set writeDeadLineTime to one minute,avoid metanode gc reset conn which snapshot used as much as possible. commit
- Add raft monitor. commit
- If the creation of a data partition fails, the successfully created replica is deleted. commit
- If the creation of a meta partition fails, the successfully created replica is deleted. commit
- Add unit test case. commit
- Passing create data partition type to datanode. commit
- If create dp is normal,must start Raft else backend start raft. commit
- The tickInterval and electionTick support reading from a configuration file commit
- Fix: add del vol step after ltptest in travis-ci test script. commit
- Clientv2 file handle memory leak. commit
- Redirect stderr to an output file in daemon. commit
- Exclude data partition only when connection refused. commit
- When delete DataParittion,the forwardToLeader mayBe painc. commit
- Metanode load error mayme shield. commit
- Truncate raft corrupt data. commit
- When meta node memory usage arrive threshold, split meta partition occurred dead lock. commit
- SplitMetaPartition race lock with updateViewCache. commit
- After the vol is created and before the heartbeat report, the status of the data partition is set to read only after the check dp operation is performed. commit
- When disk error,the raft cannot start on new data server first. commit
- OpDecommissionDataPartition delete dataPartition on new server. commit
- Datanode may be painc. commit
- Datanode auto compute crc. commit
- DataNode: when dataPartition load,if applyId ==0 ,then start Raft. commit
- The reported data partition usage decreased, and the statistical usage did not decrease accordingly. commit
- Docker metanode.cfg add totalMem parameter. commit
- Datanode deadlock on deletePartition. commit
- DataNode may be painc. commit
- Exclude dir inode in the orphan list. commit
- Evict inode cache after successful deletion. commit
- The actual reduction in the number of replicas exceeds the expected reduction in the number of replicas. commit
- Compatible with old heartbeat mode, old heartbeat mode does not report volname. commit
- Metanode mistakenly delete empty dir inode. commit
- Treat ddelete not exist error as successful. commit
- Fuse directIO read size can exceeds buffer size. commit
- Fix Datanode retain RaftLog commit
- Fix: Datanode: when tinyExtentRepair auto repair,it has been commit
- Fix: Storage :when write tinyExtent,if offset!=e.datasize,return error commit
- The buf is not reset and the old data is repeatedly written to the ump warning log, resulting in the overall performance degradation of the cluster commit
- DataPartitionMap occurred error which is concurrent map iteration and map write commit
- Client gets stale file size if streamer is auto evicted. commit
- Update export init for consul register commit
- Err is shadowed in server main function commit
- If master only create DataPartitionCnt is 10,then cannnot mount commit
- Update export init for consul register commit
- DataPartition disk error ,not recvoery raft log on new datanode commit
- Datanode register hang bug commit
- Sdk When creating a datapartition, select the datapartition retry strategy. commit
- Refactoring SDK: when write datapartition num greater 10 ,then trust master. commit
- When disk error,the datapartition recover only recover avali data on tinyExtent. commit
- Decommission Meta or Data Partition must sync response to master. commit
- Synchronized decommission the data partition. commit
- Datanode api /partition add raftStatus. commit
- StartRaftLoggingSchedule not use goroutine. commit
- Metanode must config totalMem. commit
- Sync tinyDeleteExtent time change to 1 days. commit
- Change partitionId varliable to partitionID. commit
- Datanode delete SnapshotFile Pool. commit
- Exporter add ump. commit
- DataNode: compatible old dataPartition Meta info. commit
- Add ltptest log. commit
- Docker metanode config.json change totalMem to 6GB. commit
- Datanode create dataPartition select disk function change. commit
- Datanode start must start StartRaftLoggingSchedule func. commit
- Master create vol min default DataPartition set to 10. commit
- Docker: run ltptest print errorinfo. commit
- When load dp,if dp status is normal,then start raft ,else wait snapshot has recover. commit
- Datanode delete unused func. commit
- If not config warnLogDir,then donnot write umplog to disk. commit
- DataNode: if not config raftDir,then not start server. commit
- Add log on metanode delete extent. commit
- Doc :delete warnLogDir config. commit
- Keep mount point in error stataus when client is killed commit
- Log checkroration checkTime change to 1 second commit
- Metanode change deleteDentry or deleteInode api to log.LogDebugf commit
- Increase the judgment condition of disk error,add syscall.EROFS commit
- Sync code from git.jd.com/cubefs/cubefs commit
- When datanode or metanode start,must compare partition with master and gofmt project. commit
- Datanode add backend check disk status func. commit
- Update rocksdb build depends. commit
- Metanode panic when deleting meta partition. commit
- When raft member become leader,must apply from appliyID to commitID. commit
- Clientv2: add dentry cache. commit
- Vendor: introduce jacobsa daemonize package. commit
- Client: start client as a daemon. commit
- Clientv2: daemonize client.commit
- Update docker helper script. commit
- Data sdk: use ip address instead of partition id to rule out unavailable data partitions. commit
- Add monitor add prometheus, grafana in docker helper script.commit
- Add GOPATH check in build.sh. commit
- Integrated rocksdb compilation. commit
- Update makefile. commit
- Docs: start client as a daemon. commit
- Update docs: add make build in readme and docs. commit
- Use one thread to send and recive from follower on primary backup replication protocol .commit
- Change random write raft serialize not use json. commit
- Fix: when datanode is killed, maxBaseExtentId is corrupted.commit
- Once vol is created successfully, the state of the data partition it has should be writable. commit
- Metanode: fix create dentry exist error. commit
- Add privileged for docker-compose server node. commit
- The retain space field in config file is not allowed to be larger than the default value. commit
- Fix: when data node gets a disk error, the disk status does not changed. commit
- Fix: build error. commit
- Fix build with lua bug. commit
- Check whether vol is valid. If there is no meta partition with start being. commit
- Build: update build.sh for user permission. commit
- Clientv2: fix to get the latest file size. commit
- When create dp failed,then Check all dp host if the host is alive. commit
- Fix bug: when read tinyDeleteFile return eof error. commit
- Build: fix multi users build bug. [commit](https://github.com/cubefs/cubefs/commit/a885777b9da97957cc1eddfee102be3707763b64[commit](https://github.com/cubefs/cubefs/commit/a885777b9da97957cc1eddfee102be3707763b64)
- Build: fix no root user permission bug. commit
- Init replicas info from create data partition request. commit
- Add cfs-base dockerfile. commit
- Specification code and metanode config file. commit
- Add go mod. commit
- Docs: update according to recent changes. commit
- If warnLogDir is not specified in the config files, server or client will return an error. commit
- Rename RestSize to ReservedSize. commit
- Print more detailed error information when server starts failed. commit
- Remove docker client security opt. commit
- Add disk config detailed description. commit
- By default create 10 data partitions and 3 meta partitions when creating volume. commit
- Doc: add clarification that resource manager is AKA master. commit
- Normalize exporter cluster name. commit
- Set the value of rack for data node to default. commit
- Refine decommission disk. commit
- Change document about metanode config. commit
- Doc: update docker helper section. commit
- Update quick-start-guide.rst. commit
- Change dataPartition IntervalToUpdateReplica to 600 seconds. commit
- Add third-party directory. commit
- Write data partition decommission url message to log. commit
- Go fmt project. commit
- Data sdk: check amount of writable data partitions when mount. commit
- Delete third-party directory. commit
- Update docs: remove server and client. commit
- Pass the hosts to data node when creating the data partition. commit
- Warn log message unicode encoding is converted to utf8. commit
- Ignore generated build files. commit
- Build: update build.sh and Makefile. commit
- Change packet attr func and ReadTinyDelete impl log. commit
- Add log when read tinyDeleteFile failed. commit
- Add log when streamRead and ExtentRepairRead. commit
- Remove unused source code. commit
- Build: update build.sh. commit
- Update README.md. commit
- Build: remove client unused depends. commit
- Client: enable support to fifo and socket file types. commit
- Clientv2: use jacobsa fuse package instead of bazil. commit
- Docker: introduce docker compose to create a local testing CubeFS cluster. commit
- Meta: update atime in inode get operation. [metanode] commit
- Fix: potential panic if send returns nil resp with no error. [client] commit
- Fix: raft election takes a long time and timeout; issue a panic if raft server is down. [raft] commit
- Fix: potential deadlock if applyHandler. [master] commit
- Fix: put vol to cache after it is persistent. [master] commit
- Fix: partition is nil when apply remove raft node. [datanode] commit
- Fix: metanode painc. [metanode] commit
- Fix: panic when pprof does not start. commit
- Sdk: retry if mount failed in case master is unavailable temporarily. commit
- Build: add verbose build info. commit
- Master: introduce data partition over-provision. commit
- Master: reserve writable data partition amount according to capacity instead of a const. commit
- Monitor: Use UMP performance monitor if exporter is not enabled. commit
- Rename the repository from cfs to cubefs.
- Use own errors module instead of juju errors due to license incompatibility.
- Metanode: Change not raft leader error to tryOtherAddr error.
- Master: Partition recovered but the status not changed.
- Datanode: Report to client with proto.OptryAgain resultcode when datapartition does not exsit.
- Raft: A member must apply playback from old apply id to commit id after elected as leader.
- Metanode: generate identical inode number under extreme conditions. commit
- Master: Add ump warn packet.
- Master: Remove redundant calling of loadMetaData method.
- Master: Reload meta data after leader changed.
- Master: make dataPartition disk Path persistent. commit
- Master: Volume creation supports specifying the amount of meta partitions. commit
- Metanode: Add totalMem in configFile.
- Datanode: Change default disk reserved space.
- Datanode: Add volname in heartbeat report.
- Raft: Use raft.ErrNotLeader instead of ErrNotLeader.
- Client: Create a dummy node instance if inode does not exist.
- Client: Add UMP monitor alarms for read/write/fsync errors.
- Client: Suppress some error messages. commit
- Log: Automatically create subdirectory under the log directory.
The initial release version.