v3.2

关于 SequoiaDB
快速入门
安装
基本操作
- CRUD操作
  - 插入
  - 查询
  - 更新
  - 删除
- 索引
- 全文索引
- 聚集
- 事务
- 数据分区
数据模型
- 概述
- 文档
- 数据类型
- 集合
- 集合空间
- 大对象
- 自增字段
SQL引擎
- PostgreSQL实例组件
- MySQL实例组件
  - 概述
  - 安装
    - 安装部署
    - 高可用
    - 升级
    - 卸载
  - 使用
  - 配置
  - 工具
    - 实例管理工具
  - 数据类型映射表
  - 错误码
  - 注意事项
S3引擎
系统架构
数据库管理
连接器
- Hadoop
  - 部署
  - 集成
- Hive
- Spark实例组件
  - 概述
  - 安装
  - SparkSQL
- PostgreSQL实例组件
  - 概述
  - 部署
  - 连接
  - 开发
驱动
参考手册
- SequoiaDB Shell方法
  - 命令位置参数
  - Global
    - help()
    - print()
    - sleep()
    - forceGC()
    - showClass()
    - getErr()
    - getLastErrMsg()
    - getLastErrObj()
    - getLastError()
    - setLastErrMsg()
    - setLastErrObj()
    - setLastError()
    - getExePath()
    - getRootPath()
    - getSelfPath()
    - import()
    - importOnce()
    - jsonFormat()
    - traceFmt()
  - Sdb
    - Sdb()
    - analyze()
    - backup()
    - cancelTask()
    - close()
    - createCataRG()
    - createCoordRG()
    - createCS()
    - createDataSource()
    - createDomain()
    - createProcedure()
    - createRG()
    - createSpareRG()
    - createUsr()
    - dropCS()
    - dropDataSource()
    - dropDomain()
    - dropUsr()
    - eval()
    - exec()
    - execUpdate()
    - flushConfigure()
    - forceSession()
    - forceStepUp()
    - getCataRG()
    - getCoordRG()
    - getCS()
    - getDataSource()
    - getDomain()
    - getRG()
    - getSessionAttr()
    - getSpareRG()
    - invalidateCache()
    - list()
    - listBackup()
    - listCollections()
    - listCollectionSpaces()
    - listDataSources()
    - listDomains()
    - listProcedures()
    - listReplicaGroups()
    - listSequences()
    - listTasks()
    - loadCS()
    - removeBackup()
    - removeCataRG()
    - removeCoordRG()
    - removeSpareRG()
    - removeProcedure()
    - removeRG()
    - renameCS()
    - resetSnapshot()
    - reloadConf()
    - deleteConf()
    - setSessionAttr()
    - snapshot()
    - startRG()
    - stopRG()
    - sync()
    - setPDLevel()
    - traceOff()
    - traceOn()
    - traceResume()
    - traceStatus()
    - transBegin()
    - transCommit()
    - transRollback()
    - unloadCS()
    - updateConf()
    - waitTasks()
  - SecureSdb
  - SdbCS
    - alter()
    - createCL()
    - dropCL()
    - getCL()
    - removeDomain()
    - renameCL()
    - setAttributes()
    - setDomain()
  - SdbCollection
    - aggregate()
    - alter()
    - attachCL()
    - count()
    - createAutoIncrement()
    - createIdIndex()
    - createIndex()
    - createLobID()
    - deleteLob()
    - detachCL()
    - disableCompression()
    - disableSharding()
    - dropAutoIncrement()
    - dropIdIndex()
    - dropIndex()
    - enableCompression()
    - enableSharding()
    - find()
    - findOne()
    - getDetail()
    - getIndex()
    - getLob()
    - insert()
    - listIndexes()
    - listLobs()
    - putLob()
    - remove()
    - setAttributes()
    - split()
    - splitAsync()
    - truncate()
    - truncateLob()
    - update()
    - upsert()
  - SdbCursor
    - close()
    - current()
    - next()
    - arrayAccess()
  - SdbQuery
    - arrayAccess()
    - close()
    - count()
    - current()
    - explain()
    - flags()
    - getQueryMeta()
    - hint()
    - limit()
    - next()
    - query()
    - remove()
    - size()
    - skip()
    - sort()
    - toArray()
    - update()
  - SdbReplicaGroup
    - attachNode()
    - createNode()
    - detachNode()
    - getDetailObj()
    - getMaster()
    - getNode()
    - getSlave()
    - reelect()
    - removeNode()
    - start()
    - stop()
  - SdbNode
    - connect()
    - getHostName()
    - getDetailObj()
    - getServiceName()
    - start()
    - stop()
  - SdbDomain
    - addGroups()
    - alter()
    - listCollections()
    - listCollectionSpaces()
    - removeGroups()
    - setAttributes()
    - setGroups()
    - listGroups()
  - SdbDataSource
    - alter()
  - Oma
    - Oma()
    - addAOmaSvcName()
    - delAOmaSvcName()
    - close()
    - createCoord()
    - createData()
    - createOM()
    - getAOmaSvcName()
    - getIniConfigs()
    - getNodeConfigs()
    - getOmaConfigFile()
    - getOmaConfigs()
    - getOmaInstallFile()
    - getOmaInstallInfo()
    - listNodes()
    - reloadConfigs()
    - removeCoord()
    - removeData()
    - removeOM()
    - setIniConfigs()
    - setNodeConfigs()
    - setOmaConfigs()
    - start()
    - startAllNodes()
    - stopAllNodes()
    - startNode()
    - stopNode()
    - startNodes()
    - stopNodes()
    - updateNodeConfigs()
  - File
    - File()
    - chgrp()
    - chmod()
    - chown()
    - close()
    - copy()
    - exist()
    - find()
    - getSize()
    - getUmask()
    - isDir()
    - isEmptyDir()
    - isFile()
    - list()
    - md5()
    - mkdir()
    - move()
    - read()
    - readContent()
    - readLine()
    - remove()
    - scp()
    - seek()
    - setUmask()
    - stat()
    - write()
    - writeContent()
    - truncate()
  - FileContent
    - clear()
    - getLength()
    - toBase64Code()
  - Cmd
    - Cmd()
    - getCommand()
    - getInfo()
    - getLastOut()
    - getLastRet()
    - run()
    - runJS()
    - start()
  - Remote
    - close()
    - getCmd()
    - getFile()
    - getIniFile()
    - getInfo()
    - Remote()
  - Hash
    - fileMD5()
    - md5()
  - IniFile
    - IniFile()
    - addComment()
    - addSectionComment()
    - addLastComment()
    - setComment()
    - setSectionComment()
    - setLastComment()
    - delComment()
    - delSectionComment()
    - delLastComment()
    - enableItem()
    - disableItem()
    - disableAllItem()
    - getComment()
    - getSectionComment()
    - getLastComment()
    - getValue()
    - setValue()
    - toObj()
    - toString()
    - save()
  - Sdbtool
    - listNodes()
  - Ssh
    - Ssh()
    - close()
    - exec()
    - getLastOut()
    - getLastRet()
    - getLocalIP()
    - getPeerIP()
    - pull()
    - push()
  - System
    - addAHostMap
    - addGroup
    - addUser
    - delAHostMap
    - delGroup
    - delUser
    - getAHostMap
    - getCpuInfo
    - getCurrentUser
    - getDiskInfo
    - getEWD
    - getHostName
    - getHostsMap
    - getIpTablesInfo
    - getMemInfo
    - getNetcardInfo
    - getPID
    - getProcUlimitConfigs
    - getReleaseInfo
    - getSystemConfigs
    - getTID
    - getUserEnv
    - isGroupExist
    - isProcExist
    - isUserExist
    - killProcess
    - listAllUsers
    - listGroups
    - listLoginUsers
    - listProcess
    - ping
    - runService
    - setProcUlimitConfigs
    - setUserConfigs
    - snapshotCpuInfo
    - snapshotDiskInfo
    - snapshotMemInfo
    - snapshotNetcardInfo
    - sniffPort
    - type
  - 辅助类型对象
    - SdbSnapshotOption
    - SdbQueryOption
    - SdbTraceOption
    - CLCount
    - User
    - CipherUser
  - 特殊类型对象
    - BinData
    - BSONArray
    - BSONObj
    - MaxKey
    - MinKey
    - NumberDecimal
    - NumberLong
    - OID
    - Regex
    - SdbDate
    - Timestamp
- 操作符
  - 匹配符
    - 概述
    - $gt
    - $gte
    - $lt
    - $lte
    - $ne
    - $et
    - $mod
    - $in
    - $isnull
    - $nin
    - $all
    - $and
    - $not
    - $or
    - $type
    - $exists
    - $elemMatch
    - $+标识符
    - $size
    - $regex
    - $expand
    - $returnMatch
  - 选择符
    - 概述
    - $include
    - $default
    - $elemMatch
    - $elemMatchOne
  - 函数操作
    - 概述
    - $abs
    - $ceiling
    - $floor
    - $mod
    - $add
    - $subtract
    - $multiply
    - $divide
    - $substr
    - $strlen
    - $lower
    - $upper
    - $ltrim
    - $rtrim
    - $trim
    - $cast
    - $size
    - $type
    - $slice
  - 更新符
    - 概述
    - $inc
    - $set
    - $unset
    - $addtoset
    - $pop
    - $pull
    - $pull_all
    - $pull_by
    - $pull_all_by
    - $push
    - $push_all
    - $replace
  - 聚集符
    - 概述
    - $project
    - $match
    - $limit
    - $sort
    - $skip
    - $group
    - SQL to Aggregate 映射表
  - 字段操作符
    - $field
- SQL语法
- SQL to SequoiaDB 映射表
- 限制
- 错误码
故障排除
- 常见错误
  - 常见错误处理指南
SAC 管控中心
Web服务
- 概述
- WebSphere
  - 部署
  - 连接
  - web应用
- Tomcat
  - 部署
  - 连接
  - web应用
- JBoss
  - 部署
  - 连接
  - web应用
版本信息

概述

Apache的Spark是一个高速的通用集群式计算系统。Spark是一个可扩展的数据分析平台，该平台集成了原生的内存计算，因此它在使用中相比Hadoop 的集群存储来说，会有不少的性能优势。

Apache Spark提供了高级的Java、Scala和Python APIs，同时还拥有优化的引擎来支持常用的执行图。Spark 还支持多样化的高级工具，其中包括了处理结构化数据和SQL的SparkSQL，处理机器学习的MLlib，图形处理的 GraphX，还有SparkStreaming。

Spark组成

在集群中，Spark应用以独立的进程集合的方式运行，并由主程序（driver program）中的SparkContext 对象进行统一的调度。当需要在集群上运行时，SparkContext会连接到几个不同类的ClusterManager（集群管理器）上（Spark 自己的Standalone/Mesos/YARN）, 集群管理器将给各个应用分配资源。连接成功后，Spark 会请求集群各个节点的Executor（执行器），它是为应用执行计算和存储数据的进程的总称。之后，Spark会将应用提供的代码（应用已经提交给 SparkContext 的JAR或Python文件）交给executor。最后，由SparkContext 发送tasks提供给其执行。

关于这个架构的几点介绍：

每一个应用有其独立的Executor进程，这些进程将会在应用整个生命周期内为应用服务，并且会在多个线程中执行任务tasks。这种做法能有效的隔离不同的应用，在调度和执行端都能很好的隔离（每个驱动调度自己的任务，不同的任务在不同的JVM中执行）。但是，这也意味着，如果不写入外部的存储设备，那数据就不能在不同的Spark 应用（SparkContext 实例）之中共享。
Spark 对于下列的集群管理者是不可知的：只要Spark 能请求executor进程，且这些进程之间能互相通信，那么他就相对容易的去运行支持其他应用的集群管理器（如Mesos/YARN）。
因为驱动在集群中调度任务，它将在worker nodes（工作节点）附近运行，最好是在相同的局域网当中。如果你不喜欢远程向集群发送请求，那么最好为驱动打开一个RPC然后让其在附近提交操作而不是在远离worker nodes 处运行驱动。

Spark-SequoiaDB 连接组件

通过使用Spark-SequoiaDB连接组件，SequoiaDB可以作为Spark的数据源，从而可以通过SparkSQL实例对SequoiaDB数据存储引擎的数据进行查询、统计操作。

收起

本页导航

粤ICP备16118040号

回到顶部