Releases: byzer-org/byzer-lang
MLSQL v1.1.6 Release
We are glad to announce that MLSQL v1.1.6 is released. There are huge new features, bug fix and code refractor in this half month' development:
Improvements:
- PR-747: Load balance, Dynamic Resource Allocation,Instances manager based on labels。More details please check:mlsql-cluster doc.
- PR-735: Supports Using load statement to view API doc and Configuration of MLSQL。
- PR-728: Upgrade to latest carbondata
- PR-781: A new transformer supports complex father-children relation computing which is hard if you use SQL to do this.
- PR-794: Provides shell/cmd script to make you compile/package MLSQL more easy.
BugFix:
- PR-773: The last As keyword is upper will cause exception。
- PR-777: Save partition by can not support multi columns.
- PR-760: conflict in multi kafka-client.(MLSQLsupport kafka 0.8/0.9/1.x at the same time)
- PR-757: MLSQL Auth bug
- PR-795: PS cluster enabled will fail the exetuor in yarn mode.
Code refractor:
MLSQL v1.1.5 Release
We are glad to announce that MLSQL-v1.1.5 version released.
V1.1.5 is a maintainable release of 1.1.x which is based on branch b1.1.5.
This release we have done a lot of things to make MLSQL work on spark 2.4.0. The changes applied in this release can be tracked by All Issue tagged with MSLQL-1.1.5 and All PR tagged with MSLQL-1.1.5
MLSQL 1.1.5 still not contains following jars by default:
If you want to analyze text content, please download it manually and add them to --jars
.
We plan to release v1.1.6 in next two weeks.
Usage:
- download distribution based on your spark version.
- unzip and make sure SPARK_HOME have be confgiured.
- use
./start-local.sh
to start MLSQL server.
MLSQL v1.1.4 Release
V1.1.4 is a maintainable release of 1.1.x which is based on branch b1.1.4.
The changes applied in this release can be tracked by All Issue tagged with MSLQL-1.1.4 and All PR tagged with MSLQL-1.1.4
Notice that we do not tag the issue before, so there are many bugfix and features may not be tracked by the tag MLSQL-1.1.4. However, since this release, we will make sure that all Issue And PR tracked.
MLSQL v1.1.4 still not contains by default:
If you want to analyze text content, please download it manually and add them to --jars
.
We plan to release v1.1.5 in next two weeks.
Usage:
- download distribution based on your spark version.
- unzip and make sure SPARK_HOME have be confgiured.
- use
./start-local.sh
to start MLSQL server.
v1.1.3-dev(dev preview version)
StremaingPro V1.1.3 Released
There are really a bunch of features added to this release.
Main features:
- MLSQL syntax check, any grammar error will be thrown as an exception.
- Enhance the stream job support mlsql-stream
- JDBC in stream/bach mlsql-jdbc
- Scala/Python script UDF support mlsql-script-udf
- include statement make MLSQL snippet is reusable.
- job manager in build-in web UI.
StreamingPro v1.1.2
功能特性:
NLP分词相关的功能依赖ansj分词包,可到最早的那个版本中下载附件:https://github.com/allwefantasy/streamingpro/releases
StreamingPro v1.1.1版本发布
StreamingPro 经过实际生产环境的验证,添加了很多针对算法的功能,近期大部分PR都是针对算法的。
更多细节可参看PR列表
有用的功能有:
- PythonAlg模块具备同步资源文件的能力自定义python算法支持自定义资源文
- 更新了交互界面,使用更友好。web交互
- 强化了词向量预处理功能,可以更好的接入深度学习模型SQLWord2VecInPlace 功能扩展
- 可以对训练集各个分类的数据进行统计,计算分类权重,可以查看应训练好的sklearn模型参数,支持混淆矩阵等
- 大量矩阵运算的UDF函数加入
- 准备了两个新的模块,方便支持spark 2.3.x
- 添加了图片数据源,实现了java/opencv对图片的缩放功能
- more ....
NLP分词相关的功能依赖ansj分词包,可到之前发布的版本的附件中下载:https://github.com/allwefantasy/streamingpro/releases
StreamingPro v1.1.0版本发布
StreamingPro支持通过配置文件的方式编写Spark应用,包括批处理和流式。
StreamingPro同时也支持将Spark Application作为服务,然后暴露出Rest接口,使用SQL Script作为交互格式。目前支持:
- 爬虫,通过SQL脚本完成爬取网页功能。
- 支持流式计算SQL脚本
- 支持批处理SQL脚本
- 支持用SQL做算法训练和预测,以及部署成API服务的能力。
- 剥离dl4j,减少jar包大小
- 支持carbondata 1.3.1,支持流式写入carbondata
- 启动时通过--jars带上 ansj_seg-5.1.6.jar 和nlp-lang-1.7.8.jar 可在MLSQL中运用分词抽词工具
注意: 该预编译版本仅测试过spark 2.2.0。如果需要2.1.0 或者2.3.0 则需要自己进行编译。