Alink version 1.1.0
Enhancements & New Features
- Improvement of UDF/UDTF operators, Java and PyAlink have consistent usage and behaviors. #32 #44.
- Publish to maven central and PyPI.
- Support Flink 1.10 and Flink 1.9. #46
- Support more Kafka connectors. #41.
API change
- Modify Naive Bayes algorithm as a text classifier. #47
- Modify and enhance the parameter, model in QuantileDiscretizer, OneHotEncoder and Bucketizer. #48
Documentation
Fix & Refinements
- Fix the problem in LDA online method and refine comments in FeatureLabelUtil. #29
- Fit the bug that initial data of KMeansAssignCluster is not cleared. #31
- Fix the int overflow bug in reading large csv file, and dd test cases for CsvFileInputSplit. See #27
- Cleanup some code. #15
- Remove a redundant test case whose data source is unaccessible. see #28
- Fix the NEP in PCA. see #42
PyPI support
- Support PyAlink installation using
pip install pyalink
Maven Dependencies
Alink is now synchronized to the Maven central repository, which you can easily add to Maven projects.
With Flink-1.10
<dependency>
<groupId>com.alibaba.alink</groupId>
<artifactId>alink_core_flink-1.10_2.11</artifactId>
<version>1.1.0</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-streaming-scala_2.11</artifactId>
<version>1.10.0</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-table-planner_2.11</artifactId>
<version>1.10.0</version>
</dependency>
With Flink-1.9
<dependency>
<groupId>com.alibaba.alink</groupId>
<artifactId>alink_core_flink-1.9_2.11</artifactId>
<version>1.1.0</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-streaming-scala_2.11</artifactId>
<version>1.9.0</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-table-planner_2.11</artifactId>
<version>1.9.0</version>
</dependency>