Hide keyboard shortcuts

Hot-keys on this page

r m x p   toggle line displays

j k   next/prev highlighted chunk

0   (zero) top of page

1   (one) first highlighted chunk

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

# 

# Licensed to the Apache Software Foundation (ASF) under one or more 

# contributor license agreements. See the NOTICE file distributed with 

# this work for additional information regarding copyright ownership. 

# The ASF licenses this file to You under the Apache License, Version 2.0 

# (the "License"); you may not use this file except in compliance with 

# the License. You may obtain a copy of the License at 

# 

# http://www.apache.org/licenses/LICENSE-2.0 

# 

# Unless required by applicable law or agreed to in writing, software 

# distributed under the License is distributed on an "AS IS" BASIS, 

# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 

# See the License for the specific language governing permissions and 

# limitations under the License. 

# 

 

""" 

RDD-based machine learning APIs for Python (in maintenance mode). 

 

The `pyspark.mllib` package is in maintenance mode as of the Spark 2.0.0 release to encourage 

migration to the DataFrame-based APIs under the `pyspark.ml` package. 

""" 

# MLlib currently needs NumPy 1.4+, so complain if lower 

 

import numpy 

 

ver = [int(x) for x in numpy.version.version.split('.')[:2]] 

29 ↛ 30line 29 didn't jump to line 30, because the condition on line 29 was never trueif ver < [1, 4]: 

raise RuntimeError("MLlib requires NumPy 1.4+") 

 

__all__ = ['classification', 'clustering', 'feature', 'fpm', 'linalg', 'random', 

'recommendation', 'regression', 'stat', 'tree', 'util']