The program reads in a dataset into a pandas dataframe, and uses the train_test_split function in the sklearn library to split the data into training and test sets. The code goes thus :
import pandas as pd
#import the pandas dataframe and alias it as pd
from sklearn.model_selection import train_test_split
#import the train_test_split function
housing_df = pd.read_csv('housing price.csv')
#read in the housing data
features_df = df.iloc[:,1:]
#seperate the features from the label ;
target_df = df.iloc[:,0]
#put the label into a seperate dataframe as well.
X_train, X_test, Y_train, Y_test = train_test_split(features_df, target_df, test_size = 0.1, random_state = 1)
#uses tuple unpacking to randomly assign the data each of the 4 variables.
#Test size is test percent of the entire dataset
Learn more :https://brainly.com/question/4257657?referrer=searchResults