A few months ago I downloaded the scikit-learn package for Python, for those of you who might not be aware, scikit-learn is a powerful yet very simple package useful to apply machine learning. Basically they give you “all” the algorithms you may need and you “only” have to get the data and make it ready to feed into the scikit-learn algorithm.
Anyway, I only recently had time to check it out and write some code, furthermore only recently I found a great site full of sample datasets dedicated to machine learning (check it out here). Since the data from this great site is more or less in the right shape for being ready to import (.txt files or similar) the only task left to the programmer is to put it into numpy arrays.
This is one of my first attempt at generating a “real” machine learning program.
I won’t post the pre-settings script since it is pretty boring, instead I’ll briefly describe the pre-setting: basically, using a simple script I’ve splitted the training samples from the database in two .txt files trainingP.txt and target.txt respectively. TrainingP.txt contains the questions (the hands of poker) and target.txt contains the answers (=the score, that’s to say poker, full house etc… coded into numbers according to the description given in the database description file).
Below is the ml script: it is composed of 3 parts: setting, training and testing
setting: import the training sets and fit them into numpy arrays of the correct shape
training: fit the data into the model using scikit-learn
testing: test the algorithm and check what it has learned so far! Some statistics are printed, for reference. Check the results below!
import numpy as np | |
import sklearn | |
import os | |
# Import of support vector machine (svm) | |
from sklearn import svm | |
"""---------------------------- SETTING -----------------------------------""" | |
# Changing current working directory | |
os.chdir("C:\\") | |
# Set up ML and scikit-learn | |
clf = svm.SVC(gamma=0.01, C=100) | |
# Training data (digits: questions, target: answers) | |
digits = open("trainingP.txt").read().split("\n") | |
target = open("target.txt").read().split("\n") | |
# Initializing numpy arrays | |
digitsD = np.array(np.fromstring(digits[0],sep=",")) | |
# Completing numpy array for digits | |
for line in digits[1:]: | |
try: | |
sample = np.fromstring(line,sep=",") | |
digitsD = np.vstack((digitsD,sample)) | |
except: | |
pass | |
# Converts strings into integers (not necessary if you | |
# choose option 1 below) | |
t = [int(i) for i in target] | |
# 1 : All strings | |
# 2 : All integers | |
t = np.array(target) # 1 | |
#t = np.array(t) # 2 | |
"""---------------------------- TRAINING -----------------------------------""" | |
# Model fitting (training) | |
x,y = digitsD,t | |
print("Training...") | |
clf.fit(x,y) | |
# Answer: 1 | |
print("Prediction:",clf.predict(np.array([3,11,1,9,2,5,3,5,4,1]))) | |
"""---------------------------- TESTING -----------------------------------""" | |
# Load questions and answers | |
rawQuestions = open("testQuestions.txt").read().split("\n") | |
rawAnswers = open("Answers.txt").read().split("\n") | |
# Initialize numpy arrays | |
questions = np.array(np.fromstring(rawQuestions[0],sep=",")) | |
answers = np.array(rawAnswers) | |
for q in rawQuestions[1:]: | |
try: | |
q1 = np.fromstring(q,sep=",") | |
questions = np.vstack((questions,q1)) | |
except: | |
pass | |
# Testing function | |
def tryAnswer(qu,an): | |
machineAnswer = clf.predict(qu) | |
if machineAnswer == an: | |
#print("Correct!") | |
return True | |
else: | |
#print("Wrong!") | |
return False | |
print("Testing...") | |
# Start the test | |
j = 0 | |
right = 0 | |
wrong = 0 | |
while j < 10000: | |
result = tryAnswer(questions[j],answers[j]) | |
if result: | |
right += 1 | |
else: | |
wrong += 1 | |
j += 1 | |
# Print some statistics | |
print("Statistics, correct rate:",right/(right+wrong)) |
So far the best score is 56.4%, I wonder if I did everything correctly! Anyway soon I will post a script with better results and achievements!
# >>> ================================ RESTART ================================ | |
# >>> | |
# Training... | |
# Prediction: ['0'] | |
# Testing... | |
# Statistics, correct rate: 0.564 | |
# >>> |
Below is the citation of the source of the database, according to their citation policy.
The name of the dataset is Poker Hand and it is from:
Bache, K. & Lichman, M. (2013). UCI Machine Learning Repository[http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.
you code doesnt work
ReplyDeleteI'm certainly very happy to read this blog site posts which carries plenty of helpful data, thanks for providing such information.https://juegosdecasinoonlinecolombia.com.co
ReplyDeleteYour blog provided us with valuable information to work with. Each & every tips of your post are awesome. Thanks a lot for sharing. Keep blogging.. LorenPoker
ReplyDeleteThis comment has been removed by the author.
ReplyDeleteI’ve been searching for some decent stuff on the subject and haven't had any luck up until this point, You just got a new biggest fan!.. http://103.9.100.48
ReplyDeleteThis is a brilliant blog! I'm very happy with the comments!.. http://216.83.47.66
ReplyDeleteNice to be visiting your blog again, it has been months for me. Well this article that i've been waited for so long. I need this article to complete my assignment in the college, and it has same topic with your article. Thanks, great share. pkv games
ReplyDeleteThanks for the blog filled with so many information. Stopping by your blog helped me to get what I was looking for. Now my task has become as easy as ABC. black satta
ReplyDeleteI have bookmarked your blog, the articles are way better than other similar blogs.. thanks for a great blog! DominoQQ Virtual Namun Asli
ReplyDeleteExcellent .. Amazing .. I’ll bookmark your blog and take the feeds also…I’m happy to find so many useful info here in the post, we need work out more techniques in this regard, thanks for sharing. bet pt
ReplyDeleteIt is a fantastic post – immense clear and easy to understand. I am also holding out for the sharks too that made me laugh. joker หน้าเว็บ
ReplyDeleteเบทฟิก You possess lifted an essential offspring..Blesss for using..I would want to study better latest transactions from this blog..preserve posting..
ReplyDeleteI found your this post while searching for some related information on blog search...Its a good post..keep posting and update the information. 먹튀검증사이트
ReplyDeleteThe best development environment for businesses looking to hasten the delivery of flawless, high-quality apps. With its robust debugging tools, image annotation for farming, and seamless connectivity with well-known development environments, this team can work more productively and achieve exceptional outcomes. So check out the useful link below for more helpful information!
ReplyDeleteMachine learning in poker hands recognition demonstrates how AI can analyze patterns for better outcomes. Similarly, christmas outfits 2024 reflect a pattern of festive creativity and style. Whether designing a smart algorithm or a stunning holiday wardrobe, both require attention to detail and innovation for the perfect result.
ReplyDelete