Assignment 4 information

Announcements
Files for this assignment
Writeup specifications
Playing with your evaluation function
Webtesters
Opening databases

Announcements

[11/10] Results from the round-robin tournament.
[11/10] If you are interested, you can get all the transcripts from the round-robin tournament.
[11/2] I've released an updated file containing compiled versions of all the student players in the pretournament as of about 17:15 today. (Note that I renamed the old cplayers.com file to cplayers1031.com, but it is still available.)
[10/31] I've released a file containing compiled versions of all the student players in the pretournament as of this evening.
[10/28] There are new versions of connect4.scm (which has the pattern-match procedure that I wrote in class), a4code.com, and play-myc4.com (which print move numbers in the transcript of a game).
[10/19] The Connect 4 Pretournament system is up. The flash applet that Kris wrote still has a few little problems, but the pretournament system itself is up and working.
[10/18] There is a new version of a4code.com (1.3.1) available below. You will need this to use the play-myc4.com code.
[10/16] I've released two compiled support code files that contain my alpha-beta minimax solutions. The first is designed to help you examine why your evaluation function is making its move on a particular board state. The second will allow you to play your evaluation function or to play it against other evaluation functions. See below for details.
[10/15] I've released some code (minimax-test.scm, below) that should help you test your alpha-beta minimax. The comments in this file explain what this code does.
[10/14] Slight change of plans — here is the reference player evaluation function that you must beat for problem 3. This is a weaker player than the previous reference player evaluation function that I had originally intended to use. FYI, the original reference player (playing O) does not beat the current reference player (playing X) in the webtester.
[10/13] I have just released the evaluation function for the reference player that you must beat for problem 3. This is not the entire player — the complete player uses an opening database for the first two moves.
The Problem 3 webtester will first play your evaluation function against the random player and then against this reference player. Your evaluation function will play X against the random player and O against the reference player. Both your evaluation function and the reference player will be used up to depth 4 in the alpha-beta minimax search.
[10/4] Most of the restrictions on Scheme constructs have been removed. You can now use mutators such as set!, delete!, etc., but make sure you know what you're doing! Internal definitions are also now permitted. However, eval remains a prohibited procedure.

Files for this assignment

Stub files:
- assign4p2.scm
- assign4p3.scm
Support code
- connect4.scm
- a4code.com (new version (1.3.2) released Oct 28)
Problem 3 webtester evaluation function: ref-player.scm
Old problem 3 webtester evaluation function: ref-player2.scm
Code to help test your ab-minimax: minimax-test.scm
Code to help test your evaluation function (see the Playing... section for details):
- test-myc4.com
- play-myc4.com (version 2.0.1 released Oct 28)
Some compiled evaluation functions:
- cplayers1102.com — all the student evaluation functions as of about 17:15 on 11/2. See the cplayers1102-readme.txt file for details.
- cplayers1031.com — all the student evaluation functions as of Monday evening 10/31. See the cplayers1031-readme.txt file for details.
- ref-cplayer.com — the evaluation function used by the problem 3 webtester, including the opening database
- ref2-cplayer.com — the other reference player in the pretournament
- whuang-1-cplayer.com — a player I wrote last year, not the most sophisticated, but it did place 13th in the round-robin tournament. It incorporates some ideas of positional play. My evaluation function is run to depth 6 in this player (same as in the pretournament system).
- test1-cplayer.com
- test2-cplayer.com
File for tests you need to do for the problem 7 writeup:
- testboards.scm

Writeup specifications

You should do your writeup in your favorite word processor rather than turning in handwritten work. This makes it easier for Kris to grade, and it generally results in better writeups since it is easy to go back and edit. There is no minimum or maximum length requirement for the writeup; however, I expect that most writeups will be between 5 and 15 pages. This, of course, will depend on how you format the transcripts of the games against the test players, among other things. Keep in mind that Kris isn't going to devote an entire evening to reading your writeup just because you turn in a tome, so make it easy for him to grade by being concise, coherent, and to the point, while still covering everything that you need to cover. Make sure the boards in your writeup are printed in a monospace font so that they are readable.

Here are the details of what your writeup for problem 7 should cover.

Give a prose description of how your evaluation function works. In particular, outline how you calculate the value of the function, what features you looked for and incorporated into your calculations, and why your evaluation function is an accurate indicator of how good the game is for MAX.
If you used an opening database, explain how you created it, i.e., which states did you include in your opening database and how did you select the moves from those states. Do not include your opening database in your writeup, but feel free to use examples from it as necessary in your writeup.
Run your evaluation function on the five boards in the file testboards.scm to the specified depth(s) as given in the file and report which moves it chooses. You should use the test-myc4.com code to run these tests. For each of these boards, do the following analysis:
1. Why did your evaluation function cause minimax to choose each move?
2. Is it a good move? Explain why. If there is a better move, why didn't your evaluation function choose it?
3. For the three boards (b3, b4, and b5) that you run once to depth 4 and once to depth 6, explain why your evaluation function chose the same or a different move at the different depths.
Play your evaluation function:
- as X against the test1 player (which plays O)
- as O against the test2 player (which plays X)
These players are available in the "files" section as compiled players.
ANALYZE the resulting games, pointing out places where your evaluation function made particularly good or bad moves (and why they were good or bad moves). Also, EXPLAIN why your evaluation function caused minimax to make those moves and (for bad moves) why a better move wasn't picked. Include a transcript of these games in your writeup.

Playing with your evaluation function

The test-myc4.com file contains the procedure:
```
    (test-myc4 eval-fn board player depth-cutoff)
```
When you call this procedure with an evaluation function, a board state, the player to move (either the symbol X or O) and the depth cutoff, it will run the evaluation function using my alpha-beta minimax solutions. It can print out a "narration" of the alpha-beta minimax process, and, at the end, it can print out the path corresponding to an optimal game, including the alternatives that were checked.
There are several parameters that control the behavior of test-myc4:
- print-narration — default value is #f
  When #t, this will print a narration of the alpha-beta minimax search process. Be forewarned that this can generate a lot of output!
- print-final-boards — default value is #t
  When #t, prints out the path along the optimal game, along with the alternatives that were searched.
- print-raw-boards — default value is #t
  When #t, it prints the raw board state whenever it prints out a board. This can be useful when you want to do some further investigation on how your evaluation function rates moves that were not chosen.
The play-myc4.com file contains code that will let you play your evaluation function using my alpha-beta minimax solutions. This file contains a single procedure:
```
    (play-myc4 X-player-args... O-player-args...)
```
The play-myc4.com file does not load connect4.scm although it does depend on it!
There are three "kinds" of players that you can use with the play-myc4 procedure which have different numbers of arguments:
- a player procedure, such as the human-player or random-player procedures in connect4.scm
  To specify a player procedure requires 1 argument: the procedure itself
- an evaluation function
  To specify an evaluation function, you need to give 3 arguments: the evaluation function, its depth limit, and its opening database.
  If you aren't using an opening database with the evaluation function, the third argument should be the empty list.
- a compiled player
  To specify a compiled player, you must give 1 argument: the name of the compiled player procedure.
  By convention, the name of the file will be the name of the compiled player procedure (without any version number or scheme extension). For example, the file whuang-cplayer.7.scm would contain the compiled player procedure whuang-cplayer.
The play-myc4 procedure requires that at least one of the players be an evaluation function.
Here are a few examples of calling play-myc4:
- To play X against your evaluation function (c4-eval) to depth 6 without an opening database:
```
    (play-myc4 human-player c4-eval 6 '())
```
- If you have two evaluation functions, you can play them against each other. Here, they are using the same opening database.
```
    (play-myc4 c4-eval1 4 opening-db c4-eval2 4 opening-db)
```
- To play your evaluation function as O to depth 4 without an opening database against the compiled player procedure ref-cplayer:
```
    (play-myc4 ref-cplayer c4-eval 4 '())
```
There are a few variables that control the behavior of play-myc4:
- print-raw-boards — default value is #t
  If #t, it prints the "raw" board state after each move. This is useful if you want to copy one of the board states so you can investigate why your evaluation function made a certain move.
- randomize-children — default value is #f
  If #t, it randomly permutes the children of each node before doing alpha-beta minimax. This is not done in the pre-tournament, nor will it be done in the round-robin tournament. When this is #t, your evaluation function(s) will pick among equally good moves at random. Normally, however (when it is #f), a game between the same two players is always the same.
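
To see why the order of the children matters, here is a minimal sketch. It assumes that ties between equally scored moves go to the first child examined, which is an assumption about the search code and not something the support files document. The name first-best and the (column . score) pairs are made up for this illustration.

```scheme
;; Each candidate is (column . score). Ties go to the first candidate seen.
;; Hypothetical helper for illustration; not part of the support code.
(define (first-best candidates)
  (let loop ((best (car candidates)) (rest (cdr candidates)))
    (cond ((null? rest) (car best))
          ((> (cdar rest) (cdr best)) (loop (car rest) (cdr rest)))
          (else (loop best (cdr rest))))))

;; Columns 3 and 4 both score 5, so the winner depends on the order.
(first-best '((3 . 5) (4 . 5) (1 . 2)))   ; => 3
(first-best '((4 . 5) (3 . 5) (1 . 2)))   ; => 4
```

With a fixed child order the same tie is always broken the same way, so the game repeats exactly. Permuting the children changes which of the tied moves is found first.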

Webtesters

The Policy for Electronic Submission is the same as for Assignment 1.
Submit problem 2 (alpha-beta minimax) to the web tester
This web tester will run your alpha-beta minimax on a Connect 4 state using a special evaluation function that records the arguments you give it for each call. This is compared with the correct sequence of calls. It is important that you evaluate child states in the order that the c4-children procedure gives them to you, or your sequence of calls will be completely different.
One thing that can cause students' code to test incorrectly is that they do not pass the correct current-player and max-player arguments to the evaluation function. The max-player argument (either the symbol X or O) is the player on whose behalf you are evaluating the board. The current-player argument (also either the symbol X or O) is the player whose turn it is to play in the given board state.
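
The web tester's recording evaluation function is not published. The following is only a rough sketch of the idea, which you could use to inspect your own sequence of calls. The name make-recording-eval, the argument order (board current-player max-player), and the constant score of 0 are all assumptions made for this illustration.

```scheme
;; Returns a pair: (evaluation-function . procedure-returning-the-log).
;; Hypothetical helper; the argument order is assumed, so check your stub file.
(define (make-recording-eval)
  (let ((calls '()))
    (cons (lambda (board current-player max-player)
            ;; Remember the arguments of this call (newest first).
            (set! calls (cons (list board current-player max-player) calls))
            0)                               ; every board gets the same score
          (lambda () (reverse calls)))))     ; log in the order the calls were made

(define rec (make-recording-eval))
(define recording-eval (car rec))
(define recorded-calls (cdr rec))

;; Two calls made by hand, standing in for calls made by your minimax.
(recording-eval "board-1" 'O 'X)
(recording-eval "board-2" 'O 'X)
(recorded-calls)   ; => (("board-1" O X) ("board-2" O X))
```

Because the log keeps the calls in order, visiting children in a different order, or swapping current-player and max-player, produces a visibly different list.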
Submit problem 3 ("minimally competent" evaluation function) to the web tester
This webtester will play two games: one against the random player (you play X), and one against the reference player (you play O). Note that the reference player file does not contain the two-move opening database it uses. You must beat both of these players for full credit on this problem.
Connect 4 Pretournament System
You can upload an evaluation function as many times as you like to the pretournament system. We will take your most recent evaluation function as of the deadline for this assignment. There is no late period for this deadline, so make sure you get your evaluation function into the pretournament.
Note: the flash applet that Kris wrote for this page still has a few little problems, but the pretournament system itself is up and working.

Opening databases

The details of the opening database mechanism have changed because the implementation had to be different than I originally planned. The details and support code procedures (in a4code.com) are described below.

You can specify a move that you would like your player to make from a given state (not move sequence). Note that there are, in general, multiple move sequences that result in the same state. The problem with specifying states is that our state representation (primarily a string) takes up more memory than it should. Therefore, the opening database mechanism uses a more compact representation. In order to use the opening database mechanism, you must transform your opening database into this form.

Here are the steps to create an opening database:

You first need to create a "board-state opening database". For example:
```
    (define my-db
      '(; X's first move should be in column 2
        ("       |       |       |       |       |       " 2)
        ; if X moves in column 4, then O should move in column 7
        ("       |       |       |       |       |   X   " 7)
        ; if X moves in column 2, then O should move in column 3
        ("       |       |       |       |       | X     " 3)
        ; X's second move, if O played in column 4, should be in column 3
        ("       |       |       |       |       | X O   " 3)
        ; X's second move, if O played in column 7, should be in column 6
        ("       |       |       |       |       | X    O" 6)))
```
Note that your database need not be complete; you only have to specify the states that you want to. Also note that this mixes states in which X is to move with those in which O is to move. The order of the states in your list does not matter.
You can create this "board-state opening database" in a number of ways. The best way might be to write a program to generate these states (taking advantage of the support code). To generate the database above, I used the play-piece procedure in the support code. There are also procedures in the support code to transform a "movestring" into a "boardstring". See details below.
Next, you need to transform this into an opening database by using the procedure convert-bsdb which will write the converted opening database to a file:
```
    (convert-bsdb my-db "my-db.scm")
    ; Validating database...
    ; Converting database...
    ; Writing to file...
    ;Unspecified return value
```
WARNING — this will overwrite the filename you give it.
This file, in the above case, looks like this:
```
    (define opening-db '((0 0 2) (0 0 15) (0 3 3) (0 3 19) (2 3 6) ))
```
The reason that the converted database is written to a file is that it can be somewhat large.
You need to put this code (insert or copy/paste) in your assign4p3.scm file.
Note that this code will define a variable opening-db. However, when you load a4code.com, the opening-db variable will be reset to ()! I'd suggest putting your opening database definition at the end of your file.

Opening database procedures in the support code

(movestring->boardstring ms) — converts a "movestring" into a board (state) string. For example:

    (movestring->boardstring "2673")
    ;Value: "       |       |       |       |       | XO  OX"

    (print-board (movestring->boardstring "2673"))
        +---------------+
       6|               |
       5|               |
       4|               |
       3|               |
       2|               |
       1|   X O     O X |
        +---------------+
          1 2 3 4 5 6 7

Note that the movestring "7326" would produce the same boardstring.
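
To make the relationship between movestrings and boardstrings concrete, here is a rough re-implementation. It is only a sketch: the real movestring->boardstring is in a4code.com and may work differently. The sketch- names are made up, and the layout (six rows of seven cells, top row first, separated by "|") is inferred from the examples on this page.

```scheme
;; Hypothetical re-implementation for illustration only.

;; Build an empty board: 6 rows of 7 cells, top row first, rows
;; separated by "|" (47 characters in total).
(define (sketch-empty-boardstring)
  (let ((s (make-string 47 #\space)))
    (do ((r 1 (+ r 1)))
        ((= r 6) s)
      (string-set! s (- (* r 8) 1) #\|))))

;; Play the digits of ms in turn, X first. No validation is done:
;; columns are assumed to be 1..7 and never overfilled.
(define (sketch-movestring->boardstring ms)
  (let ((board (sketch-empty-boardstring))
        (heights (make-vector 8 0)))   ; heights[c] = pieces in column c
    (let loop ((i 0) (piece #\X))
      (if (= i (string-length ms))
          board
          (let* ((col (- (char->integer (string-ref ms i))
                         (char->integer #\0)))
                 (row (+ 1 (vector-ref heights col)))   ; 1 = bottom row
                 (index (+ (* (- 6 row) 8) (- col 1))))
            (string-set! board index piece)
            (vector-set! heights col row)
            (loop (+ i 1) (if (char=? piece #\X) #\O #\X)))))))

(sketch-movestring->boardstring "2673")
;; => "       |       |       |       |       | XO  OX"
(string=? (sketch-movestring->boardstring "2673")
          (sketch-movestring->boardstring "7326"))
;; => #t
```

The second call shows why the database is keyed on states and not on move sequences: both orders of play reach the same board.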

BTW, I modified print-board in Version 1.3 so that it can take a full state (list containing the string and a list of pieces in each column) or just the board string itself.

(db-movestring->db-boardstring dms) — takes a "database movestring" and turns it into a "database boardstring", suitable for including in your "board state opening database". The difference between a "database movestring" and a regular "movestring" is that the last character of the former is the move you want to make from the state that results from the sequence of moves up to the second-to-last character. For example:
```
    (db-movestring->db-boardstring "26735")
    ;Value: ("       |       |       |       |       | XO  OX" 5)
```
This is the same board state as above, but I've indicated that I want to move in column five after the first four moves.
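
As a small illustration of how a database movestring is read, the following sketch splits one into its parts. The helper sketch-split-db-movestring is hypothetical and is not part of a4code.com. It assumes single-digit column numbers and that X always moves first, as in the examples above.

```scheme
;; Hypothetical helper, not part of a4code.com.
;; Splits a database movestring into three parts: the moves already
;; played, the column to play next, and the player who makes that move
;; (X moves first, so X is to move after an even number of moves).
(define (sketch-split-db-movestring dms)
  (let* ((n (string-length dms))
         (played (substring dms 0 (- n 1)))
         (move (- (char->integer (string-ref dms (- n 1)))
                  (char->integer #\0)))
         (to-move (if (even? (string-length played)) 'X 'O)))
    (list played move to-move)))

(sketch-split-db-movestring "26735")   ; => ("2673" 5 X)
(sketch-split-db-movestring "2")       ; => ("" 2 X)
```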
(get-opening-db-move board) — checks if the given board state is in the opening database. If so, it returns the move, and otherwise returns #f.
This procedure has been written so that it can take a complete board state or just a boardstring. For example:
```
    (get-opening-db-move (play-piece c4-start 'X 2))
    ;Value: 3

    (get-opening-db-move "       |       |       |       |       | X    O")
    ;Value: 6
```
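The lookup behavior can be sketched against an unconverted "board-state opening database". This is only an illustration: the real get-opening-db-move works on the compact converted form and takes just the board. The names sketch-db and sketch-get-opening-move are made up, and the database is passed in explicitly here.

```scheme
;; A few entries in the board-state form described above.
(define sketch-db
  '(("       |       |       |       |       |       " 2)
    ("       |       |       |       |       | X     " 3)
    ("       |       |       |       |       | X    O" 6)))

;; Hypothetical lookup: accepts either a bare boardstring or a full
;; state whose first element is the boardstring.
(define (sketch-get-opening-move db board)
  (let* ((bs (if (string? board) board (car board)))
         (entry (assoc bs db)))          ; assoc compares keys with equal?
    (and entry (cadr entry))))           ; the move, or #f if not found

(sketch-get-opening-move sketch-db
  "       |       |       |       |       | X    O")
;; => 6
(sketch-get-opening-move sketch-db
  (list "       |       |       |       |       | X     " '(0 1 0 0 0 0 0)))
;; => 3
(sketch-get-opening-move sketch-db
  "       |       |       |       |       |      X")
;; => #f
```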
(play-piece board player col) — plays a piece for the given player, returning a newly allocated board state. board must be a full board state, player must be the symbol X or O, and col must be an integer between 1 and 7 inclusive. For example:
```
    (play-piece c4-start 'X 1)
    ;Value: ("       |       |       |       |       |X      " (1 0 0 0 0 0 0))
```
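For reference, here is a rough sketch of what a procedure like play-piece has to do. It is not the support code's implementation. The names sketch-play-piece and sketch-start are hypothetical, and the state layout (a boardstring plus a list of per-column piece counts) is taken from the output shown above. No error checking is done for full columns or bad arguments.

```scheme
;; Hypothetical stand-in for c4-start: empty board, no pieces in any column.
(define sketch-start
  (list "       |       |       |       |       |       " '(0 0 0 0 0 0 0)))

;; Hypothetical re-implementation for illustration only.
(define (sketch-play-piece state player col)
  (let* ((board (string-copy (car state)))        ; copy, so the input is untouched
         (counts (cadr state))
         (row (+ 1 (list-ref counts (- col 1))))  ; 1 = bottom row
         (index (+ (* (- 6 row) 8) (- col 1))))   ; rows are 8 characters apart
    (string-set! board index (if (eq? player 'X) #\X #\O))
    (list board
          ;; Rebuild the counts list with column col incremented.
          (let loop ((cs counts) (c 1))
            (cond ((null? cs) '())
                  ((= c col) (cons (+ 1 (car cs)) (cdr cs)))
                  (else (cons (car cs) (loop (cdr cs) (+ c 1)))))))))

(sketch-play-piece sketch-start 'X 1)
;; => ("       |       |       |       |       |X      " (1 0 0 0 0 0 0))
```

Copying the string before changing it is what makes the result "newly allocated": the state passed in can still be used to explore other moves.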