Wrote proposal
Submitted thesis proposal to DeBruhl
Discussed logistics of testing
Began arrangements with Mammen on using 357 students.
Need to talk to Nico about O.S.
Researched NLP and Hidden Markov Chains
Requested VirtualBox be installed on CSL machines
Acquired V.M. for router
Researched methods of tracking/tagging SSH connections
Found several simple keyloggers
Plan: Find one which can monitor one process at a time. Must also include timestamps
Realized using V.M. as router and be on the internet doesn't work because I can't force it to be the default gateway
How do I tell the difference between a legitimate SSH connection or one to be forwarded?
Might be easier to just use VPNs and hook on new connections
Gathered preliminary data on Unix commands
315 non-privileged commands, most of them at length 7
Began IRB paperwork
Began writing prompts for students
Plan: Talk to Lupo/Pentoja about Fall Quarter
Spoke with Debbie Hart
Submitted IRB paperwork
Wrote keylogger script for SSH
Wrote up Context-Aware SSH docs
Extensively tested key logger
IRB approved
Talked to Tedd about getting VirtualBox on lab machines
Tested OpenVPN
Wrote tcpdump filter for only traffic SSHing into another system
Tested pairing network packets to keystrokes
First SYN packet is within 100ms of SSH starting in log
Discussed thesis progress with DeBruhl
Ethan (357 student) expressed interest in project
Set up CRON to automatically run packet tap on reboot + app armor permissions
Server suddenly cannot connect to Unix1
Forgot to save IPTables rule
Started writing script to automatically pair keylog files to packet flows
Got to the point where it could match the start of a file to a TCP SYN
Finished and tested script
Was able to match all packets with 50ms time difference
Tested multiple keylog files.
Discovered that different keylogs have different delays.
Delay within one log file fairly consistent
Created scripts which copy keylog file to router VM.
Changes in delay attributable to AWS server instances changing
Tested two SSH sessions at same time
Found issue with VM routing
Set up VPN to automatically enable when VM boots
Plan: Separate out each flow into a separate pcap
Worked on separating each TCP flow into a separate pcap
Apparently editcap can't do this on it's own so I'm writing my own utility for this
Deployed VM to Unix machines upstairs
Tested running Dataset 1 with Griffian
Set up Lubuntu VM (~1/2 the size)
Fixed issues with VirtualBox version differences (CSL was running 6, I was running 5)
Gathered data from Lucy
Wrote script to automatically filter the packets before sending them to server.
Got VM's working on CSL computers - Had to install to /tmp for space reasons
Tested client configuration with 3 people simultaneously
Added prompt to script to give students a chance to review/approve data before submitting
Discussed progress with DeBruhl
Worked on the disconnect problem
Gather data from a few students:
Captured log of what happened
Solution: Use TCP
Talked with Nico about testing O.S./S.S. sections
Found bug that caused network to disconnect
Can fix within first two minutes by adding the route back
Gathered more data from 357 students
Option: Reduce dataset requirements
Asked for volunteers from O.S.
Met with 357 students from Griffian's section/Elie
Checked that pcaps, log files, and consent forms line up
Because of Timezone offsets, some logs on day boundary
Finished refactoring packet matcher to work with flow-seperated pcaps.
Began looking over data
Seems like a lot of files have incomplete data - may have to rerecord all of it.
Looked into issues with files
Time stamps are way off. Some data has timestamps way after my last recorded flow
See if I have a syn-packet for one of my problem flows
Some packet captures empty. Seems data capture system needs serious work.
Emailed Ethan
Emailed DeBruhl
Bought Amazon gift cards
Looked for patterns in missing data