Difference between revisions of "DataShop Pipeline"

From LearnLab
Jump to: navigation, search
(Updates on Ryan data)
 
(23 intermediate revisions by 5 users not shown)
Line 5: Line 5:
 
(on [http://learnlab.web.cmu.edu/datashop/ learnlab.org] - ordered by alphabetically by project, researchers get a gold star when their datasets move up into this table!)
 
(on [http://learnlab.web.cmu.edu/datashop/ learnlab.org] - ordered by alphabetically by project, researchers get a gold star when their datasets move up into this table!)
  
{| width="1061" class="pipeLineTable" id="pipeLineTable1"
+
{| class="pipeLineTable FCK__ShowTableBorders" id="pipeLineTable1" width="1061"
 
|-
 
|-
! width="33" height="25" style="width: 25pt;" | <br>
+
! width="33" height="25" | <br>
! width="134" style="width: 101pt;" | Project
+
! width="134" | Project
! width="283" style="width: 130pt;" | Dataset Name
+
! width="283" | Dataset Name
! width="85" style="width: 64pt;" | DB ID
+
! width="85" | DB ID
! width="85" style="width: 64pt;" | Tool
+
! width="85" | Tool
! width="82" style="width: 62pt;" | LearnLab
+
! width="82" | LearnLab
! width="107" style="width: 80pt;" | P.I.
+
! width="107" | P.I.
! width="104" style="width: 78pt;" | School(s)
+
! width="104" | School(s)
! width="94" style="width: 71pt;" | Date
+
! width="94" | Date
! style="width: 236px;" | Notes/Status Remarks
+
! Notes/Status Remarks
 
|-
 
|-
| <br>
+
|  
 +
| Chemistry Collaboration
 +
| Effects of collaboration in virtual laboratory environments
 +
| 95
 +
| VLAB
 +
| Chemistry
 +
| Bruce McLaren
 +
|
 +
| Spring 2007
 +
| "The CoChemEx Project: Conceptual Chemistry Learning through Experimentation and Adaptive Collaboration"
 +
|-
 +
|
 +
| Contiguity
 +
| Contiguity CWCTC Spring 2006
 +
| 79
 +
| CL
 +
| Geometry
 +
| Vincent Aleven
 +
| CWCTC HS
 +
| Spring 2006
 +
|
 +
|-
 +
|
 
| Geometry Course
 
| Geometry Course
 
| Geometry Area (1996-97)
 
| Geometry Area (1996-97)
Line 27: Line 49:
 
| unknown
 
| unknown
 
| 1996-97
 
| 1996-97
|  
+
| Paper written -- get reference!!<br>8/17: Primary analysis complete. Cen (2006), Cen (2007)<br>
Paper written -- get reference!!<br> 8/17: Primary analysis complete. Cen (2006), Cen (2007)<br>
+
 
+
 
|-
 
|-
| <br>
+
|  
 
| Geometry Course
 
| Geometry Course
 
| Geometry Angles - Fox Chapel 1998
 
| Geometry Angles - Fox Chapel 1998
Line 40: Line 60:
 
| unknown
 
| unknown
 
| unknown
 
| unknown
| Files-only<br>  
+
| Files-only<br>
7/23: Low priority to do: get raw logs and distill to transaction level.<br> 8/17: Two best paper awards!
+
7/23: Low priority to do: get raw logs and distill to transaction level.<br>8/17: Two best paper awards!
 +
 
 +
|-
 +
|
 +
| Improving Algebra Learning and Collaboration
 +
| CPS Algebra 1 2005
 +
| 109
 +
| CL-2005
 +
| Algebra
 +
| Bruce McLaren, Nikol Rummel
 +
| Hampton HS
 +
| 2005-2006
 +
|
 +
|-
 +
|
 +
| Knowledge Tracking
 +
| Chinese Vocabulary Fall 2006
 +
| 160
 +
| FaCT
 +
| Chinese
 +
| Phil Pavlik
 +
| CMU
 +
| Spring 2006
 +
| 2 papers
 +
|-
 +
|
 +
| Physics Course
 +
| USNA Physics Fall 2006
 +
| 126
 +
| Andes
 +
| Physics
 +
| Kurt VanLehn
 +
| US Naval Academy
 +
| Fall 2006
 +
|
 +
 
 +
|-
 +
|
 +
| Stoichiometry Study
 +
| PSLC Stoichiometry Study 1
 +
| 2
 +
| CTAT-Flash
 +
| Chemistry
 +
| Bruce McLaren
 +
| UBC, Hampton HS, NJ High school
 +
| 2005+
 +
|
 +
|-
 +
|
 +
| The Effect of Generation and Interaction on Robust Learning
 +
| Self Explanation - Electric Fields - USNA - Spring 2006
 +
| 104
 +
| Andes
 +
| Physics
 +
| Kurt VanLehn/Bob Hausman
 +
| US Naval Academy
 +
| Spring 2006
 +
|
 +
 
  
 
|}
 
|}
Line 49: Line 127:
 
(on [http://learnlab.web.cmu.edu/datashop/ learnlab.org] - ordered by alphabetically by project)
 
(on [http://learnlab.web.cmu.edu/datashop/ learnlab.org] - ordered by alphabetically by project)
  
{| class="pipeLineTable" id="pipeLineTable2"
+
{| class="pipeLineTable FCK__ShowTableBorders" id="pipeLineTable2"
 
|-
 
|-
! width="33" height="25" style="width: 25pt;" | <br>
+
! width="33" height="25" | <br>
! width="134" style="width: 101pt;" | Project
+
! width="134" | Project
! width="283" style="width: 130pt;" | Dataset Name
+
! width="283" | Dataset Name
! width="85" style="width: 64pt;" | DB ID
+
! width="85" | DB ID
! width="85" style="width: 64pt;" | Tool
+
! width="85" | Tool
! width="82" style="width: 62pt;" | LearnLab
+
! width="82" | LearnLab
! width="107" style="width: 80pt;" | P.I.
+
! width="107" | P.I.
! width="104" style="width: 78pt;" | School(s)
+
! width="104" | School(s)
! width="94" style="width: 71pt;" | Date
+
! width="94" | Date
! style="width: 236px;" | Notes/Status Remarks
+
! Notes/Status Remarks
 
|-
 
|-
 
| <br>
 
| <br>
Line 71: Line 149:
 
| <br>
 
| <br>
 
| Winter 2006
 
| Winter 2006
| 07/06/07: Being QA'd by Tristan<br>  
+
| 07/06/07: Being QA'd by Tristan<br>
 
09/22/07: Data loaded to production.
 
09/22/07: Data loaded to production.
 +
01/18/08: Reloaded on production with bug fixes.
 +
 +
|-
 +
| <br>
 +
| A Multimodal Interface for Solving Equations
 +
| Handwriting - Spring 2007
 +
| 165
 +
| CL-2006 (Munger)
 +
| Algebra
 +
| Lisa Anthony
 +
| <br>
 +
| Spring 2007
 +
| 07/06/07: Being QA'd by Tristan<br>
 +
09/24/07: Data loaded to the-cooker.<br>09/28/07: Updated on cooker to v2.3, waiting for new version of munger to load on production with v2.3 analysis_db<br>10/26/07: Bug fixes completed in Munger, waiting to receive new version. <br>11/5/07 New munger version had errors on run, awaiting word from tristan on causes.<br>1/18/08 Loaded onto production.
  
 
|-
 
|-
Line 84: Line 176:
 
| CWCTC<br>Hampton<br>Wilkinsburg HSs
 
| CWCTC<br>Hampton<br>Wilkinsburg HSs
 
| 2005-2006
 
| 2005-2006
| 2/20: Received DataMunger.<br>  
+
| 2/20: Received DataMunger.<br>
2/27: Started running data munger for wilkinsburg-algebra i.<br> 3/5: Munger done, have 1,378,891 txs instead of 1,187,841.<br> 5/4: Munger fixed, Jon Steinhart running the conversion, Tristan to QA. Supposed to have data week of 5/7<br> Load of Wilkinsburg student data delayed due to problems copying the student files.<br> 6/25: Hampton numbers on production not right. CWCTC numbers on the-cooker not right. Wilks data doesn't load. Requested a new complete CD from CL.<br> 7/9: Modifying analysis database to support improved pruning, waiting for new CD with latest version of munger.<br> 7/23: Loaded onto production with all 3 schools, one student from Wilkinsburg failed to load.
+
2/27: Started running data munger for wilkinsburg-algebra i.<br>3/5: Munger done, have 1,378,891 txs instead of 1,187,841.<br>5/4: Munger fixed, Jon Steinhart running the conversion, Tristan to QA. Supposed to have data week of 5/7<br>Load of Wilkinsburg student data delayed due to problems copying the student files.<br>6/25: Hampton numbers on production not right. CWCTC numbers on the-cooker not right. Wilks data doesn't load. Requested a new complete CD from CL.<br>7/9: Modifying analysis database to support improved pruning, waiting for new CD with latest version of munger.<br>7/23: Loaded onto production with all 3 schools, one student from Wilkinsburg failed to load.
  
 
|-
 
|-
Line 120: Line 212:
 
| Spring 2007
 
| Spring 2007
 
| <br>
 
| <br>
|-
 
| <br>
 
| Chemistry Collaboration
 
| Effects of collaboration in virtual laboratory environments
 
| 95
 
| VLAB
 
| Chemistry
 
| Bruce McLaren
 
| <br>
 
| Spring 2007
 
| Files only.
 
 
|-
 
|-
 
| <br>
 
| <br>
Line 164: Line 245:
 
| Winter 2006
 
| Winter 2006
 
| Files only
 
| Files only
|-
 
| <br>
 
| Contiguity-CWCTC
 
| Contiguity CWCTC Spring 2006
 
| 79
 
| CL
 
| Geometry
 
| Vincent Aleven
 
| CWCTC HS
 
| Spring 2006
 
 
| <br>
 
| <br>
 
|-
 
|-
Line 196: Line 267:
 
| CWCTC
 
| CWCTC
 
| Fall 2006
 
| Fall 2006
| 6/21: Reloaded.<br>  
+
| 6/21: Reloaded.<br>
07/18/07: v4.13 Loaded on test machine for reload verify<br> 7/23: Received missing CTAT data from Frank. Kyle to work on 7/24.<br> 7/26: Missing CTAT uploaded to production.<br>11/14 reloaded CL data, CTAT data needs reload<br>
+
07/18/07: v4.13 Loaded on test machine for reload verify<br>7/23: Received missing CTAT data from Frank. Kyle to work on 7/24.<br>7/26: Missing CTAT uploaded to production.<br>11/14 reloaded CL data, CTAT data needs reload<br>
  
 
|-
 
|-
Line 319: Line 390:
 
| CMU
 
| CMU
 
| Summer 2006
 
| Summer 2006
| 6/19: Need to reload CL data. Need to reload CTAT data.<br>  
+
| 6/19: Need to reload CL data. Need to reload CTAT data.<br>
6/21: Reloading CTAT to QA machine. CL will follow.<br> 6/21: Data reloaded to production.<br> 6/25: Bad (not all students were anonymized correctly) CTAT logs removed from production.<br> 7/18: CL data: v4.13 Loaded on test machine for reload verify by Octav.<br> 7/24: CTAT data: ready to load<br> 7/31: CTAT data reloaded.<br>
+
6/21: Reloading CTAT to QA machine. CL will follow.<br>6/21: Data reloaded to production.<br>6/25: Bad (not all students were anonymized correctly) CTAT logs removed from production.<br>7/18: CL data: v4.13 Loaded on test machine for reload verify by Octav.<br>7/24: CTAT data: ready to load<br>7/31: CTAT data reloaded.<br>
  
 
|-
 
|-
Line 332: Line 403:
 
| Steel Valley HS
 
| Steel Valley HS
 
| Spring 2006
 
| Spring 2006
| 6/19: Need to reload CL data.<br>  
+
| 6/19: Need to reload CL data.<br>
 
6/21: Data reloaded to production.
 
6/21: Data reloaded to production.
  
Line 345: Line 416:
 
| CWCTC &amp; Wilkinsburg
 
| CWCTC &amp; Wilkinsburg
 
| Spring 2007
 
| Spring 2007
| 07/06/07: Will receive from CL week of 7/10/07<br>  
+
| 07/06/07: Will receive from CL week of 7/10/07<br>
07/09/07: Octav received data<br> 07/18/07: Loaded on cooker<br> 07/21/07: QUESTION: is this 3 datasets or 1 with 3 schools?<br> 7/29/07: Loaded to production.
+
07/09/07: Octav received data<br>07/18/07: Loaded on cooker<br>07/21/07: QUESTION: is this 3 datasets or 1 with 3 schools?<br>7/29/07: Loaded to production.
  
 
11/14: reloaded onto production.
 
11/14: reloaded onto production.
Line 371: Line 442:
 
| Gymnasium
 
| Gymnasium
 
| Spring 2007
 
| Spring 2007
| 07/10/07: Octav verifying conversion<br>  
+
| 07/10/07: Octav verifying conversion<br>
 
7/29/07: Loaded to production.
 
7/29/07: Loaded to production.
  
Line 466: Line 537:
 
| Geometry Course
 
| Geometry Course
 
| Geometry Angles - North Hills Spring 2003
 
| Geometry Angles - North Hills Spring 2003
| 63
+
| 68
 
| CL
 
| CL
 
| Geometry
 
| Geometry
Line 495: Line 566:
 
| unknown
 
| unknown
 
| Bad Version
 
| Bad Version
|-
 
| <br>
 
| Geometry Course
 
| Geometry Angles -Fox Chapel 1998
 
| 122
 
| CL
 
| Geometry
 
| Vincent Aleven
 
| Fox Chapel
 
| 1998
 
| Files Only
 
|-
 
| <br>
 
| Geometry Course
 
| Geometry Area (1996-97)
 
| 76
 
| CL
 
| Geometry
 
| Vincent Aleven
 
| unknown
 
| <br>
 
| <br>
 
 
|-
 
|-
 
| 30
 
| 30
Line 526: Line 575:
 
| <br>
 
| <br>
 
| <br>
 
| <br>
| <br>
 
|-
 
| 2
 
| Improving Algebra Learning and Collaboration
 
| CPS Algebra I 2005
 
| 109
 
| CL-2005
 
| Algebra
 
| Bruce McLaren, Nikol Rummel
 
| Hampton HS
 
| 2005-2006
 
 
| <br>
 
| <br>
 
|-
 
|-
Line 559: Line 597:
 
| Golden Valley High School
 
| Golden Valley High School
 
| Feb, 2007
 
| Feb, 2007
| 07/06/07: Being QA'd by Tristan<br>  
+
| 07/06/07: Being QA'd by Tristan<br>
8/13/07: CTAT files received from Frank. Need to insert course_name, check for errors.<br> 8/15/07: CTAT logs loaded to production. Some dataset info updated as well.<br> 9/21/07: Munger failure due to CTAT logs already loaded. Need to clean out CTAT logs and try again.<br> 9/22/37: Data loaded to production.<br> 9/25/07: CTAT logs loaded to production (were removed for munger reload)
+
8/13/07: CTAT files received from Frank. Need to insert course_name, check for errors.<br>8/15/07: CTAT logs loaded to production. Some dataset info updated as well.<br>9/21/07: Munger failure due to CTAT logs already loaded. Need to clean out CTAT logs and try again.<br>9/22/37: Data loaded to production.<br>9/25/07: CTAT logs loaded to production (were removed for munger reload)<br>1/18/2007: reoaded to production
  
 
|-
 
|-
Line 616: Line 654:
 
| CMU
 
| CMU
 
| Spring 2007
 
| Spring 2007
| 8/2/07: Received log files from Jonathan Sewall. Will load to QA.<br>  
+
| 8/2/07: Received log files from Jonathan Sewall. Will load to QA.<br>
8/07/07: Files loaded to QA. Some still need fixed (contain invalid XML) <br> 8/16/07: All CTAT logs free of errors. All have been loaded to QA - waiting for researcher verification.<br> 8/24/07: Need to get logs from Discussion Board from Erin Walker. Let's ping them again late Sept.<br> 9/25/07: Files are on QA - waiting for go-ahead to move to prod from the researchers.
+
8/07/07: Files loaded to QA. Some still need fixed (contain invalid XML) <br>8/16/07: All CTAT logs free of errors. All have been loaded to QA - waiting for researcher verification.<br>8/24/07: Need to get logs from Discussion Board from Erin Walker. Let's ping them again late Sept.<br>9/25/07: Files are on QA - waiting for go-ahead to move to prod from the researchers.
  
 
|-
 
|-
Line 676: Line 714:
 
| 10/9/2007: should be renamed to IWT Article Tutor Level 5 Study Fall 2007 when data collection is complete
 
| 10/9/2007: should be renamed to IWT Article Tutor Level 5 Study Fall 2007 when data collection is complete
 
|-
 
|-
|  
+
| <br>
 
| Knowledge Tracking
 
| Knowledge Tracking
 
| Chinese Vocabulary Spring 2006
 
| Chinese Vocabulary Spring 2006
Line 685: Line 723:
 
| CMU
 
| CMU
 
| Spring 2006
 
| Spring 2006
|  
+
| <br>
 
|-
 
|-
|
 
| Knowledge Tracking
 
| Chinese Vocabulary Fall 2006
 
| 160
 
| Phil's own
 
| Chinese
 
| Phil Pavlik
 
| CMU
 
| Spring 2006
 
| 7/06/07: Phil working on conversion. ETA 4-6 weeks.<br>
 
09/25/07: DataShop has the dataset file. Waiting to upgrade the-cooker before loading for vetting.<br> 09/28/07: Loaded on the cooker under v2.3, waiting for verification<br> 10/9/07: Problem with the number of observations in Learning Curves. DS team looking into it.<br> 11/12/07: Data reloaded to bunny.<br>Loaded to production on 11/13/07
 
 
 
|-
 
|-
|  
+
| <br>
 
| Knowledge Tracking
 
| Knowledge Tracking
 
| Chinese Vocabulary Spring 2007
 
| Chinese Vocabulary Spring 2007
Line 709: Line 735:
 
| CMU
 
| CMU
 
| Spring 2007
 
| Spring 2007
| 7/06/07: Phil working on conversion. ETA 4-6 weeks.<br>  
+
| 7/06/07: Phil working on conversion. ETA 4-6 weeks.<br>
9/26/07: Loaded to the-cooker<br> 10/9/07: Problem with the number of observations in Learning Curves. DS team looking into it.<br> 11/12/2007: Data reloaded to bunny.<br>Loaded to production on 11/13/07
+
9/26/07: Loaded to the-cooker<br>10/9/07: Problem with the number of observations in Learning Curves. DS team looking into it.<br>11/12/2007: Data reloaded to bunny.<br>Loaded to production on 11/13/07
 
+
 
|-
 
|-
 
|  
 
|  
 +
| Knowledge Tracking
 +
| French Vocabulary Spring 2007
 +
| 168
 +
| FacT System
 +
| French
 +
| Phil Pavlik/Nora Presson
 +
| <br>
 +
| Spring 2007
 +
| 07/06/07: Nora Presson to send data, no ETA yet.<br>
 +
8/13/07: Nora has import file ready. Waiting to receive from her to load onto the-cooker.<br>8/15/07: Data loaded to the-cooker. Vetting in process.<br>10/8/07: Problem with import file, Nora to reprocess and send to DataShop team.<br>11/12/07: Data reloaded to were-rabbit. Noticed problem with # students.<br>12/3/07: Nora to improve problem names in dataset<br>1/22/08: Data goes live!
 +
 +
|-
 +
| <br>
 
| Knowledge Tracking
 
| Knowledge Tracking
 
| Spanish Vocabulary Spring 2006
 
| Spanish Vocabulary Spring 2006
Line 720: Line 758:
 
| Spanish
 
| Spanish
 
| Phil Pavlik
 
| Phil Pavlik
|  
+
| <br>
 
| Winter 2007
 
| Winter 2007
|  
+
| <br>
 
|-
 
|-
|  
+
| <br>
 
| Knowledge Tracking
 
| Knowledge Tracking
 
| Chinese Vocabulary Transfer Lab Study Spring 2006
 
| Chinese Vocabulary Transfer Lab Study Spring 2006
Line 744: Line 782:
 
| unknown
 
| unknown
 
| Fall 2005
 
| Fall 2005
|  
+
| <br>
 
|-
 
|-
 
| 34
 
| 34
Line 755: Line 793:
 
| unknown
 
| unknown
 
| Fall 2005
 
| Fall 2005
|  
+
| <br>
 
|-
 
|-
 
| 35
 
| 35
Line 766: Line 804:
 
| unknown
 
| unknown
 
| Fall 2005
 
| Fall 2005
|  
+
| <br>
 
|-
 
|-
 
| 36
 
| 36
Line 779: Line 817:
 
| Loaded on Jun-08-2007
 
| Loaded on Jun-08-2007
 
|-
 
|-
| <br>
 
| Physics Course
 
| Physics - USNA - Fall 2006
 
| 126
 
| Andes
 
| Physics
 
| Kurt VanLehn
 
| US Naval Academy
 
| Fall 2006
 
| 6/19/07: Received 18,000 files from Anders.<br>6/20/07: Loaded to cooker, awaiting verification.
 
6/22/07: Tim to work on better skill model with Brett van de Sande &amp; Anders. Waiting on result of this.<br> 6/27/07: Anders regenerating raw logs <br> 6/28/07: Files distilled ~100 have invalid xml. Anders to fix.<br> 7/3/07: Reloading to the-cooker, some tutor semantic names are off. Also asked Anders to include units. 7/24/07: Received new data from Anders. Fixing bugs in distiller.<br> 7/29/07: Data loaded to production - will be reloaded for changes to distiller.
 
 
 
|-
 
|-
 
| <br>
 
| <br>
Line 801: Line 827:
 
| US Naval Academy
 
| US Naval Academy
 
| Spring 2006
 
| Spring 2006
| Waiting to receive files from Anders.<br>  
+
| Waiting to receive files from Anders.<br>
8/14/07: files on Cooker, waiting for verification.<br> 09/28/07: Loaded on the cooker under v2.3, waiting for verification <br>10/26/07: Will reload to production once new release is live. <br>11/5/07: Loaded to production
+
8/14/07: files on Cooker, waiting for verification.<br>09/28/07: Loaded on the cooker under v2.3, waiting for verification <br>10/26/07: Will reload to production once new release is live. <br>11/5/07: Loaded to production
  
 
|-
 
|-
Line 814: Line 840:
 
| US Naval Academy
 
| US Naval Academy
 
| Spring 2007
 
| Spring 2007
| 10/26/07: Bob to send files to Anders for conversion.<br>  
+
| 10/26/07: Bob to send files to Anders for conversion.<br>
10/30/07: Anders sent files to DataShop team.<br> 11/2/07: Loaded to the cooker, waiting for verify.<br> renamed from previous "The Effects of Interaction on Robust Learning - Spring 2007"
+
10/30/07: Anders sent files to DataShop team.<br>11/2/07: Loaded to the cooker, waiting for verify.<br>renamed from previous "The Effects of Interaction on Robust Learning - Spring 2007"
  
 
11/14/07: Loaded onto production
 
11/14/07: Loaded onto production
Line 840: Line 866:
 
| Pitt-ELI
 
| Pitt-ELI
 
| Summer 2006
 
| Summer 2006
| 3/29/2007 - Sent message to Michael, Maxine to determine status<br>  
+
| 3/29/2007 - Sent message to Michael, Maxine to determine status<br>
4/4/07 - Waiting for Michael's studies to finish, he will then convert to DS format.<br> 6/12/07 - Waiting for Michael to add &lt;conditions&gt; to the datasets<br> 6/25/07 - Loading datasets to the-cooker<br> 6/26/07 - Ready to load to production<br> 6/27/07 - Loaded to production
+
4/4/07 - Waiting for Michael's studies to finish, he will then convert to DS format.<br>6/12/07 - Waiting for Michael to add &lt;conditions&gt; to the datasets<br>6/25/07 - Loading datasets to the-cooker<br>6/26/07 - Ready to load to production<br>6/27/07 - Loaded to production
  
 
|-
 
|-
Line 853: Line 879:
 
| Pitt-ELI
 
| Pitt-ELI
 
| Spring 2006
 
| Spring 2006
| 3/29/2007 - Sent message to Michael, Maxine to determine status<br>  
+
| 3/29/2007 - Sent message to Michael, Maxine to determine status<br>
4/4/07 - Waiting for Michael's studies to finish, he will then convert to DS format.<br> 6/12/07 - Waiting for Michael to add &lt;conditions&gt; to the datasets<br> 6/25/07 - Loading datasets to the-cooker<br> 6/26/07 - Reloading to the-cooker<br> 6/27/07 - Ready to load to production<br> 6/27/07 - Loaded to production
+
4/4/07 - Waiting for Michael's studies to finish, he will then convert to DS format.<br>6/12/07 - Waiting for Michael to add &lt;conditions&gt; to the datasets<br>6/25/07 - Loading datasets to the-cooker<br>6/26/07 - Reloading to the-cooker<br>6/27/07 - Ready to load to production<br>6/27/07 - Loaded to production
  
 
|-
 
|-
Line 867: Line 893:
 
| 2005-2006
 
| 2005-2006
 
|  
 
|  
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br> 7/26/07: Loaded to the-cooker.<br> 8/13/07: Loaded onto production!
+
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br>7/26/07: Loaded to the-cooker.<br>8/13/07: Loaded onto production!
  
 
|-
 
|-
Line 880: Line 906:
 
| 2005-2006
 
| 2005-2006
 
|  
 
|  
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br> 7/26/07: Loaded to the-cooker.<br> 8/13/07: Loaded onto production!
+
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br>7/26/07: Loaded to the-cooker.<br>8/13/07: Loaded onto production!
  
 
|-
 
|-
Line 893: Line 919:
 
| 2005-2006
 
| 2005-2006
 
|  
 
|  
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br> 7/26/07: Loaded to the-cooker.<br> 8/13/07: Loaded onto production!
+
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br>7/26/07: Loaded to the-cooker.<br>8/13/07: Loaded onto production!
  
 
|-
 
|-
Line 906: Line 932:
 
| 2005-2006
 
| 2005-2006
 
|  
 
|  
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br> 7/26/07: Loaded to the-cooker.<br> 8/13/07: Loaded onto productionnot!
+
7/18/07: Files received from Octav, modifying DataShop code to improve import speed.<br>7/26/07: Loaded to the-cooker.<br>8/13/07: Loaded onto productionnot!
  
 
|-
 
|-
| 43
+
| <br>
| Stoichometry Study
+
| Robust learning with a Meta-Cognitive Tutor<br>
| PSLC Stoichiometry Study 1
+
| Help Tutor - Angles and Quads - 2006<br>
| 2
+
| 159<br>
| CTAT-Flash
+
| CTAT only<br>
| Chemistry
+
| Geometry<br>
| Bruce McLaren
+
| Ido Roll<br>
| UBC, a NJ HS, Hampton HS
+
| <br>
| 2005+
+
| 2005-2006<br>
| Holds data for Studies 1, 2, 3.<br>  
+
| 11/12/07: CTAT logs loaded to production.&nbsp; <br>
Paper written. Problem name labels include the condition and probably should not, makes error report difficult. Also missing problem descriptions.
+
|-
 
+
 
|-
 
|-
 
| 44
 
| 44
Line 955: Line 980:
 
| <br>
 
| <br>
 
|-
 
|-
| <br>
 
| The Effect of Generation and Interaction on Robust Learning
 
| Self Explanation - Electric Fields - Spring 2006
 
| 104
 
| Andes
 
| Physics
 
| Kurt VanLehn
 
| US Naval Academy
 
| Spring 2006
 
| Waiting to receive files from Anders.<br>
 
8/14/07: files on Cooker, waiting for verification.<br> 09/28/07: Loaded on the cooker under v2.3, waiting for verification <br>10/26/07: Will reload to production once new release is live. <br>11/5/07: Loaded to production
 
 
<br>
 
 
 
|-
 
|-
 
| 47
 
| 47
Line 1,023: Line 1,034:
 
| Mass Public
 
| Mass Public
 
| 2004-2005
 
| 2004-2005
| 6/20: Need to reload.<br> 6/25: Reloaded.
+
| 6/20: Need to reload.<br>6/25: Reloaded.
 
|-
 
|-
 
| 50
 
| 50
Line 1,034: Line 1,045:
 
| Mass Public
 
| Mass Public
 
| 2004-2005
 
| 2004-2005
| 6/20: Need to reload.<br> 6/25: Ready to load (on production)<br> 6/26: Reloaded.<br> LFA - OutOfMemory on all 4 KC models.
+
| 6/20: Need to reload.<br>6/25: Ready to load (on production)<br>6/26: Reloaded.<br>LFA - OutOfMemory on all 4 KC models.
 
|-
 
|-
 
| <br>
 
| <br>
Line 1,063: Line 1,074:
 
(datasets that are ready to be loaded to learnlab.web, highest priority on top)
 
(datasets that are ready to be loaded to learnlab.web, highest priority on top)
  
{| class="pipeLineTable" id="pipeLineTable3"
+
{| class="pipeLineTable FCK__ShowTableBorders" id="pipeLineTable3"
 
|-
 
|-
! width="33" style="width: 25pt;" | <br>
+
! width="33" | <br>
! width="134" style="width: 101pt;" | Project
+
! width="134" | Project
! width="283" style="width: 130pt;" | Dataset Name
+
! width="283" | Dataset Name
! width="85" style="width: 64pt;" | Tool
+
! width="85" | Tool
! width="82" style="width: 62pt;" | LearnLab
+
! width="82" | LearnLab
! width="107" style="width: 80pt;" | P.I.
+
! width="107" | P.I.
! width="104" style="width: 78pt;" | School(s)
+
! width="104" | School(s)
! width="94" style="width: 71pt;" | Date
+
! width="94" | Date
! style="width: 236px;" | Notes/Status Remarks
+
! Notes/Status Remarks
 
|-
 
|-
 
|  
 
|  
Line 1,090: Line 1,101:
 
(datasets that have been loaded and are to be reviewed for correctness, highest priority on top)
 
(datasets that have been loaded and are to be reviewed for correctness, highest priority on top)
  
{| class="pipeLineTable" id="pipeLineTable4"
+
{| class="pipeLineTable FCK__ShowTableBorders" id="pipeLineTable4"
 
|-
 
|-
! width="33" height="25" style="width: 25pt;" | <br>
+
! width="33" height="25" | <br>
! width="134" style="width: 101pt;" | Project
+
! width="134" | Project
! width="283" style="width: 130pt;" | Dataset Name
+
! width="100" | Dataset Name
! width="85" style="width: 64pt;" | Tool
+
! width="85" | Tool
! width="82" style="width: 62pt;" | LearnLab
+
! width="82" | LearnLab
! width="107" style="width: 80pt;" | P.I.
+
! width="107" | P.I.
! width="104" style="width: 78pt;" | School(s)
+
! width="104" | School(s)
! width="94" style="width: 71pt;" | Date
+
! width="94" | Date
! style="width: 236px;" | Notes/Status Remarks
+
! width="200" | Notes/Status Remarks
 
|-
 
|-
 
| <br>
 
| <br>
Line 1,110: Line 1,121:
 
| CWCTC, Hampton, Wilkinsburg, 3 LA schools
 
| CWCTC, Hampton, Wilkinsburg, 3 LA schools
 
| 2006-2007 school year
 
| 2006-2007 school year
| Waiting for harvest from CL. Also need a new version of munger to load. <br> Loaded to the-cooker, had multiple errors on load.
+
| Waiting for harvest from CL. Also need a new version of munger to load. <br>Loaded to the-cooker, had multiple errors on load.
 
|-
 
|-
 
| <br>
 
| <br>
| A Multimodal Interface for Solving Equations
+
| Robust Learning of Vocabulary
| Handwriting - Spring 2007
+
| Fall 2006 Personalization Study
| CL-2006 (Munger)
+
| REAP Tutor
| Algebra
+
| English
| Lisa Anthony
+
| Juffs
 +
| English Language Institute
 +
| Fall 2006
 +
| No ETA from Michael yet.<br>
 +
9/25/07: Sent follow-up email to Michael<br>10/03/07: Waiting for files from REAP Team.<br>12/10/07: Loaded to the-cooker for verification.
 +
 
 +
|-
 
| <br>
 
| <br>
 +
| Robust Learning of Vocabulary
 +
| Spring 2007 Reading 4
 +
| REAP Tutor
 +
| English
 +
| Juffs
 +
| English Language Institute
 
| Spring 2007
 
| Spring 2007
| 07/06/07: Being QA'd by Tristan<br>  
+
| No ETA from Michael yet. <br>
09/24/07: Data loaded to the-cooker.<br> 09/28/07: Updated on cooker to v2.3, waiting for new version of munger to load on production with v2.3 analysis_db<br> 10/26/07: Bug fixes completed in Munger, waiting to receive new version. <br> 11/5/07 New munger version had errors on run, awaiting word from tristan on causes.
+
9/25/07: Sent follow-up email to Michael<br>10/03/07: Waiting for files from REAP Team.<br>12/10/07: Loaded to the-cooker for verification.
  
 
|-
 
|-
| 15
 
| Knowledge Tracking
 
| French Vocabulary Spring 2007
 
| Phil's own
 
| French
 
| Phil Pavlik/Nora Presson
 
 
| <br>
 
| <br>
| Fall 2006
+
| Robust Learning of Vocabulary
| 07/06/07: Nora Presson to send data, no ETA yet.<br>  
+
| Summer 2007 Word Sense and Pronunciation Audio Study
8/13/07: Nora has import file ready. Waiting to receive from her to load onto the-cooker.<br> 8/15/07: Data loaded to the-cooker. Vetting in process.<br> 10/8/07: Problem with import file, Nora to reprocess and send to DataShop team.<br> 11/12/07: Data reloaded to bunny. Noticed problem with # students.
+
| REAP Tutor
 +
| English
 +
| Juffs
 +
| English Language Institute
 +
| Summer 2007
 +
| No ETA from Michael yet.<br>
 +
9/25/07: Sent follow-up email to Michael<br>10/03/07: Waiting for files from REAP Team.<br>12/10/07: Loaded to the-cooker for verification.
  
 
|}
 
|}
Line 1,141: Line 1,164:
 
(datasets that require additional conversion work or tweaking, highest priority on top)
 
(datasets that require additional conversion work or tweaking, highest priority on top)
  
{| class="pipeLineTable" id="pipeLineTable5"
+
{| class="pipeLineTable FCK__ShowTableBorders" id="pipeLineTable5"
 
|-
 
|-
! width="33" height="25" style="width: 25pt;" | <br>
+
! width="33" height="25" | <br>
! width="134" style="width: 101pt;" | Project
+
! width="134" | Project
! width="283" style="width: 130pt;" | Dataset Name
+
! width="283" | Dataset Name
! width="85" style="width: 64pt;" | Tool
+
! width="85" | Tool
! width="82" style="width: 62pt;" | LearnLab
+
! width="82" | LearnLab
! width="107" style="width: 80pt;" | P.I.
+
! width="107" | P.I.
! width="104" style="width: 78pt;" | School(s)
+
! width="104" | School(s)
! width="94" style="width: 71pt;" | Date
+
! width="94" | Date
! style="width: 236px;" | Notes/Status Remarks
+
! Notes/Status Remarks
 
|-
 
|-
 
| <br>
 
| <br>
Line 1,168: Line 1,191:
 
(datasets we expect to receive soon - may or may not require additional processing on our part)
 
(datasets we expect to receive soon - may or may not require additional processing on our part)
  
{| id="pipeLineTable6" class="pipeLineTable"
+
{| class="pipeLineTable FCK__ShowTableBorders" id="pipeLineTable6"
 
|-
 
|-
! width="33" height="25" style="width: 25pt;" | <br>
+
! width="33" height="25" | <br>
! width="134" style="width: 101pt;" | Project
+
! width="134" | Project
! width="283" style="width: 130pt;" | Dataset Name
+
! width="283" | Dataset Name
! width="85" style="width: 64pt;" | Tool
+
! width="85" | Tool
! width="82" style="width: 62pt;" | LearnLab
+
! width="82" | LearnLab
! width="107" style="width: 80pt;" | P.I.
+
! width="107" | P.I.
! width="104" style="width: 78pt;" | School(s)
+
! width="104" | School(s)
! width="94" style="width: 71pt;" | Date
+
! width="94" | Date
! style="width: 236px;" | Notes/Status Remarks
+
! Notes/Status Remarks
 
|-
 
|-
 
| <br>
 
| <br>
Line 1,189: Line 1,212:
 
| <br>
 
| <br>
 
| 6/6/07: Data is good to munge.  
 
| 6/6/07: Data is good to munge.  
Request sent to CL to break the dataset into smaller pieces<br> 07/09/2007: Confirmed that the munger can load sections only. Steve Ritter wishes to speak with ken/kurt to cover legal issues with this dataset.
+
Request sent to CL to break the dataset into smaller pieces<br>07/09/2007: Confirmed that the munger can load sections only. Steve Ritter wishes to speak with ken/kurt to cover legal issues with this dataset.
  
 
|-
 
|-
 
| 7
 
| 7
| width="134" class="xl58" style="width: 101pt;" | Stoichiometry
+
| class="xl58" width="134" | Stoichiometry
| width="283" class="xl24" style="width: 212pt;" | VLAB data from Stoich studies
+
| class="xl24" width="283" | VLAB data from Stoich studies
 
| <br>
 
| <br>
 
| Chemistry
 
| Chemistry
| width="107" class="xl24" style="width: 80pt;" | Bruce McLaren
+
| class="xl24" width="107" | Bruce McLaren
 
| class="xl24" | <br>
 
| class="xl24" | <br>
 
| class="xl25" | <br>
 
| class="xl25" | <br>
| width="314" class="xl24" style="width: 236pt;" | Oliver Scheuer to oversee conversion in Germany.  
+
| class="xl24" width="314" | Oliver Scheuer to oversee conversion in Germany.  
6/20/07: Mike Karabinos to work up examples for each type of VLAB Action, DataShop to fill in dummy and real &lt;tutor_message&gt;. Will then send this info to Oliver.<br> 7/27/07: Kyle completed transformation to XML. Sent out for review.<br> 8/14/07: Working on storage of replay data - waiting for response from CTAT and Germany<br> 9/25/07: Jonathan Sewall wants to see what logs look like in DataShop.<br>10/8/07: Alida loaded sample Vlab dataset to QA (Dataset: Chemistry VLAB Example Data Fall 2007).
+
6/20/07: Mike Karabinos to work up examples for each type of VLAB Action, DataShop to fill in dummy and real &lt;tutor_message&gt;. Will then send this info to Oliver.<br>7/27/07: Kyle completed transformation to XML. Sent out for review.<br>8/14/07: Working on storage of replay data - waiting for response from CTAT and Germany<br>9/25/07: Jonathan Sewall wants to see what logs look like in DataShop.<br>10/8/07: Alida loaded sample Vlab dataset to QA (Dataset: Chemistry VLAB Example Data Fall 2007).
  
 
|-
 
|-
Line 1,232: Line 1,255:
 
| CWCTC
 
| CWCTC
 
| Spring 2007
 
| Spring 2007
| Researcher has raw logs is responsible for submitting to DataShop for import.<br>  
+
| Researcher has raw logs is responsible for submitting to DataShop for import.<br>
 
7/06/07: Frank sent anonymized raw logs to Erin.
 
7/06/07: Frank sent anonymized raw logs to Erin.
  
 
|-
 
|-
 
| <br>
 
| <br>
| width="134" class="xl58" style="width: 101pt;" | Geometry Course
+
| class="xl58" width="134" | Geometry Course
| width="283" class="xl24" style="width: 212pt;" | Geometry 2006-2007
+
| class="xl24" width="283" | Geometry 2006-2007
 
| CL Geometry
 
| CL Geometry
 
| Geometry
 
| Geometry
Line 1,245: Line 1,268:
 
| 2006-2007 school year
 
| 2006-2007 school year
 
| Waiting for harvest from CL. Will need a conversion from Octav.
 
| Waiting for harvest from CL. Will need a conversion from Octav.
|-
 
| <br>
 
| Robust Learning of Vocabulary
 
| Fall 2006 Personalization Study
 
| REAP Tutor
 
| English
 
| Juffs
 
| English Language Institute
 
| Fall 2006
 
| No ETA from Michael yet.<br>
 
9/25/07: Sent follow-up email to Michael<br> 10/03/07: Waiting for files from REAP Team.
 
 
|-
 
| <br>
 
| Robust Learning of Vocabulary
 
| Spring 2007 Reading 4
 
| REAP Tutor
 
| English
 
| Juffs
 
| English Language Institute
 
| Spring 2007
 
| No ETA from Michael yet. <br>
 
9/25/07: Sent follow-up email to Michael<br> 10/03/07: Waiting for files from REAP Team.
 
 
|-
 
| <br>
 
| Robust Learning of Vocabulary
 
| Summer 2007 Word Sense and Pronunciation Audio Study
 
| REAP Tutor
 
| English
 
| Juffs
 
| English Language Institute
 
| Summer 2007
 
| No ETA from Michael yet.<br>
 
9/25/07: Sent follow-up email to Michael<br> 10/03/07: Waiting for files from REAP Team.
 
  
 
|-
 
|-
Line 1,311: Line 1,299:
 
| Winter 2008
 
| Winter 2008
 
| <br>
 
| <br>
 +
|-
 +
| <br>
 +
| Middle School Tutor (Full year)
 +
| Middle School Tutor - 2002-2003
 +
| CL-Middle School (proto BTA)
 +
| n/a
 +
| Ryan Baker (Albert Corbett)
 +
| CMU
 +
| July 2008
 +
| Waiting on bug: DataShop Import Verify Tool hanging on data set
 +
|-
 +
| <br>
 +
| Middle School Tutor (Gaming studies)
 +
| Middle School Tutor - 2003-2005
 +
| CL-Middle School (proto BTA)
 +
| n/a
 +
| Ryan Baker
 +
| CMU
 +
| July 2008
 +
| Waiting on Middle School Tutor (Full year) data set to be successfully imported
 
|}
 
|}
  
<font style="color: blue;">Octav to regenerate all of his datasets?</font>
+
<font size="+0">Octav to regenerate all of his datasets?</font>
  
 
[[Category:DataShop]]
 
[[Category:DataShop]]

Latest revision as of 12:45, 21 July 2008

This page provides information on datasets in various stages of progress. If you see an error in any of this information please feel free to correct it by editing this page. If you have a comment or concern on where a dataset sits in the DataShop pipeline, please contact the DataShop staff.

ANALYSIS REPORTED

(on learnlab.org - ordered by alphabetically by project, researchers get a gold star when their datasets move up into this table!)


Project Dataset Name DB ID Tool LearnLab P.I. School(s) Date Notes/Status Remarks
Chemistry Collaboration Effects of collaboration in virtual laboratory environments 95 VLAB Chemistry Bruce McLaren Spring 2007 "The CoChemEx Project: Conceptual Chemistry Learning through Experimentation and Adaptive Collaboration"
Contiguity Contiguity CWCTC Spring 2006 79 CL Geometry Vincent Aleven CWCTC HS Spring 2006
Geometry Course Geometry Area (1996-97) 76 CL Geometry Ken Koedinger unknown 1996-97 Paper written -- get reference!!
8/17: Primary analysis complete. Cen (2006), Cen (2007)
Geometry Course Geometry Angles - Fox Chapel 1998 122 CL Geometry Vincent Aleven unknown unknown Files-only

7/23: Low priority to do: get raw logs and distill to transaction level.
8/17: Two best paper awards!

Improving Algebra Learning and Collaboration CPS Algebra 1 2005 109 CL-2005 Algebra Bruce McLaren, Nikol Rummel Hampton HS 2005-2006
Knowledge Tracking Chinese Vocabulary Fall 2006 160 FaCT Chinese Phil Pavlik CMU Spring 2006 2 papers
Physics Course USNA Physics Fall 2006 126 Andes Physics Kurt VanLehn US Naval Academy Fall 2006
Stoichiometry Study PSLC Stoichiometry Study 1 2 CTAT-Flash Chemistry Bruce McLaren UBC, Hampton HS, NJ High school 2005+
The Effect of Generation and Interaction on Robust Learning Self Explanation - Electric Fields - USNA - Spring 2006 104 Andes Physics Kurt VanLehn/Bob Hausman US Naval Academy Spring 2006


LIVE!

(on learnlab.org - ordered by alphabetically by project)


Project Dataset Name DB ID Tool LearnLab P.I. School(s) Date Notes/Status Remarks

A Multimodal Interface for Solving Equations Handwriting Examples - Winter 2006 145 CL-2006 (Munger) Algebra Lisa Anthony
Winter 2006 07/06/07: Being QA'd by Tristan

09/22/07: Data loaded to production. 01/18/08: Reloaded on production with bug fixes.


A Multimodal Interface for Solving Equations Handwriting - Spring 2007 165 CL-2006 (Munger) Algebra Lisa Anthony
Spring 2007 07/06/07: Being QA'd by Tristan

09/24/07: Data loaded to the-cooker.
09/28/07: Updated on cooker to v2.3, waiting for new version of munger to load on production with v2.3 analysis_db
10/26/07: Bug fixes completed in Munger, waiting to receive new version.
11/5/07 New munger version had errors on run, awaiting word from tristan on causes.
1/18/08 Loaded onto production.


Algebra Course Algebra I 2005
Algebra I 2005-2006 (Hampton only)
123,110 CL-2005 Algebra Albert Corbett CWCTC
Hampton
Wilkinsburg HSs
2005-2006 2/20: Received DataMunger.

2/27: Started running data munger for wilkinsburg-algebra i.
3/5: Munger done, have 1,378,891 txs instead of 1,187,841.
5/4: Munger fixed, Jon Steinhart running the conversion, Tristan to QA. Supposed to have data week of 5/7
Load of Wilkinsburg student data delayed due to problems copying the student files.
6/25: Hampton numbers on production not right. CWCTC numbers on the-cooker not right. Wilks data doesn't load. Requested a new complete CD from CL.
7/9: Modifying analysis database to support improved pruning, waiting for new CD with latest version of munger.
7/23: Loaded onto production with all 3 schools, one student from Wilkinsburg failed to load.


Chemistry Buffer Chemistry_Buffer_Study 63 OLI Chemistry Jodi Davenport CMU Spring 2006  

Chemistry Buffer CMU_sp07_BUFFERS 94 OLI Chemistry Jodi Davenport CMU unknown  

Chemistry Buffer Chemistry_Buffer_Study_2007 84 OLI Chemistry Jodi Davenport CMU, UBC? Spring 2007

Chinese Tone Study Chinese_tonestudy 1 unknown Chinese Ying Liu CMU Summer 2006

Chinese Tone Study Two Chinese_toneperception 64 unknown Chinese Ying Liu CMU 2005-2006

Contiguity-CWCTC Contiguity Difficulty Factors Analysis Winter 2006 153
Geometry Vincent Aleven
Winter 2006 Files only

Contiguity-CWCTC Contiguity CWCTC Winter 2006 80 CL Geometry Vincent Aleven CWCTC HS Winter 2006

Contiguity-CWCTC Contiguity CWCTC Fall 2006 102 CL + CTAT Geometry Kirsten Butcher, Vincent Aleven CWCTC Fall 2006 6/21: Reloaded.

07/18/07: v4.13 Loaded on test machine for reload verify
7/23: Received missing CTAT data from Frank. Kyle to work on 7/24.
7/26: Missing CTAT uploaded to production.
11/14 reloaded CL data, CTAT data needs reload


Contiguity-CWCTC Contiguity CWCTC Spring 2007 162 CL Geometry Vincent Aleven CWCTC Spring 2006 11/14: Loaded to production

Contiguity-CWCTC Contiguity CMU Winter 2007 113 CL Geometry Kirsten Butcher, Vincent Aleven CMU Winter 2007

6/21: Reloaded.
11/14/07: Reloaded with converter v4.13


Division Tutor division_study 98 CTAT-Flash Geometry? Stefan King Perrysville Elem Summer 2007

Does Treating Student Uncertainty as a Learning Impasse Improve Learning in Spoken Dialogue Tutoring? WOZ Uncertainty Adaptation 128 ITSPOKE Physics (intended) Kate Forbes-Riley; Diane Litman Lab experiment with Pitt students Winter-Spring 2006-7 7/31/07: Project & Dataset created.

Elementary Chinese Course ElemChineseFA06 75 OLI Chinese



Elementary Chinese Course ElemChineseSU07 96 OLI Chinese




Elementary Chinese Course ElemChineseSP07 83 OLI Chinese



Example Study - Freiburg and CMU Example Freiburg Spring 2006 77 CL Geometry Vincent Aleven CMU Spring 2006

Example Study - Freiburg and CMU Example CMU Summer 2006 88 CL-modified Geometry Vincent Aleven CMU Summer 2006

Example Study - Freiburg and CMU Example Freiburg Summer 2006 78 CL Geometry Alexander Renkl unknown Summer 2006

Example Study - Freiburg and CMU Example CWCTC Winter 2007 99 CL+CTAT Geometry Vincent Aleven CMU Summer 2006 6/19: Need to reload CL data. Need to reload CTAT data.

6/21: Reloading CTAT to QA machine. CL will follow.
6/21: Data reloaded to production.
6/25: Bad (not all students were anonymized correctly) CTAT logs removed from production.
7/18: CL data: v4.13 Loaded on test machine for reload verify by Octav.
7/24: CTAT data: ready to load
7/31: CTAT data reloaded.


Example Study - Freiburg and CMU Example Steel Valley Spring 2007 114 CL Geometry Vincent Aleven Steel Valley HS Spring 2006 6/19: Need to reload CL data.

6/21: Data reloaded to production.


Example Study - Freiburg and CMU Example CWCTC Spring 2007 125 CL-Lisp tutor (no CTAT) Geometry Vincent Aleven, Ron Salden CWCTC & Wilkinsburg Spring 2007 07/06/07: Will receive from CL week of 7/10/07

07/09/07: Octav received data
07/18/07: Loaded on cooker
07/21/07: QUESTION: is this 3 datasets or 1 with 3 schools?
7/29/07: Loaded to production.

11/14: reloaded onto production.


Example Study - Freiburg and CMU Example Wilkinsburg Spring 2007 124 CL-Lisp tutor (no CTAT) Geometry Vincent Aleven, Ron Salden CWCTC & Wilkinsburg Spring 2007 7/29/07: Loaded to production.
11/14: reloaded onto production

Example Study - Freiburg and CMU Example Freiburg Spring 2007 127 CL-Lisp tutor (no CTAT) Geometry Vincent Aleven, Ron Salden Gymnasium Spring 2007 07/10/07: Octav verifying conversion

7/29/07: Loaded to production.


Fostering fluency in second language learning: Testing two types of instruction Repetition in fluency training, Study 1 129 4/3/2 training English De Jong English Language Institute Fall 2006 7/31/07: Dataset created on production. No files yet.

Fostering fluency in second language learning: Testing two types of instruction Formulaic sequences in fluency training, Study 2 130 4/3/2 training English De Jong English Language Institute Spring 2007 7/31/07: Dataset created on production. No files yet.

Fostering fluency in second language learning: Testing two types of instruction Formulaic sequences in fluency training, Study 3 131 shadowing training English De Jong English Language Institute Spring 2007 7/31/07: Dataset created on production. No files yet.
21 French Course FrenchLanguag2 82 OLI French



22 French Course FrenchLanguage 74 OLI French



23 French Course FrenchLanguage2 81 OLI French




French Course pitteiffel 151 OLI French




French Course toureiffel 152 OLI French



28 Geometry Course Geometry Angles - North Hills Spring 2003 68 CL Geometry Vincent Aleven North Hills HS Spring 2003 Paper: Aleven & Koedinger, 2002 in Cognitive Science?? Or Popescu
29 Geometry Course Hampton Fall 2005 66 CL Geometry Vincent Aleven Hampton HS 2005-2006 Redoing this dataset with bug fixes in Octav's converter.
11/14: Reloaded onto production
38 Geometry Course Geometry-AllStudents 6 CL Geometry Ken Koedinger unknown unknown Bad Version
30 IERI: Learning Oriented Dialogue Project Learning Oriented Dialogue Project - original 62




2 Improving Algebra Learning and Collaboration PTS Algebra I 2005 112 CL-2005 Algebra Bruce McLaren, Erin Walker
2005-2006 raw data files only

Improving Skill at Solving Equations via Better Encoding of Algebraic Concepts Corrective Self Explanation 147, 149 CL-2006 (Munger) + CTAT Algebra Julie Booth Golden Valley High School Feb, 2007 07/06/07: Being QA'd by Tristan

8/13/07: CTAT files received from Frank. Need to insert course_name, check for errors.
8/15/07: CTAT logs loaded to production. Some dataset info updated as well.
9/21/07: Munger failure due to CTAT logs already loaded. Need to clean out CTAT logs and try again.
9/22/37: Data loaded to production.
9/25/07: CTAT logs loaded to production (were removed for munger reload)
1/18/2007: reoaded to production

24 Improving cultural learning by predicting in French film FrenchOnline 32 CTAT? French Amy Ogan CMU Fall 2005
25 Improving cultural learning by predicting in French film FrenchTutor_Demo 16 CTAT? French Amy Ogan CMU Spring 2005
26 Improving cultural learning by predicting in French film French_Culture_Tutor 9 CTAT? French Amy Ogan CMU Spring 2005
27 Improving cultural learning by predicting in French film French_Culture_Tutor_Fall_2005 8 CTAT? French Amy Ogan CMU Fall 2005

Improving cultural learning by predicting in French film French Discussion Board 154 CTAT French Amy Ogan CMU Spring 2007 8/2/07: Received log files from Jonathan Sewall. Will load to QA.

8/07/07: Files loaded to QA. Some still need fixed (contain invalid XML)
8/16/07: All CTAT logs free of errors. All have been loaded to QA - waiting for researcher verification.
8/24/07: Need to get logs from Discussion Board from Erin Walker. Let's ping them again late Sept.
9/25/07: Files are on QA - waiting for go-ahead to move to prod from the researchers.


Improving cultural learning by predicting in French film French Discussion Board ITS 155 CTAT French Amy Ogan CMU Spring 2007

Improving cultural learning by predicting in French film French Verb Tutor 156 CTAT French Amy Ogan CMU Spring 2007
31 Intelligent Writing Tutor iwt_course 86
english Ruth Wylie


32 Intelligent Writing Tutor iwt retention course 89
english Ruth Wylie

10/9/2007: this dataset is garbage.
32 Intelligent Writing Tutor eli_study 148
english Ruth Wylie Fall 1007
10/9/2007: should be renamed to IWT Article Tutor Level 5 Study Fall 2007 when data collection is complete

Knowledge Tracking Chinese Vocabulary Spring 2006 107 Phil's own Chinese Phil Pavlik CMU Spring 2006

Knowledge Tracking Chinese Vocabulary Spring 2007 161 Phil's own Chinese Phil Pavlik CMU Spring 2007 7/06/07: Phil working on conversion. ETA 4-6 weeks.

9/26/07: Loaded to the-cooker
10/9/07: Problem with the number of observations in Learning Curves. DS team looking into it.
11/12/2007: Data reloaded to bunny.
Loaded to production on 11/13/07

Knowledge Tracking French Vocabulary Spring 2007 168 FacT System French Phil Pavlik/Nora Presson
Spring 2007 07/06/07: Nora Presson to send data, no ETA yet.

8/13/07: Nora has import file ready. Waiting to receive from her to load onto the-cooker.
8/15/07: Data loaded to the-cooker. Vetting in process.
10/8/07: Problem with import file, Nora to reprocess and send to DataShop team.
11/12/07: Data reloaded to were-rabbit. Noticed problem with # students.
12/3/07: Nora to improve problem names in dataset
1/22/08: Data goes live!


Knowledge Tracking Spanish Vocabulary Spring 2006 108 Phil's own Spanish Phil Pavlik
Winter 2007

Knowledge Tracking Chinese Vocabulary Transfer Lab Study Spring 2006 115 Phil's own Chinese Phil Pavlik
Spring 2006 6/22: Loaded to production
33 MacWhinney Dictation Studies Chinese Dictation Fall 2005 73 unknown Chinese Brian MacWhinney unknown Fall 2005
34 MacWhinney Dictation Studies French Dictation Fall 2005 71 unknown French Brian MacWhinney unknown Fall 2005
35 MacWhinney Dictation Studies Spanish Dictation Fall 2005 72 unknown Other Brian MacWhinney unknown Fall 2005
36 OLI Statistics 07Meyer201 103 OLI Other
CMU 2007 Loaded on Jun-08-2007

Physics Course Physics - USNA - Spring 2007 157 Andes Physics Kurt VanLehn US Naval Academy Spring 2006 Waiting to receive files from Anders.

8/14/07: files on Cooker, waiting for verification.
09/28/07: Loaded on the cooker under v2.3, waiting for verification
10/26/07: Will reload to production once new release is live.
11/5/07: Loaded to production


Physics Joint Explanation - Electric Fields - Pitt - Spring 2007 163 Andes Physics Bob Hausmann US Naval Academy Spring 2007 10/26/07: Bob to send files to Anders for conversion.

10/30/07: Anders sent files to DataShop team.
11/2/07: Loaded to the cooker, waiting for verify.
renamed from previous "The Effects of Interaction on Robust Learning - Spring 2007"

11/14/07: Loaded onto production

39 Public Pre_Summer_School_01_Jun_05 10 CTAT-Flash Chemistry unknown unknown June 2005 Used to show DataShop features, example data only.

Robust Learning of Vocabulary REAP ELI Reading 4 Summer 2006 118 REAP English Maxine Eskenazi Pitt-ELI Summer 2006 3/29/2007 - Sent message to Michael, Maxine to determine status

4/4/07 - Waiting for Michael's studies to finish, he will then convert to DS format.
6/12/07 - Waiting for Michael to add <conditions> to the datasets
6/25/07 - Loading datasets to the-cooker
6/26/07 - Ready to load to production
6/27/07 - Loaded to production


Robust Learning of Vocabulary REAP ELI Reading 4 Spring 2006 117 REAP English Maxine Eskenazi Pitt-ELI Spring 2006 3/29/2007 - Sent message to Michael, Maxine to determine status

4/4/07 - Waiting for Michael's studies to finish, he will then convert to DS format.
6/12/07 - Waiting for Michael to add <conditions> to the datasets
6/25/07 - Loading datasets to the-cooker
6/26/07 - Reloading to the-cooker
6/27/07 - Ready to load to production
6/27/07 - Loaded to production


Robust learning with a Meta-Cognitive Tutor Help Tutor CWCTC Spring 2006 (cognitive) 134 CTAT+CL Geometry Ido Roll CWCTC HS 2005-2006

7/18/07: Files received from Octav, modifying DataShop code to improve import speed.
7/26/07: Loaded to the-cooker.
8/13/07: Loaded onto production!


Robust learning with a Meta-Cognitive Tutor Help Tutor CWCTC Spring 2006 (meta) 135 CTAT+CL Geometry Ido Roll CWCTC HS 2005-2006

7/18/07: Files received from Octav, modifying DataShop code to improve import speed.
7/26/07: Loaded to the-cooker.
8/13/07: Loaded onto production!


Robust learning with a Meta-Cognitive Tutor Short Hints Wilkinsburg Spring 2006 (cognitive) 137 CTAT+CL Geometry Ido Roll Wilkinsburg HS 2005-2006

7/18/07: Files received from Octav, modifying DataShop code to improve import speed.
7/26/07: Loaded to the-cooker.
8/13/07: Loaded onto production!


Robust learning with a Meta-Cognitive Tutor Short Hints Wilkinsburg Spring 2006 (meta) 136 CTAT+CL Geometry Ido Roll Wilkinsburg HS 2005-2006

7/18/07: Files received from Octav, modifying DataShop code to improve import speed.
7/26/07: Loaded to the-cooker.
8/13/07: Loaded onto productionnot!


Robust learning with a Meta-Cognitive Tutor
Help Tutor - Angles and Quads - 2006
159
CTAT only
Geometry
Ido Roll

2005-2006
11/12/07: CTAT logs loaded to production. 
44 Stoichometry Study SummerSchool2005 11
Chemistry Bruce McLaren


45 Stoichometry Study Winter_Workshop01 59
Chemistry Bruce McLaren


46 Stoichometry Study PSLC Stoichiometry Study Demo 3
Chemistry Bruce McLaren


47 Thermodynamics Thermo Fall 2005 61 unknown Other Vincent Aleven unknown Fall 2005

Training oral production in learning second language grammar The order of French pronouns 132 Flash online audio recording French De Jong Regular French courses at Pitt and CMU Spring and Summer 2007 7/31/07: Dataset created on production. No files yet.

Training oral production in learning second language grammar The use of French conditionals 133 Flash online audio recording French De Jong Regular French courses at Pitt and CMU Spring and Summer 2007 7/31/07: Dataset created on production. No files yet.
48 Unclassified TWS_Group_01 65





49 WPI-Assistments Assistments - 8th Grade Math - 2004-2005 (200 students) 90 Assistments Other Neil Heffernan Mass Public 2004-2005 6/20: Need to reload.
6/25: Reloaded.
50 WPI-Assistments Assistments - 8th Grade Math - 2004-2005 (762 students) 92 Assistments Other Neil Heffernan Mass Public 2004-2005 6/20: Need to reload.
6/25: Ready to load (on production)
6/26: Reloaded.
LFA - OutOfMemory on all 4 KC models.

WPI-Assistments Assistments - 8th Grade Math - 2005-2006 (200 students) 119 Assistments Other Neil Heffernan Mass Public 2005-2006 Loaded on 7/03/07
10 WPI-Assistments Assistments - 8th Grade Math - 2005-2006 (1582 students) 120 Assistments Other Neil Heffernan Mass Public 2005-2006 Loaded on 7/03/07

READY TO GO LIVE

(datasets that are ready to be loaded to learnlab.web, highest priority on top)


Project Dataset Name Tool LearnLab P.I. School(s) Date Notes/Status Remarks
None at the moment.

IN TESTING

(datasets that have been loaded and are to be reviewed for correctness, highest priority on top)


Project Dataset Name Tool LearnLab P.I. School(s) Date Notes/Status Remarks

Algebra I Course Algebra I 2006-2007 CL Algebra - Mungable Algebra Albert Corbett CWCTC, Hampton, Wilkinsburg, 3 LA schools 2006-2007 school year Waiting for harvest from CL. Also need a new version of munger to load.
Loaded to the-cooker, had multiple errors on load.

Robust Learning of Vocabulary Fall 2006 Personalization Study REAP Tutor English Juffs English Language Institute Fall 2006 No ETA from Michael yet.

9/25/07: Sent follow-up email to Michael
10/03/07: Waiting for files from REAP Team.
12/10/07: Loaded to the-cooker for verification.


Robust Learning of Vocabulary Spring 2007 Reading 4 REAP Tutor English Juffs English Language Institute Spring 2007 No ETA from Michael yet.

9/25/07: Sent follow-up email to Michael
10/03/07: Waiting for files from REAP Team.
12/10/07: Loaded to the-cooker for verification.


Robust Learning of Vocabulary Summer 2007 Word Sense and Pronunciation Audio Study REAP Tutor English Juffs English Language Institute Summer 2007 No ETA from Michael yet.

9/25/07: Sent follow-up email to Michael
10/03/07: Waiting for files from REAP Team.
12/10/07: Loaded to the-cooker for verification.

IN PROGRESS

(datasets that require additional conversion work or tweaking, highest priority on top)


Project Dataset Name Tool LearnLab P.I. School(s) Date Notes/Status Remarks

Geometry Course Geometry 2005-2006 CL-2005 (lisp) Geometry Vincent Aleven CWCTC, Hampton, Wilkinsburg HSs 2005-2006 Octav needs to convert this data into XML.

UPCOMING

(datasets we expect to receive soon - may or may not require additional processing on our part)


Project Dataset Name Tool LearnLab P.I. School(s) Date Notes/Status Remarks

Algebra Bridge to Algebra 2005/2006 CL-Munger Algebra


6/6/07: Data is good to munge.

Request sent to CL to break the dataset into smaller pieces
07/09/2007: Confirmed that the munger can load sections only. Steve Ritter wishes to speak with ken/kurt to cover legal issues with this dataset.

7 Stoichiometry VLAB data from Stoich studies
Chemistry Bruce McLaren

Oliver Scheuer to oversee conversion in Germany.

6/20/07: Mike Karabinos to work up examples for each type of VLAB Action, DataShop to fill in dummy and real <tutor_message>. Will then send this info to Oliver.
7/27/07: Kyle completed transformation to XML. Sent out for review.
8/14/07: Working on storage of replay data - waiting for response from CTAT and Germany
9/25/07: Jonathan Sewall wants to see what logs look like in DataShop.
10/8/07: Alida loaded sample Vlab dataset to QA (Dataset: Chemistry VLAB Example Data Fall 2007).

8 ESL ESL data




7/27/07: Moving the Online Search System to PSLC servers in the near future. Will tie in to DataShop with single-sign on. No current plan to move any ESL data into DataShop itself.

Improving Algebra Learning and Collaboration PTS Algebra I 2005 CL Algebra Tutor Algebra Erin Walker  ? 2005-2006 (not sure) Researcher has raw logs and has written a converter for analysis. Need to find out if it's in DataShop format.

Improving Algebra Learning and Collaboration PTS Algebra I Spring 2007 CL Algebra Tutor Algebra Erin Walker CWCTC Spring 2007 Researcher has raw logs is responsible for submitting to DataShop for import.

7/06/07: Frank sent anonymized raw logs to Erin.


Geometry Course Geometry 2006-2007 CL Geometry Geometry Vincent Aleven CWCTC, Hampton, Wilkinsburg 2006-2007 school year Waiting for harvest from CL. Will need a conversion from Octav.










Physics Physics - USNA - Spring 2006 Andes Physics Kurt VanLehn US Naval Academy Spring 2006 10/26/07: Check with Anders to see when DataShop can get this data.

Fluency Study Fluency Study 4 - Winter 2008 CTAT ESL Nel de Jong CMU Winter 2008

Middle School Tutor (Full year) Middle School Tutor - 2002-2003 CL-Middle School (proto BTA) n/a Ryan Baker (Albert Corbett) CMU July 2008 Waiting on bug: DataShop Import Verify Tool hanging on data set

Middle School Tutor (Gaming studies) Middle School Tutor - 2003-2005 CL-Middle School (proto BTA) n/a Ryan Baker CMU July 2008 Waiting on Middle School Tutor (Full year) data set to be successfully imported

Octav to regenerate all of his datasets?