Saves data structures in Genstat, Excel, R, Quattro, dBase, SPlus, Gauss, MatLab, SAS, Instat, Image or text files (D.B. Baird).
Options
PRINT = string token |
What to print (summary ); default summ |
---|---|
OUTFILE = text |
Data file to be written |
METHOD = string token |
Action to take if the file already exists (add , append , concatenate , merge , overwrite , prompt , fail , replace ); default prompt in interactive mode, fail in batch mode |
PLAINNAMES = string token |
Whether to leave the column names in the file in plain form rather than decorating them with the column type information i.e. ! for factors, :D for dates etc (yes , no ) default no |
SHEETNAME = text |
Name of new sheet to be added to an existing Excel file |
NONAMES = string token |
Whether to suppress column names in output to spreadsheet or text file (yes , no ); default no |
TITLE = text |
Description for spreadsheet |
READONLY = string token |
Whether to define the complete sheet as read only (yes , no ); default no |
ANALYSIS = text |
Genstat commands to analyse columns in the spreadsheet |
ASETUP = text |
Genstat commands to be run once before the analysis of any columns in the spreadsheet |
ADUMMY = text |
The name of the dummy (if any) used the ANALYSIS commands |
CSVOPTIONS = string tokens |
Options for CSV files (noquotes , pack , round , fixed , align ); default pack |
HTMLOPTIONS = string token |
Options for HTML files (allowformats , nogrid , centre , rightjustify ); default * i.e. none |
COLMATCH = string token |
How to match columns when appending (name , position ); default posi |
GROUPS = factor or text |
Identifier for the factor, or text containing the name of the factor, to identify appended sections in the output file |
GLABEL = texts |
Labels for the GROUPS factor for the current appended section, and also for the original section if no previous sections have been appended |
MATCH = texts, variates or pointers |
Up to four DATA variables to use as keys when METHOD=merge ; default * uses the first DATA variable |
WITH = texts, variates or pointers |
Columns in the file to use as keys when METHOD=merge ; default * uses as many columns of the initial columns in the file as are needed to give a column for each MATCH column |
UPDATE = string token |
Whether to use columns with matching names to replace existing columns when concatenating or merging (yes , no ); default no changes the names of columns with the same name as existing columns so that they become unique |
OUTOPTIONS = text |
Optional output file arguments to be passed to the Dataload.dll |
ROWCOLOURS = factor |
The factor to be used for colouring the rows (the factor must have colours defined by the FACCOLOURS parameter) |
TABLEFORMAT = string token |
The format to use when displaying tables with two or more classifying factors (page , column ); default page |
MISSING = text |
String to represent a numerical missing value when writing to a text file (.TXT , .TAB or .CSV) or a spreadsheet file (Excel, Quattro or Open Office); default is to use '*' in .TXT or .TAB files, and leave cells with missing values empty in .CSV or spreadsheet files |
DELETESHEETS = string token |
Whether to delete sheets if you are overwriting a multiple paged file with a single page (always , never , prompt ); default prom when running interactively and neve when running in batch |
NONASCII = string token |
Specifies how to output non-ASCII characters to text files (
utf8 , unicode ); default utf8 |
TIMEOUT = scalar |
Number of seconds to wait when a file is open in another process; default 10
|
Parameters
DATA = identifiers |
The data structures to be written to the file, these must be compatible (i.e. of the same length) |
---|---|
COLUMNS = texts |
Names for the columns to be saved |
PROTECT = scalars |
Whether the column is to be defined as read only when option READONLY=no (yes , no ); default no |
FACCOLOURS = variates, texts or pointers |
Specifies background colours for factor columns |
FOREGROUND = variates, texts, scalars or pointer |
Specifies foreground colours for columns |
BACKGROUND = variates, texts, scalars or pointer |
Specifies background colours for columns |
DECIMALS = variates or scalars |
Specifies numbers of decimals for the columns |
Description
EXPORT
saves the data structures specified by the DATA
parameter to a disk file specified by the OUTFILE
option in a foreign data format specified by the extension of the file name. The available extensions are: .XLS
for Excel, .XLSX
for Excel 2007-13, .WQ1
for Quattro, .ODS
for Open Office Spreadsheet, .DBF
for dBase, .FMT
for Gauss, .SDD
for SPlus, .RDA
for R, .TPT
for SAS transport, .WOR
for Instat, .MAT
for MatLab, .ARFF
for Weka Attribute, .TXT
for plain ASCII text, .CSV
for comma delimited text, .TAB
for tab delimited text, .HTM
for a HTML table, .RTF
for Word Rich text format, .GSH
for Genstat spreadsheet, .GWB
for Genstat work book, and .BMP
, .EMF
, .GIF
, .JPG
, .TIF
, .PNG
or .PSD
for an image file. An image file can be created either from single matrix containing RGB colour values, or three columns of variates or factors columns (specifying x-coordinates, y-coordinates and RGB colour values), or five columns of variates or factors columns (specifying x-coordinates, y-coordinates and red, green and blue colour values). The coordinate (0, 0) corresponds to the top left corner of the image, and the y-values increase as you move down the image.
Note that, if you save a file in .XLS format (Excel 2- 2003 file format) from Genstat and then open it in Excel 2010 or later versions, you will get a warning. Excel 2010 or later versions always do this when asked to open a file in .XLS format that was not saved by Excel. The warning will say
Office has detected a problem with this file. Editing it may harm your computer. Click for more details.
If you click this warning and then click the Edit Anyway
button, the file will open as expected with no further issues. Saving the file with Excel will stop this happening in the future. However, if you are using Excel 2010 or later versions, it is always best to use the Excel 2007-13 .XLSX file format, and then this will not happen.
The SHEETNAME
option lets you specify the name of the sheet to add to an Excel file, rather than using the default 'Genstat data'
. The name should only contain letters, numbers and spaces.
The METHOD
option controls how EXPORT
behaves when asked to overwrite an existing file. The available settings are add
, append
, concatenate
, merge
, overwrite
, prompt
, fail
and replace
, with a default of prompt
in interactive mode, and fail
in batch mode. The following example shows how METHOD=add
can be used to build up pages in a Genstat workbook file:
TEXT OutFile; VALUE='Results.GWB'
EXPORT [OUTFILE=OutFile; METHOD=overwrite; SHEET='Maximums']\
MaxLane,MaxLoc,WarpLoc,MaxAmp
EXPORT [OUTFILE=OutFile; METHOD=add; SHEET='Parameters']\
ParLane,ParLoc,ParSig,ParAmp
EXPORT [OUTFILE=OutFile; METHOD=add; SHEET='Components']\
PeakLane,PeakLoc,PeakSig,PeakAmp,PeakHt
EXPORT [OUTFILE=OutFile; METHOD=add; SHEET='Lanes']\
eX,eLane[]
EXPORT [OUTFILE=OutFile; METHOD=add; SHEET='Warping'] eX,W[]
The append
setting lets you append new values to an existing page. The GROUPS
option can define a factor column in the output file to identify the blocks of values that are appended; this can be set either to an existing factor (whose identifier will then be used) or a text containing the name to be used for that column in the file. The GLABEL
option can supply labels for the appended blocks. On the first append, this may be set to two values, where the first value identifies the new (appended) block, and the second identifies the original block of values. The COLMATCH
option controls whether the columns are matched by name or position. For example:
CALCULATE X1,Y1,Z1 = GRNORMAL(1000; 0; 1)
EXPORT [OUTFILE='Test.gsh'; METHOD=overwrite] X1,Y1,Z1
CALCULATE X2,Y2,Z2 = GRNORMAL(100; 3; 4)
EXPORT [OUTFILE='Test.gsh'; METHOD=append; GROUPS=Source;\
GLABEL=!T('Contaminated','Standard')] X2,Y2,Z2
If METHOD=concatenate
, the new data are added as new columns on the right-hand side of an existing page. The new data can also be added as new columns on the right-hand side the page by setting METHOD=merge
. The DATA
variables are now merged with the original rows using up to four key columns specified by the MATCH
and WITH
options (for the new and original rows, respectively). If MATCH
is not specified, the first DATA
variable is used. If WITH
is not specified, the MATCH
variables are matched with the same number of initial columns in the page. If a column with the same name already exists in the page when concatenating or merging, the default action is to rename the new column by adding a number to the end of the name to make it unique. Alternatively, if you set option UPDATE=yes
, the new column will replace the existing column.
If METHOD=replace
, then for Genstat spreadsheet files or Excel .XLSX files, the sheet may replace an existing sheet within the file. The name of the sheet to be replaced must be supplied by the SHEETNAME
option. If no matching sheet is found, the sheet will be added to the file.
The DELETESHEETS
option controls what happens when you are adding data to a file, containing multiple sheets, that is in a format does not support the updating of one page (e.g. older Excel .XLS, Quattro or Open Office files). The settings are:
always |
always delete the sheets that are not being updated, |
---|---|
never |
give a fault if the file contains multiple sheets, |
prompt |
in an interactive run, prompt to check whether the sheets should be deleted, or a fault should be given and the file left unchanged. |
In an interactive run the default is prompt
, and in a batch it is never
.
The CSVOPTIONS
option controls aspects of the output to CSV files:
noquotes |
suppresses the use of quotes around text, |
---|---|
pack |
removes any spaces around the columns to give a more compact but less readable file, |
round |
rounds numerical values to 6 significant figures, |
fixed |
writes the numerical values without using scientific notation, and |
align |
adds spaces to align the columns to make the file more readable. |
The HTMLOPTIONS
option controls the format of a table written to an HTML file:
allowformats |
interprets HTML format characters in cells (/& etc) as formats rather than including them as as literal text, |
---|---|
nogrid |
suppresses the grid between cells, |
centre |
centres the information within each cell, and |
right |
right-justifies the information. |
If neither centre
or right
are selected, the information in each cell will be left
justified.
The OUTOPTION
option allows extra options to be passed to Dataload.dll
. See IMPORT
for details.
The TABLEFORMAT
option controls how tables with two or more classifying factors are stored in spreadsheet files, with settings:
page |
to put each table onto a separate page, with the last classifying factor displayed across the columns, and |
---|---|
column |
to put each table into a single column, so that several tables are displayed on a single page. |
The default is TABLEFORMAT=page
.
The NONASCII
option specifies how to output non-ASCII characters to a text file: either in UTF-8 format (default), or in Unicode.
The TIMEOUT
option specifies the number of seconds to wait when a file that needs to be deleted or replaced is open in another process; default 10. This allows time for anti-virus and disk synchronization programs to finish their processing.
The COLUMNS
parameter can specify names for the columns to be saved. The setting is a text with a single line except for a matrix, where it should have a line for each column and also an extra initial line if the matrix has row labels.
The PLAINNAMES
option lets you suppress the additional type information that Genstat adds by default to the column names (!
for factors, :D
for dates etc). Alternatively, you can set option NONAMES=yes
to suppress the names altogether.
The MISSING
option lets you specify the string to use to represent a numerical missing value when writing to a text file (.TXT
, .TAB
or .CSV)
or a spreadsheet file (Excel, Quattro or Open Office). If MISSING
is not set, the string '*'
is used in .TXT
and .TAB
files, while in spreadsheet and .CSV
files the cell is left empty. Missing text values are always output as empty strings.
The TITLE
option can supply a text containing a title or description of the spreadsheet. This will be saved in a GSH or GWB file, and will be the heading of an HTML file.
You can set option READONLY=yes
to make the entire spreadsheet read-only (so that its contents cannot be changed). Alternatively, you can use the PROTECT
parameter to protect any individual column (by making it read-only). Settings of the PROTECT
parameter override the setting of the READONLY
option.
The ANALYSIS
option can supply a text containing Genstat commands to analyse columns in the spreadsheet. The ASETUP
option can similarly define commands that should be to be run once before the analysis of the columns, and the ADUMMY
option can be used to define the name of the dummy (if any) used in the ANALYSIS
commands.
The colours displayed in the cells of a spreadsheet can be controlled by using the FOREGROUND
and BACKGROUND
parameters to specify the foreground and background colours of the cells in each column. The setting can contain either colour names or RGB values (see the PEN
directive for details). You can specify a scalar or a text of length one if all the cells in a column have the same colour. You can specify a variate or text with several values to define different a colour for each cell. Finally you can specify a single pointer to a set of variates or texts if the corresponding DATA
setting will need several columns in the spreadsheet. Alternatively, you can specify the background columns for factor columns using the FACCOLOURS
parameter. This should be set to a variate or text with the same number of values as the number of levels of the factor. You can apply the colours defined for background of each cell of a factor to the cell’s complete row by setting the ROWCOLOURS
option to the identifier of the factor. A missing value, empty string or undefined setting for any of these parameters will retain the default colour for the foreground or background.
The DECIMALS
parameter lets you specify the number of decimal places to use for columns. When saving a Genstat or Excel file the columns will be displayed with that precision, but saved with full precision. However, with a text file (.TXT
, .TAB
or .CSV)
, the values will be rounded to that number of decimal places, and precision will be thus be lost. The default for text files, when DECIMALS
is unset, is to save the data with full precision (15 significant figures).
When DATA
contains a pointer, the corresponding COLUMNS
setting should be a text of the same length as the pointer. The DECIMALS
setting can be either a scalar, or a variate of the same length as the DATA
pointer. Similarly, the FACCOLOURS
, FOREGROUND
and BACKGROUND
settings can be either a single variate, scalar or text, or a pointer containing the same number of variates, scalars and/or texts as the length of the DATA
pointer. For example:
EXPORT [OUT='Test.xls'] !P(U,V),X,!P(Y,Z);\
COLUMNS=!T(A,B),'C',!T(D,E); DECIMALS=!(2,2),3,!(4,5)
(Note: EXPORT
replaces the procedure %DSAVE
from earlier editions of Genstat.)
Options: PRINT
, OUTFILE
, METHOD
, PLAINNAMES
, SHEETNAME
, NONAMES
, TITLE
, READONLY
, ANALYSIS
, ASETUP
, ADUMMY
, CSVOPTIONS
, HTMLOPTIONS
, COLMATCH
, GROUPS
, GLABEL
, MATCH
, WITH
, UPDATE
, OUTOPTIONS
, ROWCOLOURS
, TABLEFORMAT
, MISSING
, DELETESHEETS
, NONASCII
.
Parameters: DATA
, COLUMNS
, PROTECT
, FACCOLOURS
, FOREGROUND
, BACKGROUND
, DECIMALS
.
Method
The procedure calls the FSPREADSHEET
procedure to create a temporary GSH file, which is translated to the required file type using the DATALOAD.DLL
library.
Action with RESTRICT
Any restrictions are ignored. However, if the restrictions on the structures are not consistent, a fault will occur.
See also
Procedures: FSPREADSHEET
, IMPORT
.
Commands for: Input and output.
Example
CAPTION 'EXPORT example'; STYLE=major FDELETE 'XYData.xlsx','XYData.rda','XYData.csv' VARIATE X; !(1...8) CALC [SEED=5827] Y1,Y2 = 1,2 + X*(1,1.5 - X/3,4)/2 + GRNORMAL(8;0;0.4) FACTOR [LABELS=!T('None','Some')] Group; !((1,2)4) TEXT Unit; VALUES = !T(A,B,C,D,E,F,G,H) TABULATE [CLASS=Group] Y1,Y2; MEANS=M1,M2 "Write data to an Excel file, overwriting an existing file" EXPORT [OUTFILE='XYData.xlsx'; SHEET='Time 1'; METHOD=overwrite] Unit,Group,X,Y1 "Fit a linear regression model to the data and save the fitted values and residuals" MODEL Y1; FITTEDVALUES=Fit; RESIDUALS=Residuals FIT [PRINT=*] Group*X "Add extra columns to the end of the sheet" EXPORT [OUTFILE='XYData.xlsx'; SHEET='Time 1'; METHOD=concatenate] Fit,Residuals "Add a second page to the Excel file, colouring the rows by group" EXPORT [OUTFILE='XYData.xlsx'; SHEET='Time 2'; METHOD=add; ROWCOLOURS=Group] \ Unit,Group,X,Y2; FACCOLOURS=!T('mistyrose','azure'); DECIMALS=2 "Add another page to the Excel file for the combined data" EXPORT [OUTFILE='XYData.xlsx'; SHEET='Time 1 + 2'; METHOD=add] Unit,Group,X,Y1 "Append the second time in the same page and add factor time to index these" EXPORT [OUTFILE='XYData.xlsx'; SHEET='Time 1 + 2'; METHOD=append; GROUPS='Time'] \ Unit,Group,X,Y2 "Add sheet containing tables of means" EXPORT [OUTFILE='XYData.xlsx'; SHEET='Means'; METHOD=add] M1,M2 "Write data to a R file" EXPORT [OUTFILE='XYData.rda'; METHOD=overwrite] Unit,Group,X,Y1,Y2 "Write data to a CSV file" EXPORT [OUTFILE='XYData.csv'; CSV=noquotes,pack,round] Unit,Group,X,Y1,Y2