LIPID Metabolites And Pathways Strategy
GPStrGen.pl - Generate structures for Glycerophospholipids (GP)
GPStrGen.pl GPAbbrev|GPAbbrevFileName ...
GPStrGen.pl [-h, --help] [-o, --overwrite] [-r, --root rootname] [-w, --workingdir dirname] <arguments>...
Generate Glycerophospholipids (GP) structures using compound abbreviations specified on the command line or in a CSV/TSV text file. All command line arguments represent either compound abbreviations or names of files containing abbreviations. Use the mode option to control how the command line arguments are interpreted.
An SD file, containing structures for all GP abbreviations along with ontological information, is generated as output.
Current support for GP structure generation includes these main classes and subclasses:
o Glycerophosphocholines (PC)
o Glycerophosphoethanolamines (PE)
o Glycerophosphoserines (PS)
o Glycerophosphoglycerols (PG)
o Glycerophosphoglycerophosphates (PGP)
o Glycerophosphoinositols (PI)
o Glycerophosphoinositol monophosphates (PIP)
o Glycerophosphates (PA)
o Glyceropyrophosphates (PPA)
o Glycerophosphonocholines (PnC)
o Glycerophosphonoethanolamines (PnE)
-h, --help: Print this help message.
The mode option controls interpretation of command line arguments. Two different methods are provided: specify compound abbreviations or a file name containing compound abbreviations. Possible values: Abbrev or AbbrevFileName. Default: Abbrev
In AbbrevFileName mode, a single line in CSV/TSV files can contain multiple compound abbreviations. The file extension determines the delimiter used to process data lines: comma for CSV and tab for TSV. For files with a TXT extension, only one compound abbreviation per line is allowed.
The wild card character, *, is also supported in compound abbreviations.
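The extension rule described above can be sketched in Python. This is only an illustration of the stated behaviour, not the script's own code: the function name `read_abbreviations`, the case-insensitive extension check, the skipping of blank cells, and the sample abbreviations are all assumptions.

```python
import csv
import io
from pathlib import Path

# Delimiter per file extension, as stated in the text: comma for CSV, tab for TSV.
DELIMITERS = {".csv": ",", ".tsv": "\t"}


def read_abbreviations(file_name, text):
    """Return the compound abbreviations found in `text` (hypothetical helper).

    `file_name` is used only for its extension; `text` is the file content.
    """
    ext = Path(file_name).suffix.lower()  # assumption: extension is case-insensitive
    abbrevs = []
    if ext in DELIMITERS:
        # CSV/TSV: a single line may hold several abbreviations.
        reader = csv.reader(io.StringIO(text), delimiter=DELIMITERS[ext])
        for row in reader:
            abbrevs.extend(cell.strip() for cell in row if cell.strip())
    elif ext == ".txt":
        # TXT: exactly one abbreviation per line.
        for line in text.splitlines():
            if line.strip():
                abbrevs.append(line.strip())
    else:
        raise ValueError(f"unsupported extension: {ext!r}")
    return abbrevs


csv_abbrevs = read_abbreviations("lipids.csv", "PC(16:0/18:1(9Z)),PE(16:0/18:0)\n")
tsv_abbrevs = read_abbreviations("lipids.tsv", "PC(16:0/0:0)\tPS(18:0/18:0)\n")
txt_abbrevs = read_abbreviations("lipids.txt", "PA(16:0/16:0)\nPG(18:0/18:0)\n")
```

Abbreviations that themselves contain commas would have to be quoted in a CSV file for this sketch to keep them intact; how the script itself treats that case is not stated here.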
Examples:
When the wild card character is used, + and - can also be applied to chain lengths to indicate even and odd lengths at sn1/sn2/sn3 positions; additionally, the > and < qualifiers can be used to specify length requirements. Examples:
The default sn2 stereochemistry is R. However, the abbreviation format also supports these additional stereochemistry specifications for the sn2 position: S; U (unknown); rac (racemic mixture). Examples:
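As a rough illustration of what these qualifiers mean (not of the abbreviation syntax, and not the script's own code), the Python sketch below filters candidate chain lengths. The function name `chain_length_matches`, its parameters, the reading of + as even and - as odd, the strict inequalities, and the candidate range are assumptions.

```python
def chain_length_matches(length, parity=None, longer_than=None, shorter_than=None):
    """Check one chain length against the qualifiers (hypothetical helper).

    parity: "+" for even lengths, "-" for odd lengths, None for any length.
    longer_than / shorter_than: strict bounds, mirroring > and <.
    """
    if parity not in (None, "+", "-"):
        raise ValueError(f"unknown parity qualifier: {parity!r}")
    if parity == "+" and length % 2 != 0:
        return False
    if parity == "-" and length % 2 != 1:
        return False
    if longer_than is not None and not length > longer_than:
        return False
    if shorter_than is not None and not length < shorter_than:
        return False
    return True


# Candidate chain lengths 8..22, chosen only for the illustration.
candidates = range(8, 23)

# Even chain lengths larger than 10.
even_over_10 = [n for n in candidates
                if chain_length_matches(n, parity="+", longer_than=10)]

# Odd chain lengths larger than 10.
odd_over_10 = [n for n in candidates
               if chain_length_matches(n, parity="-", longer_than=10)]
```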
-o, --overwrite: Overwrite existing files.
-r, --root rootname: New file name is generated using the root: <Root>.sdf. Default for new file names: GPAbbrev.sdf, <AbbrevFileName>.sdf, or <FirstAbbrevFileName>1To<Count>.sdf.
-w, --workingdir dirname: Location of working directory. Default: current directory.
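One possible reading of this naming rule, sketched in Python. The function `output_file_name` is hypothetical, and the sketch assumes that the three defaults correspond to Abbrev mode, a single abbreviation file, and several abbreviation files respectively, and that the input file's extension is dropped before `.sdf` is appended.

```python
from pathlib import Path


def output_file_name(mode, arguments, root=None):
    """Return the SD file name for a run (hypothetical helper, one reading of the rule)."""
    if root:
        # An explicit root always wins: <Root>.sdf
        return f"{root}.sdf"
    if mode == "Abbrev":
        # Abbreviations given directly on the command line.
        return "GPAbbrev.sdf"
    # AbbrevFileName mode: name derived from the first input file (extension dropped).
    first = Path(arguments[0]).stem
    if len(arguments) == 1:
        return f"{first}.sdf"
    return f"{first}1To{len(arguments)}.sdf"


with_root = output_file_name("Abbrev", ["PC(16:0/0:0)"], root="GPStructures")
abbrev_default = output_file_name("Abbrev", ["PC(16:0/0:0)"])
one_file = output_file_name("AbbrevFileName", ["Lipids.csv"])
three_files = output_file_name("AbbrevFileName", ["A.csv", "B.tsv", "C.txt"])
```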
On some systems, command line scripts may need to be invoked using perl -s GPStrGen.pl; however, all the examples assume that direct invocation of the command line script works.
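For callers that drive the script from another program, the two invocation styles can be captured in a small Python helper that only assembles the argument list. This is a sketch under assumptions: `build_command` and its parameters are hypothetical, the option letters are taken from the synopsis, and the abbreviation `PC(16:0/18:1(9Z))` is an illustrative value rather than one of the script's documented examples.

```python
def build_command(abbreviations, root=None, overwrite=False,
                  workingdir=None, via_perl=False):
    """Assemble an argument list for GPStrGen.pl (hypothetical helper)."""
    # Direct invocation, or the "perl -s" form for systems that need it.
    command = ["perl", "-s", "GPStrGen.pl"] if via_perl else ["GPStrGen.pl"]
    if overwrite:
        command.append("-o")
    if root:
        command += ["-r", root]
    if workingdir:
        command += ["-w", workingdir]
    # Abbreviations (or file names) come last, one list item each.
    command += list(abbreviations)
    return command


direct = build_command(["PC(16:0/18:1(9Z))"], root="GPStructures", overwrite=True)
through_perl = build_command(["PC(16:0/18:1(9Z))"], root="GPStructures",
                             overwrite=True, via_perl=True)
```

Passing such a list to `subprocess.run` hands each abbreviation to the script as a single argument, so the parentheses need no shell quoting.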
To generate a GPStructures.sdf file containing a structure specified by a command line GP abbreviation, type:
To generate a GPStructures.sdf file containing structures specified by command line GP abbreviations, type:
To generate a GPStructures.sdf file containing structures specified by command line GP abbreviations with specific stereochemistry, type:
To enumerate all possible GP structures and generate a GPStructures.sdf file, type:
or
or
To enumerate all possible GP structures with a sn1 chain, and generate a GPStructures.sdf file, type:
To enumerate all possible GP structures with a sn1 chain containing one double bond, and generate a GPStructures.sdf file, type:
To enumerate all possible GP structures with even chain length larger than 10 at sn1 position, and generate a GPStructures.sdf file, type:
To enumerate all possible GP structures with odd chains longer than 10 at sn1 and even chains longer than 18 at sn2, and generate a GPStructures.sdf file, type:
CLStrGen.pl, FAStrGen.pl, GLStrGen.pl, SPStrGen.pl, STStrGen.pl
Copyright (C) 2006-2008. The Regents of the University of California. All Rights Reserved.