The Data Repository is a software component that manages an RDF database and makes it available to other applications through a REST API, and gives end users specific views of the data. It adds role-based access control of varying granularity, transactional editing, custom treatment of ontologies and minimal/fast inference, and various administrative functions on top of the RDF database.
This page explains how it works on a host computer system, and how to install and maintain it. This page serves as an application administrator's manual for the development cycle.
The data repository is installed in two intentionally separate places on the host operating system:
Why do we want the home directory separated from the webapp? Mainly, because the webapp is prone to getting replaced completely when a new version is deployed; typically, the servlet container unpacks a new WAR file and replaces all of the old webapp. Any data files stored there would be lost. Since the configuration and data have to persist through many incarnations of the webapp, it is much safer to keep them in a separate place in the filesystem, outside of the entire servlet container hierarchy. Also, this way the webapp needs no modification after it is installed, simplifying the procedure for the system administrator.
Another advantage to the separate location is that it gives the system administrator more flexibility to assign that directory to a location with appropriate capacity, reliability, and performance.
The installed repository includes a set of command-line tools you will use for many of the administrative tasks. They are found in the etc/ subdirectory of the repository home directory. All of them respond to these two options:
--version - display which released version the tool came from
--help - display a synopsis of command args and switches

For example:
bash ${REPO_HOME}/etc/upgrade.sh --version
upgrade.sh from release 1.1-MS4.00 SCM revision 5422
bash
perl
curl
awk (surely anything that calls itself unix must have awk)
tr (seriously, is tr missing? if you are running Gentoo, install an operating system)

Note that only one instance of a Repository webapp may be run on a given home directory. This means that only one JVM and Servlet Container may access that home directory and RDF dataset at any one time. This is a restriction imposed by the Sesame triplestore.
It is not possible to "scale" performance of the repository by sharing the online RDF database among multiple machines or processes. It is possible to make periodic read-only snapshots of a database and serve them from separate machines, so long as you do not allow them to be changed.
The repository is distributed as a single Zip file. It contains a file README which identifies the software release it was built from. It is the artifact produced by the Maven project:
org.eagle-i:eagle-i-repository-dist
You need to determine the repository's home directory. It may be anywhere on the system so long as it satisfies these criteria:
We will call this directory REPO_HOME and it will appear in commands and scripts below as ${REPO_HOME}.
Create the repository home directory in your file system. It is useful to have a base eagle-i directory to place data and configuration used by other eagle-i applications. For example,
mkdir /opt/eaglei
mkdir /opt/eaglei/repo
If necessary (i.e. if you created it using your own user-id), change ownership of the directory to the user-id under which Tomcat is running. If you followed the example above, change the ownership of the two directories using the -R option. For example, if the user-id under which Tomcat executes is tomcat,
chown -R tomcat /opt/eaglei
Initialize it as a variable in your shell environment. In this example (Bourne/bash shell) the repository home directory is /opt/eaglei/repo:

REPO_HOME=/opt/eaglei/repo
Unpack the distribution Zip archive in a directory under /tmp:
cd /tmp
unzip repository-dist.zip
Move the contents of the unzipped directory to your repository home directory. In this example the distribution is version 1.1-MS1.00-SNAPSHOT:
mv /tmp/repository-1.1-MS1.00-SNAPSHOT/* ${REPO_HOME}/.
List the contents of the home directory:
cd ${REPO_HOME}
ls
It should contain the subdirectories etc/, lib/, and webapps/.
Determine the Java Servlet Container's home directory (e.g. Tomcat), which is usually dictated by your host OS. For example, it may be the 'tomcat' user's home directory, ~tomcat.
We will call this directory CATALINA_HOME and it will appear in commands and scripts below as ${CATALINA_HOME}.
Initialize it as a variable in your shell environment. In this example (in Bourne/bash shell) Tomcat's home directory is /opt/tomcat:

CATALINA_HOME=/opt/tomcat
Ensure that your Tomcat server is run with the following options on its JVM. The simplest way to accomplish this is to have the environment variable JAVA_OPTS include those options, but each platform, distro, package etc. of Tomcat has its own mechanism for setting this variable. For example, on Fedora 14, it should be in the file /etc/tomcat6/tomcat6.conf. If you can't find your distribution's configuration file, you may create a file setenv.sh in tomcat's bin directory to add the environment variable:

...(ONLY DO THIS if you can't find your distribution's config file)
cd ${CATALINA_HOME}/bin
touch setenv.sh
Edit the configuration file (tomcat6.conf, setenv.sh, or whatever your distribution uses) and add the following line:

JAVA_OPTS="-XX:PermSize=64M -XX:MaxPermSize=256M -Xmx1024m"
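If you took the setenv.sh route, the whole file can be as small as this sketch (the export is needed so Tomcat's startup scripts see the variable; the memory settings simply mirror the line above):

```shell
# Sketch of a minimal ${CATALINA_HOME}/bin/setenv.sh.
# ONLY create this if your distribution provides no config file of its own.
JAVA_OPTS="-XX:PermSize=64M -XX:MaxPermSize=256M -Xmx1024m"
export JAVA_OPTS
```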
Add the following two system properties to file conf/catalina.properties under the CATALINA_HOME directory -- the same directory where you'll find server.xml. The value for both of these properties is the absolute path of the repository home directory. In this example, it is /opt/eaglei/repo:

# example
org.eaglei.repository.home = /opt/eaglei/repo
derby.system.home = /opt/eaglei/repo
Look in your Tomcat installation's main lib directory. If there are no files named derby.jar or derby-version.jar, you must install the Derby jars from the "scripts" distribution, e.g.

cp ${REPO_HOME}/lib/derby-* ${CATALINA_HOME}/lib/
Are you already running applications which use a certain Apache Derby in your servlet container? If so, set the environment variable DERBY_HOME as documented by Apache; if not, leave it unset and the script will use its own version of Derby (the jars in its lib/ subdirectory):

Bourne/bash shell version:

....(ONLY DO THIS when ALREADY running Apache Derby!)
export DERBY_HOME=my-derby-installation-toplevel

C Shell/csh version:

....(ONLY DO THIS when ALREADY running Apache Derby!)
setenv DERBY_HOME my-derby-installation-toplevel
NOTE: You must use the same version of Derby to create this initial user database as the version installed in Tomcat, so if Tomcat is already running a version of Derby, set DERBY_HOME to use that.
Follow this step-by-step procedure. Before you start, make sure the Tomcat server is not running.
Navigate to Tomcat's webapps directory. If there exists a directory named ROOT, move it aside. The eagle-i repository must be the ROOT application.
cd ${CATALINA_HOME}/webapps
mv ROOT ROOT.original
Copy the repository webapp to the Tomcat webapps directory:
cp ${REPO_HOME}/webapps/ROOT.war ${CATALINA_HOME}/webapps/.
Create your initial administrative user login. Think of a USERNAME and PASSWORD and substitute them for the upper case words in this command:
bash ${REPO_HOME}/etc/prepare-install.sh USERNAME PASSWORD ${REPO_HOME}
Run the finish-install script, which loads the data model ontology among other things. Note that you can also give it additional options to specify a personal name and email box for the initial admin user.
bash ${REPO_HOME}/etc/finish-install.sh USERNAME PASSWORD https://localhost:8443
...or, with username metadata included:
bash ${REPO_HOME}/etc/finish-install.sh \
  -f firstname \
  -l lastname \
  -m admin@ei.edu \
  USERNAME PASSWORD https://localhost:8443
Run the upgrade.sh script, which performs additional configuration.
bash ${REPO_HOME}/etc/upgrade.sh USERNAME PASSWORD https://localhost:8443
Restart Tomcat to pick up these configuration changes. Confirm that the eagle-i repository is running by visiting the admin page (login with USERNAME and PASSWORD):
https://localhost:8443/repository/admin
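The same check can be scripted with curl; this is a sketch, where -k is needed only if your server still uses a self-signed certificate:

```
curl -k -s -o /dev/null -w "%{http_code}\n" \
  -u USERNAME:PASSWORD https://localhost:8443/repository/admin
```

An output of 200 indicates the webapp is up and your admin credentials work.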
This is the procedure to upgrade an existing repository instance to a new release of the software. All existing configurations, data, and user accounts are preserved. However, if the upgrade includes ontology changes there will also be an extra procedure to transform the existing data to reconcile it with ontology changes.
The repository release is distributed as a single Zip file. It contains a file README which identifies the software release it was built from. It is the artifact produced by the Maven project:
org.eagle-i:eagle-i-repository-dist
It would be a wise precaution to make a backup of the current repository state so you can roll back to it in case of fatal problems with the upgrade. Follow the Backup Procedure in the #Procedures section to get a snapshot of the current repository contents.
Note that the directory macros ${CATALINA_HOME} and ${REPO_HOME} are used in the examples here; see the Install Procedure above for a description of what they mean.
Unpack the distribution Zip archive in a directory, e.g. under /tmp:

cd /tmp
unzip repository-dist.zip
Delete the old repo webapp subdirectory and WAR file, since there should not be any local modifications there. For example:
rm -rf ${CATALINA_HOME}/webapps/ROOT*
Save the current release files in case you have to roll back:
cd ${REPO_HOME}
mv etc etc.old
mv lib lib.old
mv webapps webapps.old
Copy the distribution into place (in this example the distribution is version 1.7-MS1.01) -- note there are 2 steps:
cp -f -rp /tmp/repository-1.7-MS1.01/* ${REPO_HOME}
cp ${REPO_HOME}/webapps/ROOT.war ${CATALINA_HOME}/webapps/.
Run the upgrade script, substituting your admin's username and password:
bash ${REPO_HOME}/etc/upgrade.sh USERNAME PASSWORD https://localhost:8443
Watch the output of upgrade.sh very carefully! Pay particular attention to the final status and any messages beginning "WARN"; they will indicate problems you MUST resolve.
In particular, you may see warnings about the NG_Internal and NG_Query graphs. Most likely, these are nothing to worry about -- check the release notes. These graphs are only initialized from static files when the repository was created, and afterward they accumulate statements, so reloading a new copy of the original data is not practical. Some releases may include instructions for making changes in these graphs when upgrading from previous versions.

Download the data migration toolkit that corresponds to your repository version (in this example, version 1.7-MS1.02) and run the data migration script, substituting your admin's username and password:
wget -O ${REPO_HOME}/etc/eagle-i-datatools-datamanagement.jar \
  http://infra.search.eagle-i.net:8081/nexus/content/repositories/\
releases/org/eagle-i/eagle-i-datatools-datamanagement/1.7-MS1.02/\
eagle-i-datatools-datamanagement-1.7-MS1.02.jar
bash ${REPO_HOME}/etc/data-migration.sh -u USERNAME -p PASSWORD -r https://localhost:8443
Watch the output of data-migration.sh very carefully! Pay particular attention to the final status and any messages beginning "WARN"; they will indicate problems you MUST resolve. In addition to the output on screen, the data-migration script will place a data migration report in the logs directory directly under /etc.
When you create a new Role or Workflow Transition, you have the option of assigning your own URI to the new resource. When should you make up a URI, and when should you just let the system create one?
The answer is: if you expect to be exporting and sharing this resource -- which is to be expected for most Roles and Transitions, since there will typically be many commonly-administered repositories sharing the same configuration of Roles and workflow -- make up your own URIs following the guidelines here. This ensures that when, e.g., a User is copied from one repository to another, her Roles are all available on the destination repository with the same access grants. Likewise, Workflow Transitions should be given the same uniform URI on all repository sites to ensure that a change on the master site is propagated correctly. Since you ensure the Transition's URI is globally unique, you can import it on all the slave repos with the URI preserved, replacing the local copy, since the local URI will be the same as the master's URI.
For Workspace (aka Named Graph) URIs, you have to assign them in the process of creating a new Named Graph. Follow the rules below to create a reasonable URI.
Note that these URIs do not need to be resolvable. They are purely symbolic names for instances buried within the repository, which are virtually guaranteed never to appear in the outside world. So don't worry about whether the URI is actually resolved; most of the existing URIs for these types of things are not resolvable anyway.
Examples of good URIs:
http://dartmouse.edu/repo/Role_LabRat
http://dartmouse.edu/repo/WFT_13_2
http://eagle-i.org/ont/repo/1.0/DARTMOUSE_ROLE_PI
http://eagle-i.org/ont/repo/1.0/DARTMOUSE_WFT_TRASH
Exception: The URI of a named graph representing an ontology is usually the same as the URI of the ontology itself, i.e. the subject of its owl:versionInfo statement. If you should happen to add a new ontology named graph to the repository, use that URI for its name. However this should be a very rare occurrence; usually new ontological information is simply added to the existing eagle-i data model ontology graph.
The repository has a mechanism for restricting access to some of the properties of resource instances, deemed "hidden" and "contact" properties - these are two distinct sets of properties, configured independently but by an identical mechanism. See the Resource Property Hiding and Access Control sections under Concepts in the Repository Design Specification / API Manual for more details about how this works.
To configure access control, bring up the Admin GUI home page, and click on the link Manage Property Access Controls under Administrator Tasks. This page lets you edit the Access Control List (ACL) of both contact and hiding property sets. Granting READ permission allows a user or role to see those properties in Dissemination and harvest reports.
It is best to grant these permissions only to Roles - there should be no need to grant property read access at the granularity of users.
Note that if you grant READ to the Anonymous pseudo-role, that is the same as turning off all protection, since unauthenticated users will be able to see the hidden/contact properties.
Once you have set up a single repository to your liking, you can export and re-import the grants to other repositories. See the Procedure: Exporting and Importing Property Access Controls section below.
This section lists everything that can be configured, so you can get familiar with it before installing anything.
The repository requires these system properties to be defined in the JVM environment running your servlet container:
org.eaglei.repository.home - absolute path of the repository home directory (see below)
derby.system.home - directory containing Derby databases. We recommend you set it to the same path as repository home.
If you are using the Apache Tomcat version 6 container (which is recommended), you can add these system properties to file conf/catalina.properties - add lines like these: (note that the path /opt/eaglei/repo is just shown as an example)

org.eaglei.repository.home = /opt/eaglei/repo
derby.system.home = /opt/eaglei/repo
The repository has a notion of a home directory, the root of a hierarchy of other runtime files.
We recommend that you place this home directory outside of the servlet container hierarchy.

The path of the home directory is determined as follows: if the system property org.eaglei.repository.home is set, its value must be the absolute path of the home directory.

These files and subdirectories are found under the repository home:
configuration.properties - java properties file with repository and log4j configuration props. This is optional; it must be created by the administrator.
logs/ - Default subdirectory for log files, see configuration. Created automatically by default.
sesame/ - Default Sesame RDF database files - DO NOT TOUCH. Created automatically by default.
etc/ - Contains scripts and tools for the repo administrator.
db/ - Default subdirectory for Derby RDBMS files - DO NOT TOUCH. Created automatically by default.
The configuration file is read by Apache Commons Configuration, which recognizes interpolated property and system property values. See its documentation for more information about features in the configuration file.
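For instance, interpolation lets one setting reuse another value. This hypothetical fragment builds the log directory from the home-directory system property (the ${sys:...} prefix is Commons Configuration's syntax for system properties; the fragment is illustrative only, not required):

```
# hypothetical fragment, shown only to illustrate interpolation
eaglei.repository.log.dir = ${sys:org.eaglei.repository.home}/logs
```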
You can set the following properties in the configuration.properties file. Most of the repository's "configuration" comes from administrative metadata in its RDF database and from the ontologies loaded into it, so the configuration settings here are very minimal and mostly serve to bootstrap the RDF repository.
The properties in red are required; those in orange are important and can be considered required for a production system, although they can be elided for a test or development system at the cost of some ugliness in the UI.
Any properties not present (or commented-out) in the configuration properties file will revert to the default values documented here. In most cases this is just fine. The property is only provided so that the application's behavior can be customized and adjusted to suit the requirements of a particular installation site. For example, your site may have a convention of writing all log files to a filesystem separate from the applications.
eaglei.repository.namespace - The namespace URI prefix for Eagle-I resource instances created in the repository. Example: http://foo.bar.edu/i/
eaglei.repository.title - the decorative title for UI pages, should be set for cosmetic reasons.
eaglei.repository.logo - URL of the logo image for your site, may be either a relative URL (to refer to an image embedded in the webapp) or an absolute URL to use an image hosted elsewhere. It should be about 50 pixels high and a suitable width given the proportions.
eaglei.repository.index.url - Set this to the URL to which you want the site's "root" (top-level index) page redirected. Although the repository is installed as the root webapp to have control over resolving Semantic Web URIs, it does not need the root page, so this allows you to configure your site as you like.
eaglei.repository.admin.backgroundColor - Lets you change the background color for admin web UI pages, to give admins an obvious cue when they are operating on e.g. the production vs. test repos. Value is a CSS color expression, e.g. a crayon name like "bisque" or hex #CCFFCC. (Added in Release 1.2MS2 or 3)
eaglei.repository.instance.xslt - path to the XSL stylesheet used to transform the HTML output of the instance dissemination service. A value for this key is required to produce XHTML in the dissemination service; without it, the service returns the internal XML document describing the instance. An example is repository/styles/example.xsl, which creates very simple HTML, as a demonstration of how to write an XSL stylesheet.
eaglei.repository.instance.css - URI of the CSS stylesheet resource to be used to style instance dissemination pages. It must be an absolute path or absolute URL. The default is:

eaglei.repository.instance.css = /repository/styles/i.css
eaglei.repository.tbox.graphs - a comma-separated list of graph URIs making up the "TBox". Example: http://purl.obolibrary.org/obo/ero.owl
eaglei.repository.datamodel.source - the full name of a resource within the webapp which is itself a property file describing the RDF data model ontology. You should not need to set this; the default is adequate for the eagle-i application. Default is eaglei-datamodel.properties, which is a built-in resource file.
eaglei.repository.sesame.dir - directory where Sesame RDF database files are created. Default is the sesame subdirectory of the home dir.
eaglei.repository.log.dir - Directory where log files are created. Default is the logs subdirectory of the home dir. You can also configure log4j explicitly by adding log4j properties to this file.
properties to this file.eaglei.repository.sesame.indexes
- index configuration =
for Sesame triple store. Must be a comma-separated list of index specifiers=
, see Sesame Nativ=
eStore configuration documentation for details. Use this to change=
the internal indexes Sesame maintains to process queries. It takes effect =
on next servlet container (tomcat) restart.
WARNING
If you have a configured value and wish to go back to the default,
eaglei.repository.slow.query
- Value in seconds of time af=
ter which a SPARQL query should be considered "slow" and logged as such. On=
ly affects the SPARQL Protocol endpoint service. Default is 0, which never =
logs. Use this to check for performance problems, since it logs the full te=
xt of the query and time of occurance in the regular log at INFO level.eaglei.repository.sparqlprotocol.max.time
- Time limit, in=
seconds, of the maximum time allowed for a query invoked by the SPARQL Pro=
tocol endpoint. Note that this does not affect any internally-gene=
rated SPARQL queries.=20
eaglei.repository.anonymous.user - This is a hack, only intended for testing the Anonymous role. Its value is a username, e.g. "nobody". If configured, when the designated user logs in, their session is downgraded to the Anonymous role; this allows explicit testing of Anonymous (vs. Authenticated) access even when the webapp configuration does not allow unauthenticated access. ONLY TESTERS SHOULD EVER NEED TO SET THIS.
eaglei.repository.hideContacts - true|false, enables the contact hiding function. When it is false, none of the other properties are used.
eaglei.repository.postmaster - email address of repository administrator(s). User-generated messages about resources without a contact email address get sent here, as well as diagnostic messages. We recommend using an email list or alias so it can be changed or directed to multiple people.
eaglei.repository.mail.host - hostname of SMTP server for outgoing mail, defaults to localhost.
eaglei.repository.mail.port - TCP port number of SMTP server for outgoing mail, only necessary if using a non-default port for your chosen type of service.
eaglei.repository.mail.ssl - Use SSL for the connection to the SMTP server for outgoing mail, value is true or false.
eaglei.repository.mail.username - Username with which to authenticate to the SMTP server for outgoing mail, default is unauthenticated.
eaglei.repository.mail.password - password with which to authenticate to the SMTP server for outgoing mail, default is none.

The following optional properties are valid after the 2.0 release.
eaglei.repository.searchBar.javascript.url - Location of the source of the JavaScript for the search bar. The default value is sufficient unless custom search bar code needs to be loaded.
eaglei.repository.centralSearch.url - Location of the destination of the actual searches from the search bar. The default value is sufficient unless search is to be performed by a specific application.

Note that the properties file may also contain Log4J configuration properties. For example you can turn on debugging log output by adding this line:
log4j.logger.org.eaglei.repository=DEBUG, repository
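Putting several of the settings above together, a minimal configuration.properties for a production site might look like this sketch (every value is a placeholder for your site; spoc and posc are the Sesame NativeStore default indexes, with opsc added as an example of tuning):

```
# hypothetical example values -- substitute your site's own
eaglei.repository.namespace = http://repo.example.edu/i/
eaglei.repository.title = Example University Repository
eaglei.repository.postmaster = repo-admins@example.edu
eaglei.repository.sesame.indexes = spoc,posc,opsc
```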
The repository uses Apache log4j for its logging. Any properties starting with log4j. in the repository configuration properties are simply passed through to configure log4j. The Loggers (aka Categories) are all descendants of the repository root Logger, org.eaglei.repository, so you should configure the log level and appenders for that Logger.
Any log4j configuration properties in your repository configuration override the defaults shown below.
The default log4j configuration sets up an appender named repository with buffered I/O for efficiency. Note that this means log messages will not appear in the log file immediately, but only after the logging volume fills a buffer. This is useless for interactive debugging through the logs. If you are doing interactive debugging and want to see more log detail, along with immediate results, you should add the properties:
log4j.logger.org.eaglei.repository=DEBUG, repository
log4j.appender.repository.BufferedIO=false
log4j.appender.repository.ImmediateFlush=true
Also note that the default configuration turns off additivity in the repo root Logger; this means its log events do not propagate up to e.g. the root logger. If you wish to turn it back on, add this to your configuration:
log4j.additivity.org.eaglei.repository=true
Here are all of the default log4j configuration properties:
log4j.logger.org.eaglei.repository=INFO, repository
log4j.additivity.org.eaglei.repository=false
log4j.appender.repository=org.apache.log4j.RollingFileAppender
log4j.appender.repository.File=${eaglei.repository.log.dir}/repository.log
log4j.appender.repository.ImmediateFlush=false
log4j.appender.repository.BufferedIO=true
log4j.appender.repository.Append=true
log4j.appender.repository.Encoding=UTF-8
log4j.appender.repository.layout=org.apache.log4j.PatternLayout
log4j.appender.repository.layout.ConversionPattern=%d{ISO8601} %p %c - %m%n
IMPORTANT NOTE: If you add logger configurations to tweak the level of a subset of the repo log hierarchy, you must also add an additivity configuration to prevent log4j from applying the ancestor logger as well, which would result in double log entries. For example, this fragment shows a default log level of INFO but adds DEBUG logging of RepositoryServlet to get elapsed time messages:
log4j.logger.org.eaglei.repository=INFO, repository
log4j.additivity.org.eaglei.repository=false
log4j.logger.org.eaglei.repository.servlet.RepositoryServlet=DEBUG, repository
log4j.additivity.org.eaglei.repository.servlet.RepositoryServlet=false
log4j.appender.repository.BufferedIO=false
log4j.appender.repository.ImmediateFlush=true
It's often helpful to know exactly what version of the repository you're dealing with, especially in a hectic development and/or testing environment when many versions are available. The release version appears in these places:
In Dissemination HTML pages, the head element contains a meta tag with the name eaglei.version, e.g.
<meta name="eaglei.version" content="1.1-MS5.00-SNAPSHOT" />
/repository/admin lists application version info in a human-readable format.
/repository/version gives a complete breakdown of component versions, including repo source and the version of the OpenRDF Sesame database. It is XHTML, and it includes meta tags to be easy to scrape or transform.
Since the repository is mainly accessed by the REST service API it provides to other applications, you should get used to monitoring it by watching the log file. This is a text file (UTF-8 encoding) maintained by the log4j library under the control of the repository's configuration properties. See the description of the log.dir property above to learn the directory where logfiles are created; they are automatically rotated when the logfile grows too large.
The default repository logfile is in the logs/ subdirectory of the repository home directory, and it is named repository.log. See the Configuration Properties section, above, for instructions on changing the destination directory for logfiles.
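A routine way to monitor the log is to filter it for warnings. This sketch demonstrates the filter against a tiny stand-in file; in practice you would point LOG at ${REPO_HOME}/logs/repository.log (the sample log lines are fabricated for illustration):

```shell
# Build a small stand-in log file so the filter can be demonstrated
LOG=$(mktemp)
cat > "$LOG" <<'EOF'
2011-01-27 14:28:06,483 T=http-8443-1 INFO org.eaglei.repository - startup complete
2011-01-27 14:29:11,002 T=http-8443-1 WARN org.eaglei.repository - something to resolve
EOF
# keep only WARN and ERROR entries
grep -E ' (WARN|ERROR) ' "$LOG"
```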
To troubleshoot problems with the logging system itself (e.g. log4j config that isn't working as expected), look for where your Java Servlet container writes the standard output stream. For Tomcat 6, this is typically the catalina.out file in some log directory.
As of release 1.1MS5 the repo can log the elapsed time (in milliseconds) for each service request. You must enable DEBUG level logging for the RepositoryServlet, as in this configuration example.
log4j.logger.org.eaglei.repository=INFO, repository
log4j.additivity.org.eaglei.repository=false
log4j.logger.org.eaglei.repository.servlet.RepositoryServlet=DEBUG, repository
log4j.additivity.org.eaglei.repository.servlet.RepositoryServlet=false
log4j.appender.repository.BufferedIO=false
log4j.appender.repository.ImmediateFlush=true
As of release 1.2MS3 the repo will also show the time spent on internal SPARQL queries, which can be useful when tuning Sesame indexes. Add these log4j configuration lines to see just the query log messages:
log4j.logger.org.eaglei.repository.util.SPARQL = DEBUG, repository
log4j.additivity.org.eaglei.repository.util.SPARQL = false
Then, you'll see log entries like this which you can correlate to requests from your application:
...service invocation examples:
2011-01-27 14:28:06,483 T=http-8443-1 DEBUG org.eaglei.repository.servlet.RepositoryServlet -
============== Ending Request /repository/update (2,159 mSec elapsed)
2011-01-27 14:27:58,023 T=http-8443-1 DEBUG org.eaglei.repository.servlet.RepositoryServlet -
============== Ending Request /repository/workflow/push (1,763 mSec elapsed)
... (internal query example:)
2011-04-15 14:13:28,383 T=http-8443-1 DEBUG org.eaglei.repository.util.SPARQL -
SPARQL Query executed by org.eaglei.repository.model.User:findAll at line 227 in elapsed time (mSec) 15
You can also get the SPARQL Protocol endpoint to make log entries at the INFO level for "slow" queries, i.e. ones that take longer than a certain threshold.
See the eaglei.repository.slow.query configuration property for more details. Note that this only applies to queries made through the SPARQL Protocol endpoint, not the SPARQL queries generated internally by the repo code.
The performance of Sesame's NativeStore implementation is extremely sensitive to its index configuration. There is a major benefit to configuring indexes that help resolve triple patterns used by the most frequent and/or voluminous SPARQL queries. A knowledgeable repository administrator should adjust the setting of the eaglei.repository.sesame.indexes property to get the NativeStore to build the most necessary indexes. See the doc on that configuration for more details.
The make-snapshot script creates a complete backup copy of a data repository, in a designated directory. It has to be given a directory because the backup consists of multiple files. It is packaged with the repository distribution, under the etc/ directory.
Upon success, the directory will contain two files:
resources.trig -- RDF resource data in TriG format, read by /graph
users.trig -- user accounts, must be read by the /import service

Upon failure, it prints an explanatory message and returns non-0 status.
NO MESSAGE is printed upon success, which lets it run under cron.
Synopsis:
make-snapshot.sh username password repo-URL directory
Where:
Given a dump created in e.g. ${DUMPDIR}, to restore this dump on a newly-created, empty, repository, use these commands: (where ${REPOSITORY} is the URL prefix of the repo)
curl -D - -s -S -u ADMIN:PASSWORD -F type=user -F format=application/x-trig \
  -F content=@${DUMPDIR}/users.trig -F duplicate=replace \
  -F transform=no ${REPOSITORY}/repository/import
curl -s -S -D - -u ADMIN:PASSWORD -F action=replace -F all= \
  -F "content=@${DUMPDIR}/resources.trig;type=application/x-trig" \
  ${REPOSITORY}/repository/graph
For example, your crontab might invoke this command to write a daily snapshot in a differently-named directory each day, rotating through a week:
make-snapshot.sh ADMIN PASSWORD https://localhost:8443 "daily_cron_`date +%u`"
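A sketch of the corresponding crontab entry. The installation path /opt/eaglei/repo matches the home-directory example elsewhere in this manual, but the backup directory and schedule here are assumptions; also note that % must be escaped in a crontab line:

```
# Hypothetical: daily snapshot at 02:30, rotating through seven directories
30 2 * * * /opt/eaglei/repo/etc/make-snapshot.sh ADMIN PASSWORD https://localhost:8443 "/opt/eaglei/backups/daily_cron_`date +\%u`"
```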
The move-everything.sh script replicates all of a repository's contents - including resources, users and metadata - from one repository to another, or from a static file dump to a live repository. It transforms all resource (and user) URIs to match the URI prefix of the destination repository.
WARNING
This command obliterates all contents of the target repository.
Why do you need this script instead of just export and import requests? Because when moving from one repo to another, the URIs of resources and users have to be transformed.
Since resource URIs have to be resolvable, this effectively creates new resources in the destination repository with URIs that resolve there. It does this by substituting the target's default prefix into all URIs that used to resolve at the source repository.
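A minimal sketch of the prefix substitution being described. The URIs here are hypothetical, and the real script rewrites every URI inside the serialized RDF dump, not a single string:

```shell
#!/bin/sh
# Substitute the destination repository's URI prefix for the source's,
# as move-everything does for every resource and user URI.
FROM='http://harvard.eagle-i.net/i/'   # source prefix (hypothetical)
TO='http://localhost:8443/i/'          # destination prefix (hypothetical)

uri="<${FROM}00001234>"
moved=$(printf '%s\n' "$uri" | sed "s|$FROM|$TO|")
echo "$moved"
```

The local part of the URI is preserved; only the resolvable prefix changes, which is what lets the copied resources resolve at the destination.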
Before you start copying resources around, be sure you understand
However, move-everything has some advantages over move-resources:
Given all of these limitations, move-everything can still be an effective way of populating a repository for testing and demonstrations. Just stay aware of what doesn't work, and only use it when the results are temporary and will be discarded.
There is one other legitimate use of move-everything: restoring a backup copy made with make-snapshot. In this case you don't really have to transform the URIs, and the whole intent is to re-create the original state of the repo, so the side effects are all desired.
The resource copying script is installed under etc/ in the repository home directory. Its name is move-everything.sh. It only runs on a Unix-based operating system such as Linux or MacOS X. It requires bash.
The synopsis for copying from repository to repository:

Usage: move-everything.sh [--version] [-f|--force] [--exclude-users user,user,..] [--nousers] from-username from-password from-repo-URL to-username to-password to-repo-URL
The synopsis for copying from file to repository:

Usage: move-everything.sh [--version] [-f|--force] [--exclude-users user,user,..] [--nousers] --from-snapshot directory --from-prefix from-prefix to-username to-password to-repo-URL
The --force option: Normally the script starts up with a dialog explaining how dangerous it is and how the destination repo will be completely obliterated, and asks if you want to continue. Adding this option (abbreviated -f) bypasses the question and runs every time, without asking. It is necessary when embedding the script in another script.
If you specify an --exclude-users option, its value is a list of one or more usernames (separated by commas and/or spaces) to be left out of the source export. This is handy when you do not want to import the administrator accounts from the source system, for example. Note that the excluded users' RDF metadata will still get copied.
If you specify the --nousers option (it takes no value), explicit copying of user accounts is turned off entirely. Note that users' RDF metadata will still get copied, because it is in one of the named graphs which gets moved and transformed. There will just be no login accounts. Also note this allows you to run move-everything without an Administrator login at the source repo, since all you need is read access to all the graphs - and that does not necessarily require Administrator access.
The --from-snapshot and --from-prefix options must be specified together. They select the input data from a directory of serialized files, in the same format as produced by the make-snapshot script. The value of --from-snapshot is the path to the directory containing the RDF serialization files. The value of --from-prefix is the exact and complete URI prefix (including the trailing '/') of the repo that generated the dump in the directory. This is necessary because the script does not have access to that repository to query it for its prefix.
The fixed command arguments are either one or two triplets of repository access information, i.e. the username, password, and URL of each repo. If you selected file input with --from-snapshot, then you must only specify the destination repository args. Otherwise, you specify first the source or from repository, and then the target or destination repo. Each set of args consists of:
- username
- password
- repository URL, e.g. https://localhost:8443
Here is an example that copies from the production Harvard repo to a loc= al one:
move-everything.sh bigbird PASSWORD https://harvard.eagle-i.net \
  bigbird PASSWORD https://localhost:8443
Here is an example that copies a snapshot of the production Harvard repo to a local one:
make-snapshot bigbird PASSWORD https://harvard.eagle-i.net harvard.monday
move-everything.sh -f \
  --from-snapshot harvard.monday \
  --from-prefix http://harvard.eagle-i.net/i/ \
  bigbird PASSWORD https://localhost:8443
We strongly recommend you avoid using the Superuser (administrator) login on the source repository, to prevent accidentally obliterating it by getting the argument order wrong. Use an account that has read access to every graph (e.g. the Admin-Read-Only role). This restricts you to using the --nousers version of the command, but in most cases that is adequate. See the Procedures section for recommendations on how to maintain copies of repositories this way.
The goal of this procedure is to copy all of the resource instances from a source repository to a destination repository.
Since resource URIs have to be resolvable, this effectively creates new resources in the destination repository with URIs that resolve there. The hostname portion of the URI matches the new repository server, and even the local name is allocated by the destination repository -- so there is no predictable way to relate new URIs to the old ones.
Before you start copying resources around, be sure you understand
Given all of these limitations, the resource-mover script can still be an effective way of populating a repository for testing and demonstrations. Just stay aware of what doesn't work.
The resource copying script is installed under etc/ in the repository home directory. Its name is move-resources. It only runs on a Unix-based operating system such as Linux or MacOS X. It requires perl 5 and the curl executable.
Run it with -h to get the synopsis:
Usage: move-resources [-verbose] [-replace] [--type published|workspace]
  { --file source-file --prefix uri-prefix |
    --source source-repo-url --user login:password --graph src-graph-URI }
  dest-repo-url dest-login:dest-password dest-graph-URI
(options may be abbreviated to first letter, e.g. -f)
By default it adds data to the destination graph; --replace changes that to replacing the entire graph. You can change the type of the destination graph with the --type arg, e.g. set it to either workspace or published. By default the type is left alone.
You must choose a source by specifying either the file arguments (-f and -p) or the repository arguments (-s, -u, -g). You must always specify the destination repository, login, and graph, so they are plain args, not options.
Here is an example command; it copies from the Published graph on qa.harvard to an "Experimental" graph on the local repo (on https://localhost:8443):
move-resources -s https://qa.harvard.eagle-i.net:8443 -u bert:ernie \
  -g http://eagle-i.org/ont/repo/1.0/NG_Published https://localhost:8443 \
  root:password http://eagle-i.org/ont/repo/1.0/NG_Experimental
Moved 4694 data statements and 322 metadata statements.
IMPORTANT
If you are using the Tomcat server from e.g. a Linux distro's package system, you must be aware of the following serious pitfall that can affect the repository when you upgrade Tomcat through the package system:
Some if not all packaged Tomcat servers include a sample webapp installed as the ROOT webapp, so that the default server address can respond with a page congratulating you on installing Tomcat.
Meanwhile, the Repository replaces this ROOT webapp with its own (for good and compelling reasons detailed in the design documents). Thus, we destructively modify the installed state of Tomcat.
Some Tomcat package upgrade procedures (notably Fedora Core 12) have been observed to simply replace files in the expanded ROOT webapp without checking whether it was the original default ROOT webapp installed from the package. While we consider this a serious bug in the distribution package, it is unlikely to be fixed, so you must learn to expect and recover from it.
So, after upgrading a packaged Tomcat:
Remove the ROOT webapp directory (with Tomcat still shut down) and the old ROOT.war, then reinstall the repository's ROOT.war from your installation kit.
Finally, delete the entire ${CATALINA_HOME}/work directory. Tomcat rebuilds it on startup anyway, but it can contain stale caches that do not get updated. Now you can start up Tomcat as usual.
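The recovery steps above can be sketched as a small script. This is a dry-run sketch under assumptions: the paths match a typical packaged tomcat6 layout, and the run/recover_root_webapp helpers are hypothetical; set DRY_RUN=no only after verifying the paths on your system:

```shell
#!/bin/sh
# Post-upgrade recovery sketch: remove the distro's restored ROOT webapp,
# reinstall the repository's ROOT.war, and clear Tomcat's work directory.
CATALINA_HOME=${CATALINA_HOME:-/usr/share/tomcat6}   # assumption
WEBAPPS=${WEBAPPS:-/var/lib/tomcat6/webapps}         # assumption
DRY_RUN=${DRY_RUN:-yes}   # echo commands instead of running them

run() {
  if [ "$DRY_RUN" = yes ]; then echo "would run: $*"; else "$@"; fi
}

recover_root_webapp() {
  # 1. With Tomcat shut down, remove the expanded ROOT webapp and old war
  run rm -rf "$WEBAPPS/ROOT" "$WEBAPPS/ROOT.war"
  # 2. Reinstall the repository's ROOT.war from the installation kit
  run cp ROOT.war "$WEBAPPS/ROOT.war"
  # 3. Delete the work directory; Tomcat rebuilds it on startup
  run rm -rf "$CATALINA_HOME/work"
}

recover_root_webapp
```

Running it first with the default DRY_RUN=yes lets you confirm exactly which files would be touched before doing it for real.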
See also: The Procedure to redirect Port 80 so your URLs are simplified.
This section describes the differences in the install procedure when using the packaged tomcat6 server on Ubuntu Linux 9.10 (karmic koala). It was based on experience with the tomcat6 package, version 6.0.20-2ubuntu2.1.
NOTE: This procedure only lists the steps specific to Ubuntu's tomcat package. You need to review the previous section and follow that procedure, referring to this one for the steps related to tomcat.
Edit the file /etc/init.d/tomcat6 and change the following variable to look like this:
TOMCAT6_SECURITY=no
Install the Derby libraries: the derby* jar files are in the lib/ subdirectory under where you installed the create-user.sh script. Copy them to the Tomcat common library directory:
cp ${REPO-ZIP-DIR}/lib/derby* /usr/share/tomcat6/lib/
Install the webapp: First, get rid of any existing root webapp, then copy in the webapp (the ROOT.war file from your installation kit) and be sure it is readable by the tomcat6 user:
rm /var/lib/tomcat6/webapps/ROOT*
cp ROOT.war /var/lib/tomcat6/webapps/ROOT.war
Install the cached webapp context: This is VERY IMPORTANT, and the Tomcat docs do not even mention it, but without it your server will be mysteriously broken. The file /etc/tomcat6/Catalina/localhost/ROOT.xml must be a copy of your app's context.xml. Redo this command after installing every new ROOT.war:
mkdir -p /etc/tomcat6/Catalina/localhost
unzip -p /var/lib/tomcat6/webapps/ROOT.war META-INF/context.xml > /etc/tomcat6/Catalina/localhost/ROOT.xml
Add System Properties: Be sure you have added system properties to the file /etc/tomcat6/catalina.properties, e.g.
org.eaglei.repository.home = /opt/eaglei/repo
derby.system.home = /opt/eaglei/repo
...of course, the value of these properties will be your Repository Home Directory path.
Start up Tomcat:
sudo /etc/init.d/tomcat6 start
- /var/log/daemon.log - really dire tomcat problems; stdout/stderr go to syslog
- /var/log/tomcat6/* - normal catalina logging
- ${REPOSITORY_HOME}/logs/repository.log - default repo log file in release 1.1; under 1.0 the filename was default.log.
We want the repository (and other Web tools) to have a simple URL, without the ugly port number after the hostname, e.g. like this: http://dev.harvard.eagle-i.net/ and NOT http://dev.harvard.eagle-i.net:8080/... (because really, the first one is already enough to remember). This procedure uses IP port redirection to let your Tomcat server appear to be running on the canonical HTTP port, which is 80. It is the simplest and safest method to accomplish this under Linux.
The sanest alternative, running an Apache httpd server as an AJP forwarder, is much more effort and adds another point of failure. We will not even discuss running Tomcat as root so it has access to port 80, since that is simply unacceptable.
These procedures use the iptables command and must be run as root.
To check what rules are currently running:
iptables -t nat -n -L
Discover your machine's primary IP address and set the ADDR shell variable (note that this assumes eth0 is your primary network interface -- use ifconfig -a to see them all):
ADDR=`ifconfig eth0 | perl -ne 'print "$1\n" if m/\sinet addr\:(\d+\.\d+\.\d+\.\d+)\s/;'`
Run these iptables commands to redirect all port 80 requests to port 8080.
iptables -t nat -A OUTPUT -d localhost -p tcp --dport 80 -j REDIRECT --to-ports 8080
iptables -t nat -A OUTPUT -d $ADDR -p tcp --dport 80 -j REDIRECT --to-ports 8080
iptables -t nat -A PREROUTING -d $ADDR -p tcp --dport 80 -j REDIRECT --to-ports 8080
(If using SSL) Run these iptables commands to redirect all port 443 requests to port 8443.
iptables -t nat -A OUTPUT -d localhost -p tcp --dport 443 -j REDIRECT --to-ports 8443
iptables -t nat -A OUTPUT -d $ADDR -p tcp --dport 443 -j REDIRECT --to-ports 8443
iptables -t nat -A PREROUTING -d $ADDR -p tcp --dport 443 -j REDIRECT --to-ports 8443
Save the rules in the canonical place to be reloaded on boot:
iptables-save > /etc/iptables.rules
Create a script to be run by the network startup infrastructure that will reload the iptables whenever the network is configured on:
cat << EOF > /etc/network/if-pre-up.d/iptablesload
#!/bin/sh
iptables-restore < /etc/iptables.rules
exit 0
EOF
The cleaner/preferable method, but apparently not working:
/sbin/iptables-save > /etc/sysconfig/iptables
Update the startup settings so iptables will run upon reboot:
chkconfig --level 35 iptables on
The recommended way to dump out the RDF resource data content of the repository is to export it as serialized RDF. If you are exporting the entire contents of the repository, it is essential to preserve the mapping of statements to named graphs, so you must use one of the formats that encodes RDF as quads (statement plus graph-name/context).
The reason for this is that the repository server employs an RDF database, Sesame, to manage the RDF statements. It uses Sesame's "native" store, which records statements in opaque data files on the host OS's filesystem -- but much like relational database systems, Sesame's files are never in a consistent state while it is running, so it would have to be shut down (by shutting down the repository Web service) to make a "cold" snapshot backup. It is much easier to simply export the live data. Another advantage of exports as backups is that the data can easily be imported into a later version of Sesame or even a different database.
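To make the quad form concrete, here is a sketch that writes a tiny hypothetical TriG file (the URIs are stand-ins, not real repository data) and counts its named graphs -- a quick sanity check you might also run on a real dump:

```shell
#!/bin/sh
# In TriG, each named graph serializes as '<graph-uri> { ...triples... }',
# so every statement carries its graph context (a quad, not a bare triple).
cat > sample-dump.trig <<'EOF'
<http://example.org/NG_Published> {
  <http://example.org/i/0001> <http://example.org/ont/label> "widget" .
}
<http://example.org/NG_Internal> {
  <http://example.org/i/0002> <http://example.org/ont/label> "gadget" .
}
EOF

# Count top-level graph blocks: lines that open with '<uri> {'
graphs=$(grep -c '^<[^>]*> {' sample-dump.trig)
echo "named graphs: $graphs"
```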
This is a complex manual procedure with many options -- for a simpler semi-automated backup snapshot procedure, see the section on using the make-snapshot script.
Typical command to make a backup in TriG format, to a file, e.g. all-dump.trig, from a server running locally. In practice, you'll probably need to change the site-specific parts, such as the username:password login credentials, and the hostname in the target URL if not running locally.
curl -G -X GET -s -S -u username:password -o all-dump.trig -d all \
  --write-out 'status=%{http_code}, %{time_total}sec\n' \
  -d format=application/x-trig https://localhost:8443/repository/graph
Be sure the output shows a successful status code (namely 200), as shown here, since curl will return a successful status even if the HTTP service did not succeed; curl only reports on the success of the network request-and-response transaction.
status=200, 13.283sec
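Since curl exits 0 even on HTTP errors, a backup script should check the status= line itself. A minimal sketch under assumptions (check_status is a hypothetical helper; the sample lines mimic the --write-out output above):

```shell
#!/bin/sh
# Fail loudly unless the HTTP status written by curl's --write-out is 200.
check_status() {
  case "$1" in
    status=200*) echo ok ;;
    *) echo "backup failed: $1" >&2; return 1 ;;
  esac
}

check_status 'status=200, 13.283sec'
check_status 'status=500, 0.031sec' || echo "would abort here"
```

In a real cron job you would capture curl's --write-out output into a variable and pass it to a check like this, exiting non-zero so the failure is noticed.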
NOTE: This form of the procedure is a bit obsolete, since the new move-everything.sh script can also restore the state of a repository from its own backup -- effectively moving data to itself. See that command for details.
The procedure is still worth mentioning since it demonstrates the nature of the backup's contents:
Typical restore command.
WARNING
this replaces the entire contents of the target repository.
curl -s -S -u username:password -F action=replace -F all= \
  --write-out 'status=%{http_code}, %{time_total}sec\n' \
  -F 'content=@all-dump.trig;type=application/x-trig' https://localhost:8443/repository/graph
Be sure the output shows a successful status code (namely 201, since it created graphs), as shown here, since curl will return a successful status even if the HTTP service did not succeed; curl only reports on the success of the network request-and-response transaction.
status=201, 13.283sec
As of the MS6 release, you can use the new Export/Import service to create user accounts automatically (e.g. on a newly-created repository). This is NOT the same thing as true backup and restore; rather, it is intended more for setting up a test environment. The export and import services are very complex and powerful. This only gives one small example of what they can do. For all the details, see their entry in the API Manual.
Only do this once. Once you create a user file you like, you can use it over and over, on any different sites and tiers you like.
Create the user accounts you want on some repository instance. You will export them to create a document describing the user accounts you want. There can be extra accounts; you can filter them out of the export. So, get all the accounts you want in order, with roles, passwords, and personal names set up.
Now run a command like this to export the accounts into the file all-users.trig:
curl -s -S -u username:password -G -d type=user -d format=application/x-trig \
  --write-out 'status=%{http_code}\n' \
  -o all-users.trig https://hostname:8443/repository/export
Note that you have to change the hostname and possibly the login. If there are accounts you do not want in the export, add an exclude argument listing them, e.g.:
.... -d 'exclude=frankenstein moreau lizardo' ....
You can start with a newly-created repository which needs to have user accounts added. It only has the initial administrator login, e.g. bigbird. Use the import service to add users from the file you created in step 0. The following command adds all of the accounts except bigbird (since it already exists), and aborts without changing anything if there are already duplicates of any of the users on the destination repo. It will print "status=200" on success.
curl -s -S -u username:password -F type=user -F format=application/x-trig \
  -F transform=yes --write-out 'status=%{http_code}\n' \
  -F exclude=bigbird \
  -F content=@all-users.trig https://hostname:8443/repository/import
Note that the transform=yes argument means import will translate the instance URIs of the new users to newly-created URIs in the repository's default namespace. This is usually what you want. If you are positively restoring users already in the correct namespace and you want to preserve the old URIs, substitute transform=no.
The easiest way to test the existence and details of a user is with the /whoami service. It does not show roles, however; you'll have to go to the repository administrative UI for that (or take it on faith). For example, after restoring users including curator, this is how you'd check that curator exists:
curl -s -S -u curator:password -G -d format=text/plain https://hostname:8443/repository/whoami
It's probably only necessary to test one user like this, and to make sure the output includes a URI, as a check that the whole import succeeded.
This is only relevant to release 1.5MS1 and later, when resource properties have access controls.
To determine the URIs of the access controls, bring up the admin UI pages and log in as an Administrator. There will be a link to the Properties access control page named Manage Property Access Controls. If you go to that page, it will display two sets of properties for which there is an access control list:
Go to each of these in turn and observe the URI of the subject, e.g. http://eagle-i.org/ont/app/1.0/PropertyGroup_AdminData. This is the URI to include in your export request. Now do the same for contact properties and record that URI too.
To export property grants, plug those URIs into the following command (you need to replace the placeholder words, such as ADMIN:PASSWORD and the HIDE,CONTACT URIs):
curl -G -k -u ADMIN:PASSWORD -d type=grant -d "include=HIDE,CONTACT" \
  -d format=application/x-trig https://localhost:8443/repository/export
This writes a record of grants to the standard output. Since the URIs are the same between other repositories running the same data model, you should be able to import them with this command (the example reads from standard input):
curl -k -u ADMIN:PASSWORD -F type=grant \
  -F duplicate=abort -F transform=no -F content=@- \
  -F format=application/x-trig https://localhost:8443/repository/import