Defining a custom content model involves creating an XML file. The model is then deployed with a Spring bean. Localized labels for the content model items can also be part of the deployment.
Now when you have read the Introduction, it is time to define and deploy a content model. If you have already done this and are looking for information about how to configure the Share UI for a content model, then have a look at Configuring the User Interface.
To define a new content model means to add an XML file to the repository. This can be done via a bootstrapping procedure, shown below, or dynamically via the user interface (for more information, see Dynamic Deployment Approach). If you are running an Enterprise edition it is also possible to manage content models from the Admin Console (see Repository Admin Console). At the end of this article you will find information about where this XML file should be located in an SDK project and where it should be located in a standard installation.
A new custom content model starts with, you guessed it, a model element definition, which is the container for all other definitions as follows :
<?xml version="1.0" encoding="UTF-8"?> <model name="acme:contentModel" xmlns="http://www.alfresco.org/model/dictionary/1.0"> <description>Content model for the fictional company ACME</description> <author>Martin Bergljung</author> <version>1.0</version> <published>2015-09-01</published> <imports>...</imports> <namespaces>...</namespaces> <constraints>...</constraints> <types>...</types> <aspects>...</aspects> </model>
The order in which these elements are specified is important. It will not work to specify them in any other order than in this example. This is because in the meta model they are defined within a <xs:sequence> element. This is true throughout the model schema so be careful to always use the elements in the right order.
The imports definition is the container for import definitions of other content models referred to by the model being defined:
<imports> <!-- Import Alfresco Dictionary Definitions --> <import uri="http://www.alfresco.org/model/dictionary/1.0" prefix="d"/> <!-- Import Alfresco System Model Definitions --> <import uri="http://www.alfresco.org/model/system/1.0" prefix="sys"/> <!-- Import Alfresco Content Domain Model Definitions --> <import uri="http://www.alfresco.org/model/content/1.0" prefix="cm"/> </imports>
This example imports the Data Dictionary model with all the data-type definitions, such as d:text. It also imports the system model with types such as base, descriptor, container, reference, and aspects such as referenceable and temporary. Finally you import the generic content model (cm) with types such as content (files) and folder and aspects such as versionable and dublincore.
As you can imagine, it is important to get familiar with the out-of-the-box content models (see Out-of-the-Box Content Models) so you don’t start defining types and aspects that are already available. And it’s also important to get familiar with existing models to avoid clashing with namespaces and prefixes used by those.
The namespaces definition is the container for all new custom namespace definitions that will be used by this model:
<namespaces> <namespace uri="http://www.acme.org/model/content/1.0" prefix="acme"/> </namespaces>
This example defines the namespace for the new content model to http://www.acme.org/model/content/1.0. All definitions done in the model will be prefixed with acme. Any new name (referred to as local name) defined in the model for things like types, aspects, properties, etc. need to be unique. You could also define more than one namespace to be used in the model, such as in the following example:
<namespaces> <namespace uri="http://www.acme.org/model/content/1.0" prefix="acmec"/> <namespace uri="http://www.acme.org/model/workflow/1.0" prefix="acmew"/> </namespaces>
Here, you have defined one namespace acmec that will be used for document content classification and another namespace acmew that will be used exclusively for workflow-related content. You could have the same local name in both these namespaces, such as acmec:project and acmew:project.
A good idea is to use the following format for the URI:
http://<company domain address>/model/[content|workflow|...]/1.0
The constraints definition is the container for all new constraints to be used by the properties defined in the custom model. There are different types of constraints:
- LIST = property value must match an entry in the list
- REGEXP = property value must match the regular expression
- MIN-MAX = property value must be numeric and within this range
- LENGTH = property value must be a string and within min and max length
- java class = a custom constraint implementation in the form of a Java class
The following is an example definition of a LIST constraint:
<constraints> <constraint name="acme:securityClassificationOptions" type="LIST"> <parameter name="allowedValues"> <list> <value></value> <!-- Empty for default search --> <value>Public</value> <value>Client Confidential</value> <value>Company Confidential</value> <value>Strictly Confidential</value> </list> </parameter> </constraint>
The following is a LENGTH constraint example:
<constraints> <constraint name="acme:summary" type="LENGTH"> <parameter name="minLength"> <value>5</value> </parameter> <parameter name="maxLength"> <value>100</value> </parameter> </constraint>
And here is an example of a MINMAX constraint:
<constraints> <constraint name="acme:percentage" type="MINMAX"> <parameter name="minValue"> <value>0</value> </parameter> <parameter name="maxValue"> <value>100</value> </parameter> </constraint>
A regular expression (REGEXP) constraint looks like this:
<constraints> <constraint name="acme:contractIdFormat" type="REGEX"> <parameter name="expression"> <value><![CDATA[[^C\d{3}$]]></value> </parameter> <parameter name="requiresMatch"> <value>true</value> </parameter> </constraint>
Finally, you can define custom constraint implementations by supplying a Java class with the implementation. Here is an example from the out-of-the-box generic content model (contentModel.xml):
<constraints> <constraint name="cm:userNameConstraint" type="org.alfresco.repo.dictionary.constraint.UserNameConstraint" />
Being able to implement your own constraints is useful as it is common to want to set a property from a dynamic list of values, such as a ‘project name’. It might not be possible to use a static LIST constraint for project names as new projects might be created frequently. And you do not want users to type in the project name manually as then you might get more than one version of the project name present in the repository, making search unpredictable.
These constraints will have an effect on what controls and validation that are generated for the user interface (that is for the Share user interface).
The types definition is the container for all new custom type definitions. A type models an object in the specific domain that your are implementing a content management solution for. This could be for example a project, marketing document, engineering drawing, software manual, legal case, book, chapter and so on. Remember also that an item (node) in the repository, such as a file or a folder, can only be assigned one type.
Here are a couple of examples of how to define new types:
<types> <!-- ACME Enterprise-wide Document root type. All other custom document types should extend this one. --> <type name="acme:document"> <title>Base document type</title> <parent>cm:content</parent> <properties> <property name="acme:documentId"> <title>Document Identification Number</title> <type>d:text</type> </property> </properties> <mandatory-aspects> <aspect>acme:securityClassified</aspect> </mandatory-aspects> </type> <type name="acme:contract"> <title>Contract document</title> <parent>acme:document</parent> <properties> <property name="acme:contractName"> <title>Contract Name</title> <type>d:text</type> <mandatory>true</mandatory> </property> <property name="acme:contractId"> <title>Contract Identification Number</title> <type>d:text</type> <constraints> <constraint ref="acme:contractIdFormat" /> </constraints> </property> </properties> </type> <!-- A folder can have this type applied to represent a project container. --> <type name="acme:project"> <title>Project folder</title> <parent>cm:folder</parent> <properties> <property name="acme:projectName"> <type>d:text</type> <mandatory>true</mandatory> </property> <property name="acme:projectDescription"> <type>d:text</type> </property> <property name="acme:projectStartDate"> <type>d:date</type> </property> </properties> <associations> <!-- Setup a child-association from the type folder to zero or more members. Note. peer associations are not indexed and searchable, so using child-association instead. --> <child-association name="acme:projectMember"> <source> <mandatory>false</mandatory> <many>true</many> </source> <target> <class>cm:person</class> <mandatory>false</mandatory> <many>true</many> </target> <duplicate>false</duplicate> <propagateTimestamps>false</propagateTimestamps> </child-association> </associations> <mandatory-aspects> <aspect>acme:projectIdentifier</aspect> </mandatory-aspects> </type> </types>
Here you have first defined a base document type called acme:document that all your custom document types should extend. This is good practice as it allows you to search for all documents classified in some way as ACME documents, it will for example match the next type in the model called acme:contract, which extends the acme:document type. A base document type is also useful for keeping metadata that is applicable to all documents in the enterprise, such as some form of document identifier (that is acme:documentId) and security classification (that is acme:securityClassified).
Note how the base type extends the out-of-the-box type cm:content. All types that should be applied to files need to eventually extend this type, and it in turn extends the cm:cmobject type, which has properties such as cm:name (the name of the file or folder), cm:created, cm:creator, cm:modified, and cm:modifier. The last four properties belong to the cm:auditable aspect, which is mandatory on the cm:cmobject type. The auditable properties are managed by the system and when you for example upload a file the created date will be set to now, not to when the file was initially created.
The last type that is defined is called acme:project and it extends the out-of-the-box type cm:folder, which extends the cm:cmobject type described previously. So you can actually use folders to represent objects such as a project, a legal case, or other “container” objects. Being able to classify folders according to a specific domain is particularly useful as you can then also create “auto-classifying” rules for the folder. These rules would classify any files dropped into the folder according to the folders type and properties. Which means that you get files automatically classified without the end-user having to fill anything in.
The following table explains the sub-elements of the type element (note that they are the same for an aspect):
Name | Type | Multiplicity | Description |
---|---|---|---|
name | attribute (string) | required | The name of the type including namespace and local name, such as acme:project |
title | element (string) | [0..1] | The title of the type, will be used in the user interface if there are no i18n resources available. |
description | element (string) | [0..1] | The description of the type, useful for documentation purposes. |
parent | element (string) | [0..1] | Reference to another type that this type extends. Single inheritance is supported. A type usually extends the out-of-the-box cm:content type if it should be applied to files uploaded to the repository, and cm:folder if it should be applied to new folders in the repository. |
archive | element (boolean) | [0..1] | true = content with this type should be archived when deleted. This means that it will end up in the Archive Store and it can be restored from the recycle bin. (default)false = content with this type is not archived when deleted and is therefore permanently gone and can never be restored. |
properties | element container | [0..1] | Container for all custom properties that make up the metadata for this type. |
associations | element container | [0..1] | Container for all parent-child and peer associations for this type. Such as the acme:projectMember child association that has been defined for the cm:project type. |
overrides | element container | [0..1] | Property overrides of super type class properties. You can only override the default value, mandatory parameter, and constraints definitions for a property.It is quite common to override type properties when working with workflow content models, such as in the following example: |
<type name="scwf:activitiRevise"> <parent>bpm:activitiOutcomeTask</parent> <properties> <property name="scwf:reviseOutcome"> <type>d:text</type> <default>Abort</default> <constraints> <constraint type="LIST"> <parameter name="allowedValues"> <list> <value>Re-submit</value> <value>Abort</value> </list> </parameter> </constraint> </constraints> </property> </properties> <overrides> <property name="bpm:packageItemActionGroup"> <default>edit_package_item_actions</default> </property> <property name="bpm:outcomePropertyName"> <default>{http://www.someco.com/model/workflow/1.0}reviseOutcome</default> </property> </overrides> </type>
It is also common to do this for aspects. For example, the cm:titled aspect that is available out-of-the-box provides 2 properties to set the title and description for a node. These are non-mandatory properties.
If you wanted to always have these properties as mandatory on types, then you could define your own titled aspect as follows:
<aspect name="acme:titled"> <title>Titled</title> <parent>cm:titled</parent> <overrides> <property name="cm:title"> <mandatory>true</mandatory> </property> <property name="cm:description"> <mandatory>true</mandatory> </property> </overrides> </aspect>
As the custom acme:titled aspect extends the out of the box cm:titled aspect it will work in all searches done on cm:titled.
Name | Type | Multiplicity | Description |
---|---|---|---|
mandatory-aspects | element container | [0..1] | Mandatory aspects for this type. When content is created with this type applied, then these aspects will also be applied automatically. |
The type properties, and also the aspect properties as we will see, play a central part of the content model definition as they depict the so called metadata that should be stored together with the file or the folder. Metadata is very important in a content management solution as it will determine what you can search on, create rules around, base policies on etc.
The following table describes all the parameters that can be used when defining a new property(metadata) for a type or aspect:
Name | Type | Multiplicity | Description |
---|---|---|---|
name | attribute (string) | required | The name of the property. The name has to be unique between all types, aspects, constraints and so on that are defined in the namespace. It is different from object-oriented programming where a member variable of one class can be named the same as in another class. |
title | element (string) | [0..1] | The title of the property. Will be used in the user interface as field label if there are no i18n resources available. |
description | element (string) | [0..1] | The description of the property, useful for documentation purposes. |
type | element (string) | [1..1] | Mandatory data type reference. This references a data-type definition in the dictionaryModel.xml content model. Here are some of the most commonly used data types: d:text, d:int, d:long, d:float, d:double, d:date, d:datetime, d:boolean, d:encrypted, d:noderef |
protected | element (boolean) | [0..1] | true = this property cannot be edited after the value has been set. And it cannot be edited at all from the user interface (that is Alfresco Share). This is usually used for system properties such as cm:created, cm:creator. They are set by the system and can then not be touched. It is sometimes also used in custom content models to make a property read-only after it has been initially set via an API call.false = the property can be updated as many times as you like. This is the default if this element is not specified. |
mandatory | element (boolean) | [0..1] | true = when a property is set as mandatory it tells Content Services that the property is required. By default, this is not enforced. Instead, Content Services marks content items with empty mandatory properties with the aspect sys:incomplete. This is done so that you can create items that have mandatory properties even if the value of the property is not known at the time of content creation, while still indicating that the property is required (eventually). Mandatory properties will have a * next to them in the UI.false = it is optional and this is the default if this element is not specified.The mandatory element can also have a boolean attribute called enforced. If this is set to true then you cannot create a node without this property having a value. |
multiple | element (boolean) | [0..1] | true = this property can have a list of values.false = only one value can be entered and this is the default if this element is not specified. |
default | element (any) | [0..1] | Default value for this property if the user does not specify any value. The UI input field will be pre-populated with this value. |
index | element | [0..1] | Solr/Lucene index configuration. The indexing behavior for each property can be configured. If we don’t configure any indexing behavior, then the default configuration is:<index enabled="true"> <atomic>true</atomic> <stored>false</stored> <tokenised>true</tokenised> </index>Explanation of default index configuration:enabled="true": index the value of the property. Setting this to false means that the property is not indexed at all and is not part of the index.<atomic>true</atomic>: atomic is not used when Content Services uses Solr for search. The default value is there to allow the continued use of the built in Lucene indexing engine when customers don’t want to, or cannot, switch to Solr. With Solr the index is eventually up-to-date with the database (properties/metadata are stored in the database).<stored>false</stored>: the property value is not stored in the index. This property is not used with Solr either, all fields are store in the new cached content store on the Solr side.<tokenised>true</tokenised>: the property value is tokenized when it’s indexed, so if the value is “Company Confidential” it will be tokenized into two strings that will be indexed separately, which might not always be what you want. You can also use false, which will just tokenize the value as one item. Further on, it also possible to set it to both, which means that “Company Confidential”, “Company”, and “Confidential” will be in the index. The tokenizer is determined by the property type in the data dictionary. This is locale sensitive as supported by the data dictionary, so you could switch to tokenize all your content in for example German. You cannot mix for example German and English tokenization.There is also another sub-element that can be specified for the index element. It is called facetable and controls the faceting behavior in Solr as follows:<facetable>true</facetable>: property is set up properly in Solr for faceting and is really fast, ordered, and sorted.<facetable>false</facetable>: property is not facetable, that is, you cannot create a facet from this property. In some cases it makes no sense to make a property facetable, for example if it is a unique property. Note that setting facetable to false will not save resources.Unspecified as above (default): faceting works but not as fast and efficient as when this element is explicitly specified and set to true. |
constraints | element | [0..1] | Container for property constraints. |
The last thing that is defined in the model are the aspects. The definition of an aspect does not differ much from a type definition. In fact, in the schema they are both based on the same class definition. The main difference between an aspect and a type is that a content node can have multiple aspects applied to it but only one type applied to it. This is important to think about when you decide if you should use an aspect or a type in the model.
One way of deciding if a domain object should be modelled as a type or an aspect is to think about if it is a noun or a verb. For example, a Legal document could be modelled as a type and a CAD drawing could be modelled as a type. And it is quite clear that it does not make much sense to apply both the legal document type and the CAD drawing type to the same file content item. Now, both these types of documents could be emailed into the system and if we wanted to model that it would probably be best done with an aspect called for example emailed. You might also want to version both of these types of documents. So it make sense to have a versioned aspect. You could then apply both the versioned and emailed aspects to for example a Legal document.
Now, it is necessary to look at dealing with nouns such as client and supplier. If you model them as types you could have problems, as many different file nodes could be related to a client. For example, a Legal document and a CAD drawing could both be related to them same client, so if you modelled client as a type you could not classify a document as both Legal and being associated with a specific client. So in this case client and supplier would be better off modelled as aspects so they can both be applied to a node that already has a type applied.
Another implementation difference between types and aspects is that an aspect usually doesn’t extend another aspect, it is more common for them to be stand alone. Aspects are used to implement cross-cutting concerns independently of the type of node. Note that there are a number of aspects already available out of the box, for example the aspects mentioned above, emailed and versionable are already available out-of-the-box. The following is a list of some of the aspects that comes with Content Services out-of-the-box:
- cm:titled - an extra title and description property - usually applied automatically by the system.
- cm:auditable - creator, created, modifier, modified, last access - this aspect is applied automatically by the system.
- cm:transformable - indicates that the content item is transformable to other formats.
- cm:templatable - applying this aspect to a file node makes this node available as a template in the “create from template…” action.
- cm:projectsummary
- cm:complianceable - holds remove after datetime data, there is no behavior attached to it. It was created before the Records Management module, and the idea was that people could use rules or timed actions to create the behavior they needed.
- cm:author - applied automatically by the system if author metadata can be extracted from document.
- cm:dublincore
- cm:partable - can be used to denote that a content item is part of another content item.
- cm:referencing - allows a reference association to be made between one document and another document.
- cm:replaceable - defines that a document has been replaced by another newer document.
- cm:effectivity - apply this to a content item if a to and from date is useful.
- cm:summarizable - apply this to a content item if a summary text is needed.
- cm:countable - apply this to a content item to count number of times it’s been accessed.
- cm:copiedfrom - applied when a content item is a copy of another content item.
- cm:workingcopy - a checked out working copy.
- cm:checkedOut- applied by the system when a content file is checked out and locked.
- cm:versionable - apply this aspect to a content file if it should be versioned (note. folders cannot be versioned).
- cm:lockable - used by the system when a content item is checked out and locked.
- cm:subscribable - can be used if users should be able to subscribe to content items.
- cm:classifiable - apply this to a content item if it should be categorized.
- cm:taggable - can be applied to any content item in the repository to allow tags to be applied to it.
- cm:rateable - apply to content item if it should be ratable (like/unlike and 5-star rating schemes exist).
- cm:attachable - apply if content item can have attachments, such as an email.
- cm:emailed - apply this for example to a content item that has been emailed into the repository.
- cm:geographic - applied automatically to items with longitude and latitude information embedded.
It is always preferable to use out-of-the-box aspects where possible.
The following is an example of a custom aspect definition:
<aspect name="acme:securityClassified"> <title>Security Classified</title> <description>Content has been security classified</description> <properties> <property name="acme:securityClassification"> <type>d:text</type> <index enabled="true"> <atomic>true</atomic> <stored>false</stored> <tokenised>false</tokenised> <facetable>true</facetable> </index> <constraints> <constraint ref="acme:securityClassificationOptions" /> </constraints> </property> </properties> </aspect> <aspect name="acme:webPublished"> <title>Web published</title> <description>Content has been published to website</description> <properties> <property name="acme:publishedDate"> <type>d:datetime</type> <mandatory>true</mandatory> </property> </properties> </aspect>
Here an aspect has been defined to keep track of the security classification for content items. Any ACME document can have a security classification applied and you can then build rules, policies, processing and other entities based on these security classifications. Faceted search has been specifically enabled for best performance and the default tokenised value has also changed to false so you only index the whole value, such as “Company Confidential”.
There is also a simple webPublished aspect that could be used to indicate if a content item has been published on the web.
After completing the aspect definitions you have almost finished the content model implementation. It would now be possible to deploy it (i.e. bootstrap it) and use it. The labels used in the Share UI for aspects and types, such as when creating rules, are by default picked from the title element in the content model.
However, in multilingual installations we need to supply localization labels for the types, aspects, properties etc. So it is possible to translate the model into different languages. Also, if the title element is not specified for an aspect or type in the content model, then the i18n resource file with the labels is needed.
We can do that by supplying a properties file with property definitions according to certain naming conventions, here is an example for our ACME content model:
acme_contentModel.type.acme_document.title=ACME Document acme_contentModel.type.acme_contract.title=ACME Contract acme_contentModel.type.acme_policy.title=ACME Policy acme_contentModel.type.acme_whitePaper.title=ACME White paper acme_contentModel.type.acme_project.title=ACME Project acme_contentModel.aspect.acme_webPublished.title=ACME Web Published acme_contentModel.aspect.acme_securityClassified.title=ACME Security Classified acme_contentModel.aspect.acme_projectIdentifier.title=ACME Project Identifier acme_contentModel.property.acme_documentId.title=Document Id acme_contentModel.property.acme_contractName.title=Contract Name acme_contentModel.property.acme_contractId.title=Contract Id acme_contentModel.property.acme_projectName.title=Project Name acme_contentModel.property.acme_projectDescription.title=Project Description acme_contentModel.property.acme_projectStartDate.title=Project Start Date acme_contentModel.property.acme_projectNumber.title=Project Number acme_contentModel.property.acme_publishedDate.title=Published Date acme_contentModel.property.acme_securityClassification.title=Security Classification acme_contentModel.child-association.acme_projectMember.title=Project Member
The naming convention patter for these resource properties are as follows:
{content model namespace}_{content model local name}.[type|aspect|property|association|child-association].{content model namespace}_{local name}.[title|description]
The resource properties file can now be bootstrapped into the repository at the same time as the content model XML file. This is done via a Spring bean definition as follows:
<bean id="org.alfresco.tutorial.customcontentmodelrepo.dictionaryBootstrap" parent="dictionaryModelBootstrap" depends-on="dictionaryBootstrap"> <property name="models"> <list> <value>alfresco/module/<module-id>/model/content-model.xml</value> </list> </property> <property name="labels"> <list> <!-- Bootstrap Resource Bundles for data list type and properties --> <value>alfresco/module/${project.artifactId}/messages/content-model</value> </list> </property> </bean>
The key is to define a Spring bean that extends (sets as parent) the out-of-the-box dictionaryModelBootstrap Spring bean. You also want your custom model to be bootstrapped after the out-of-the-box content models as you use them in your model. This is achieved via the depends-on definition. You specify the models that you want to bootstrap as a list contained in the models property.
So we now have a content model defined in XML and registered with the repository. It is now possible to use one of the Content Services APIs to create a node with a type from this content model, and one or more aspects applied from the model. Doing this via the Java API would look something like this:
private void createContractFile() { String acmeModelURI = "http://www.acme.org/model/content/1.0"; String filename = "ContractA.txt"; NodeRef parentFolderNodeRef = serviceRegistry.getNodeLocatorService().getNode(CompanyHomeNodeLocator.NAME, null, null); // Create Node QName associationType = ContentModel.ASSOC_CONTAINS; QName associationQName = QName.createQName(NamespaceService.CONTENT_MODEL_1_0_URI, QName.createValidLocalName(filename)); QName nodeType = ContentModel.TYPE_CONTENT; Map<QName, Serializable> nodeProperties = new HashMap<QName, Serializable>(); nodeProperties.put(ContentModel.PROP_NAME, filename); nodeProperties.put(QName.createQName(acmeModelURI, "documentId"), "DOC001"); nodeProperties.put(QName.createQName(acmeModelURI, "securityClassification"), "Company Confidential"); nodeProperties.put(QName.createQName(acmeModelURI, "contractName"), "The first contract"); nodeProperties.put(QName.createQName(acmeModelURI, "contractId"), "C001"); ChildAssociationRef parentChildAssocRef = serviceRegistry.getNodeService().createNode( parentFolderNodeRef, associationType, associationQName, nodeType, nodeProperties); NodeRef newFileNodeRef = parentChildAssocRef.getChildRef(); // Set content for node boolean updateContentPropertyAutomatically = true; ContentWriter writer = serviceRegistry.getContentService().getWriter(newFileNodeRef, ContentModel.PROP_CONTENT, updateContentPropertyAutomatically); writer.setMimetype(MimetypeMap.MIMETYPE_TEXT_PLAIN); writer.setEncoding("UTF-8"); String fileContent = "Contract A, this contract ..."; writer.putContent(fileContent); // Add an aspect to the node Map<QName, Serializable> aspectProperties = new HashMap<QName, Serializable>(); aspectProperties.put(QName.createQName(acmeModelURI, "publishedDate"), new Date()); serviceRegistry.getNodeService().addAspect(newFileNodeRef, QName.createQName(acmeModelURI, "webPublished"), aspectProperties); }
In this case you are creating an ACME Contract document file. You set the properties for the security classification aspect when you create the Contract node as it is a mandatory aspect on the Contract type. If you want to add other aspects you can do that as shown for the webPublished aspect.