<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The present and future of post production business and technology &#187; Metadata</title>
	<atom:link href="http://www.philiphodgetts.com/category/metadata/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.philiphodgetts.com</link>
	<description>Philip Hodgetts</description>
	<lastBuildDate>Thu, 02 Feb 2012 21:57:36 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Can a computer really recognize an individual face, or a car?</title>
		<link>http://www.philiphodgetts.com/2012/01/can-a-computer-really-recognize-an-individual-face-or-a-car/</link>
		<comments>http://www.philiphodgetts.com/2012/01/can-a-computer-really-recognize-an-individual-face-or-a-car/#comments</comments>
		<pubDate>Mon, 30 Jan 2012 00:58:32 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=4595</guid>
		<description><![CDATA[What is the state of image/facial recognition and how does it integrate into production workflows.]]></description>
			<content:encoded><![CDATA[<p>In this attempt to summarize the state of a technology and its application to production and postproduction my focus is on image recognition, including facial detection and recognition. We&#8217;re exposed to facial recognition/detection technology in some current apps: Premiere Pro CS5 onward; iPhoto, Final Cut Pro X, Picassa, Facebook, with mixed success.</p>
<p><span id="more-4595"></span></p>
<p>Firstly, the distinction between facial detection and facial recognition is the difference between recognizing &#8220;oh, that&#8217;s a face&#8221; compared with &#8220;oh that&#8217;s Philip Hodgetts&#8217; face&#8221;. That&#8217;s a huge distinction. Most digital still cameras sold today recognize faces in the image and attempt to lock focus on them. Heck, <a href="http://www.freetheapps.com/smile-click/">Smile Click</a> recognizes that a subject is smiling before taking a picture, and that&#8217;s a 99c app! Facial detection is a <a href="http://maniacdev.com/2011/11/tutorial-easy-face-detection-with-core-image-in-ios-5/">developer framework in iOS 5</a>, making it easy to add facial detection to any app &#8211; for tracking, changing or modifying! Sanyo have <a href="http://www.electronista.com/articles/08/06/05/sanyo.xacti.hd1010.debuts/">facial detection</a> (they call if Face Chaser) in a video camera. In professional cameras, Fujifilm have released a lens with its <a href="http://broadcastengineering.com/news/fujifilm-feature-facial-recognition-precision-focus-telephoto-lenses-20110317/">TRACE technology that tracks faces in shot to maintain focus and exposure</a> &#8211; up to 12 faces in a shot.</p>
<p>In postproduction software both Premiere Pro CS 5 and later and Final Cut Pro X attempt facial detection. Neither attempts to identify the individual, just that there is one or more people in the image. Subjectively, it&#8217;s still a work in progress. When it works, it&#8217;s great but even facial detection isn&#8217;t perfect with moving video. Final Cut Pro X further takes this useful metadata (how many people are in a picture) and attempts to infer the shot based on the size of the detected faces: big faces = closeup; many small faces = wide. (I&#8217;m sure that&#8217;s a gross simplification of someone&#8217;s very hard work, but you get the point.)</p>
<p>Facial recognition is valuable because it allows us to quickly group shots with the same person (or character) in them without any additional work. Taking the boring out of post, as I like to say. We&#8217;d have to identify the face once and probably deal with some false identifications, but if it gets more accurate than iPhoto currently is, it could be more useful.</p>
<p>I&#8217;d also suggest that it&#8217;s likely to be more useful in a video project, and more accurate, than in a general iPhoto library because of the more limited set of examples for each person. iPhoto (pre-Polar Rose technology) does a good job where it&#8217;s presented with a set of images of the same person in roughly the same time. The more examples in photos across a lifetime, and the accuracy is reduced. So, with the more limited time scale of the typical video project, I&#8217;d expect better accuracy.</p>
<p>Facial recognition is still very much a work in progress. In September 2010 Apple purchased Swedish facial recognition company Polar Rose, presumably to boost the facial recognition technology in iPhoto and across their entire product range (I hope!). Full facial recognition probably won&#8217;t make it to Final Cut Pro X this year, but you do have to wonder what they have planned for a &#8220;full revision&#8221; release (one we might pay for) after all the catch-up features are added. (I have absolutely no ideas, this is pure hypothesis/wishful thinking on my part.)</p>
<p>However, if we broaden out a little and consider the research that&#8217;s being done &#8211; the existing integration of facial recognition in social media and photo sharing sites, and how object detection/computer understanding of what it&#8217;s seeing is developing &#8211; then it&#8217;s obvious that computer recognized metadata will start to be a viable alternative.</p>
<h3>Facial recognition</h3>
<p>I mentioned Polar Rose, now integrated into Apple where I&#8217;d expect it will be used to improve the accuracy of facial recognition in iPhoto and Aperture, but also likely to be added to the facial detection framework now in iOS 5. As iOS and OS X are merging, I would expect the same frameworks to become available to OS X developers in due course. From GigOm - <a href="http://gigaom.com/apple/apples-ios-facial-recognition-could-lead-to-kinect-like-interaction/">Apple’s iOS facial recognition could lead to Kinect-like interaction</a>:</p>
<blockquote><p>The unearthed APIs are described as “highly sophisticated,” and can determine where a user’s mouth, and left and right eyes are located, as well as process images taken by the iPhone for face detection. Aside from providing Apple an easy way to introduce Faces (which recognizes specific people in iPhoto) to both its own Photos app and any third-party apps that access that library, it should also open the door for much more advanced facial recognition applications.</p></blockquote>
<p>Beyond Apple, facial recognition is becoming ubiquitous, even to the point where Slate&#8217;s <a href="http://www.slate.com/authors.farhad_manjoo.html">Farhad Manjoo</a> wrote in his &#8220;<a href="http://www.slate.com/articles/technology/technology/2011/07/smile_youre_on_everyones_camera.html">Smile, You&#8217;re on everyone&#8217;s camera</a>&#8220;:</p>
<blockquote><p>Soon, face recognition will be ubiquitous. While the police may promise to tread lightly, the technology is likely to become so good, so quickly that officers will find themselves reaching for their cameras in all kinds of situations. The police will still likely use traditional ID technologies like fingerprinting—or even iris scanning—as these are generally more accurate than face-scanning, but face-scanning has an obvious advantage over fingerprints: It works from far away. Bunch of guys loitering on the corner? Scantily clad woman hanging around that run-down motel? Two dudes who look like they&#8217;re smoking a funny-looking cigarette? Why not snap them all just to make sure they&#8217;re on the up-and-up?</p></blockquote>
<p>This is absolutely a technology who&#8217;s time is coming. Further in the Slate article:</p>
<blockquote><p>In 2006, <a href="http://techcrunch.com/2006/08/15/google-acquires-biometric-company-neven-vision/" target="_blank" data-linktype="External">Google acquired</a> the biometric recognition company Neven Vision, and Hartmut Neven, one of the world&#8217;s experts in computer vision, is a respected engineer at the company. <a href="http://ilabs.microsoft.com/Post/Pages/Post.aspx?PostId=16" target="_blank" data-linktype="External">A Microsoft research team</a> in Israel has built a fantastic app that uses face-recognition systems to search the Web for pictures of people who are in your photo album. And last year <a href="http://news.cnet.com/8301-1023_3-20069853-93/facebook-quietly-rolls-out-facial-recognition-tool/" target="_blank" data-linktype="External">Facebook rolled out</a> a tool that automatically suggests names of people to tag in your pictures.</p></blockquote>
<p>The Google acquisition has now rolled out as facial recognition in Picassa.</p>
<p>Late 2011 the New York Times had a feature article <a href="http://www.nytimes.com/2011/11/13/business/face-recognition-moves-from-sci-fi-to-social-media.html">Face Recognition Makes the Leap From Sci-Fi </a>in their business section. In the article they list these examples:</p>
<ul>
<li><a title="Scene Tap web site." href="http://www.scenetap.com/">SceneTap</a>, a new app for smart phones, uses cameras with facial detection software to scout bar scenes. Without identifying specific bar patrons, it posts information like the average age of a crowd and the ratio of men to women, helping bar-hoppers decide where to go. More than 50 bars in Chicago participate.</li>
<li><a title="The company’s site." href="http://immersivelabs.com/">Immersive Labs</a>, a company in Manhattan, has developed software for digital billboards using cameras to gauge the age range, sex and attention level of a passer-by.</li>
<li>Those endeavors pale next to the photo-tagging suggestion tool introduced by <a title="More articles about Facebook." href="http://topics.nytimes.com/top/news/business/companies/facebook_inc/index.html?inline=nyt-org">Facebook</a> this year. When a person uploads photos to the site, the “Tag Suggestions” feature uses facial recognition to identify that user’s friends in those photos and automatically suggests name tags for them. And apparently you <a href="http://blog.internetcases.com/2011/03/12/facebook-privacy-photo-tagging-attorney-chicago-lawyer-social-media/">don&#8217;t need a person&#8217;s permission</a> to tag them on Facebook.</li>
<li>“Millions of people are using it to add hundreds of millions of tags,” says Simon Axten, a Facebook spokesman. Other well-known programs like Picasa, <a title="Face suggestions on Picasa." href="http://picasa.google.com/support/bin/answer.py?answer=156272">the photo editing software from Google</a>, and third-party apps like <a title="Information about Face.com." href="http://face.com/about.php">PhotoTagger</a>, from <a href="http://face.com/" target="_">face.com</a>, work similarly.</li>
</ul>
<p>Powered by technology from Face.com (one of the key players in white label face detection) <a href="http://techcrunch.com/2009/05/14/find-the-clown-facecom-tags-400-million-facebook-photos-in-30-days-more-invites/">Facebook processed 400 million photographs in 30 days</a>!</p>
<p>So my dream of having all the people in my source video recognized and grouped. Perhaps for manual naming (once) but more likely we&#8217;ll be able to use existing resources to match the face somewhere and associate the name with it. In a study reported in the <a href="http://www.theatlantic.com/technology/archive/2011/09/cloud-powered-facial-recognition-is-terrifying/245867/">Cloud-Powered Facial Recognition Is Terrifying</a> article at The Atlantic:</p>
<blockquote><p>Unlike Groucho Marx, unfortunately, the cloud never forgets. That&#8217;s the logic behind a new application developed by Carnegie Mellon University&#8217;s Heinz College that&#8217;s designed to take a photograph of a total stranger and, using the facial recognition software PittPatt, track down their real identity in a matter of minutes. Facial recognition isn&#8217;t that new &#8212; the rudimentary technology has been around since the late 1960s &#8212; but this system is faster, more efficient, and more thorough than any other system ever used. Why? Because it&#8217;s powered by the cloud.</p>
<p>&#8230;.</p>
<p>With Carnegie Mellon&#8217;s cloud-centric new mobile app, the process of matching a casual snapshot with a person&#8217;s online identity takes less than a minute. Tools like PittPatt and other cloud-based facial recognition services rely on finding publicly available pictures of you online, whether it&#8217;s a profile image for social networks like Facebook and Google Plus or from something more official from a company website or a college athletic portrait. In their most recent round of <a href="http://www.heinz.cmu.edu/%7Eacquisti/face-recognition-study-FAQ/">facial recognition studies</a>, researchers at Carnegie Mellon were able to not only match unidentified profile photos from a dating website (where the vast majority of users operate pseudonymously) with positively identified Facebook photos, but also match pedestrians on a North American college campus with their online identities.</p></blockquote>
<p>At most you need one photograph of the individual, and good matching algorithms. <a href="http://news.cnet.com/8301-31921_3-20088456-281/face-matching-with-facebook-profiles-how-it-was-done/">CNET News has a background article</a> that gives another take on the experiment identifying random people in public spaces.</p>
<p>It&#8217;s getting quite common:</p>
<ul>
<li><a href="http://paidcontent.org/article/419-ftc-calls-for-comments-on-facial-recognition-technology/">FTC Calls For Comments On Facial Recognition Technology</a></li>
<li><a href="http://article.wn.com/view/2012/01/09/25_Big_Ideas_For_2012_Ubiquitous_Face_Recognition/">Ubiquitous facial-recognition software is coming</a></li>
<li><a href="https://www.networkworld.com/community/blog/fbi-rolling-out-nationwide-face-search-and-re#_jmp0_">FBI rolling out nationwide face search and recognition system</a></li>
<li><a href="http://bits.blogs.nytimes.com/2011/08/09/london-police-use-flickr-to-identify-looters/">London Police Use Flickr to Identify Looters</a></li>
</ul>
<h4>Facial Recognition Technologies</h4>
<p>Before moving on to other ways computers are &#8220;seeing&#8221; images, here&#8217;s a short summary of the primary technology providers.</p>
<ul>
<li>Google purchased PittPatt and are integrating the technology into Picassa</li>
<li>Apple purchase Polar Rose and are integrating the technology widely through their OS and applications.</li>
<li>Facebook relied on Face.com technology. In fact Face.com are one of the major providers of facial recognition technology with <a href="http://techcrunch.com/2010/05/03/7-billion-scanned-photos-later-face-com-opens-up-to-developers/">7 Billion Photos</a> in the last year in its Facebook apps, Face.com is now available for developers (such as Facebook, and well, even Assisted Editing) to use:</li>
</ul>
<ul>
<li>Developers who are interested in building their own facial recognition apps can now get full access to the open Face.com API, free of charge. That basically means developers can tap into Face.com’s face detection and recognition technology and create brand new ways for friends to engage through photos at zero cost. Hard to beat that offer.</li>
<li>It is also well funded: &#8220;<a href="http://www.crunchbase.com/company/yandex">Yandex<img id="snap_com_shot_link_icon" src="http://i.ixnp.com/images/v6.19.0.1/t.gif" alt="" /></a>, operator of Russia’s largest search engine, has invested in Tel-Aviv based facial recognition technology startup <a href="http://www.face.com/">Face.com<img id="snap_com_shot_link_icon" src="http://i.ixnp.com/images/v6.19.0.1/t.gif" alt="" /></a>, marking its first investment in an Israeli company. In total, Face.com has raised<a href="http://www.crunchbase.com/company/face-com">$4.3 million<img id="snap_com_shot_link_icon" src="http://i.ixnp.com/images/v6.19.0.1/t.gif" alt="" /></a> in Series B funding in a round led by previous investor <a href="http://www.crunchbase.com/financial-organization/rhodium">Rhodium<img id="snap_com_shot_link_icon" src="http://i.ixnp.com/images/v6.19.0.1/t.gif" alt="" /></a>.&#8221;</li>
</ul>
<li>DigitalSmiths.com offer facial recognition as part of its suit of metadata-generation tools, but I&#8217;ll expand in the next section.</li>
<li>The <a href="http://perception.csl.illinois.edu/recognition/Home.html" target="_blank">University of Illinois</a> is developing <a href="http://www.futurity.org/science-technology/about-face-with-new-recognition-software/">a face recognition system that is remarkably accurate in realistic situations</a>.</li>
<p>I want to draw special attention to<strong> </strong><strong><a href="http://www.affectiva.com/">Affectiva</a> </strong>who use the tag line &#8220;Respectful emotional measurement and communication.&#8221; Yes, they read emotion from video.</p>
<blockquote><p>The software “makes it possible to measure audience response with a scene-by-scene granularity that the current survey-and-questionnaire approach cannot,” Mr. Hamilton said. A director, he added, could find out, for example, that although audience members liked a movie over all, they did not like two or three scenes. Or he could learn that a particular character did not inspire the intended emotional response.</p></blockquote>
<p>I fully expect facial recognition to come into the postproduction world rapidly and provide useful metadata. Automatically identifying and labeling people in video content will be very empowering. I do foresee an issue with *automatic* identification with actors and roles in narrative, but the software would surely have the ability to manually tag every instance of &#8220;this identified face&#8221; with a character name instead of the actor&#8217;s name sourced from IMDB!</p>
<p>If software can detect emotional responses to movies, it can detect emotional performances and &#8211; for documentary/reality/news &#8211; detect the emotion in the face to help drive editing.</p>
<p>Oh, if you really want to avoid facial detection? <a href="http://venturebeat.com/2010/07/02/facial-recognition-camouflage/">Social Beat has a few tips</a>.</p>
<h3>Metadata for asset management</h3>
<p>Both DigitalSmiths and Asterpix have tools specifically intended to create visual metadata automatically for asset management and exploitation.</p>
<p><a href="http://news.cnet.com/8301-17939_109-9887619-2.html">Asterpix</a></p>
<blockquote><p>The actual process of machine-tagging involves pulling in imaging data from the video clip and matching it up to whatever text was included by the video&#8217;s creator. Nat Kausik, CEO of Asterpix tells me the process is a little similar to Google&#8217;s search algorithm in creating relevancy based on what bits of parts of the video get the most screen time. For example, in a video of a someone walking through a grocery store there would be a wealth of information about other products and people, but if you&#8217;re focused on that one person for the majority of the clip the engine will pick on it and react accordingly.</p></blockquote>
<p><a href="http://digitalsmiths.com">DigitalSmiths</a></p>
<blockquote><p>Digitalsmiths is the technology leader in the rapidly growing segment of video search and recommendation. Digitalsmiths Seamless Discovery™ has revolutionized the accuracy and ease with which end-users find relevant, personalized entertainment across multiple channels and devices.</p>
<p>From powering tablet-enabled set-top-boxes, to live integration of sporting events, to first-run premium web-content, Digitalsmiths has deep experience and proven results for increasing engagement and viewership though our unique, holistic solutions. Our solutions serve customers that span across all media channels and devices and reach a combined audience of millions of consumers.</p></blockquote>
<h3>Generating other visual metadata</h3>
<p>While we&#8217;re only just seeing facial detection and recognition rolling into useful applications, but already computers are being taught to read the text in images, and process the images themselves to recognize people, place and things.</p>
<p>Text recognition isn&#8217;t exactly new, but:</p>
<p><a href="http://techcrunch.com/2011/01/10/google-goggle-print-ads-sudoku/">Google Goggles Can Now Read Print Ads. Oh, And Play Freaking Sudoku!</a></p>
<p><a title="Permalink to OCRKit 1.2 – The simplest Text Recognition for the Mac" href="http://macmegasite.com/node/10775">OCRKit 1.2 – The simplest Text Recognition for the Mac</a></p>
<h4>Beyond Text</h4>
<p><a href="http://www.nytimes.com/2011/01/02/science/02see.html">Computers That See You and Keep Watch Over You</a></p>
<p>A New York Times article outlining how computer vision is being used:</p>
<blockquote><p>Perched above the prison yard, five cameras tracked the play-acting prisoners, and artificial-intelligence software analyzed the images to recognize faces, gestures and patterns of group behavior. When two groups of inmates moved toward each other, the experimental computer system sent an alert — a text message — to a corrections officer that warned of a potential incident and gave the location.</p>
<p>“Machines will definitely be able to observe us and understand us better,” said Hartmut Neven, a computer scientist and <a title="The Google research blog." href="http://googleresearch.blogspot.com/">vision expert at Google</a>. “Where that leads is uncertain.”</p></blockquote>
<p>The applications are quite amazing. From observing prisoners, to reminding hospital staff to wash their hands if it&#8217;s detected they haven&#8217;t, to a host of other uses, smart computer software is observing us and making accurate observations.</p>
<p><a href="http://techcrunch.com/2008/04/27/google-experiments-with-next-generation-image-search/">Google Experiments With Next Generation Image Search</a></p>
<blockquote><p>Notably, the new image search technology doesn’t just index text associated with an image in determining what’s in it. Google is now talking about using computers to analyze the stuff in photos, and using that to associate it in a ranked way with keyword queries. In effect, they’re talking about something similar to PageRank for images (but without the linking behavior).</p></blockquote>
<p><a href="http://searchengineland.com/teaching-google-to-see-images-10920">Teaching Google To See Images</a></p>
<blockquote><p>Nuno Vasconcelos, a professor of electrical engineering at the UCSD Jacobs School of Engineering, discusses the approach, called Supervised Multiclass Labeling (SML), in a recentnews release from the school (hat tip to Threadwatch for the pointer). Though SML sounds like a mouthful of jargon, what it really amounts to is systematically training a computer to recognize statistically similar objects, and teaching it to differentiate them from other objects that have similar characteristics.</p></blockquote>
<p><a href="http://techcrunch.com/2009/06/22/google-researchers-teach-computers-out-how-to-recognize-images-of-famous-landmarks/">Google Researchers Teach Computers Out How To Recognize Images Of Famous Landmarks</a></p>
<blockquote><p>In the experiment, the researchers fed “an unnamed, untagged picture of a landmark” found on the Internet and the system would spit back the name and location of the landmark, such as the Acropolis in Greece. Each untagged photo was be compared to 40 million GPS-tagged images on Picasa and Panoramio (both owned by Google), as well as related photos found through Google Image Search. Using clustering and new image indexing techniques, the Google researchers were able to identify untagged photos of the same landmarks from different angles and under various lighting conditions.</p>
<p>The researchers report that their system can identify 50,000 landmarks with 80 percent accuracy. I’m not sure that’s quite good enough to even roll that out in a beta product, but if Google can get it to 90 percent or 95 percent that would start to be consumer-friendly.</p></blockquote>
<p><a href="http://web.mit.edu/newsoffice/2009/object-recognition.html">Researchers from MIT&#8217;s CSAIL teach computers to recognize objects</a></p>
<blockquote><p>The system uses a modified version of a so-called motion estimation algorithm, a type of algorithm common in video processing. Since consecutive frames of video usually change very little, data compression schemes often store the unchanging aspects of a scene once, updating only the positions of moving objects. The motion estimation algorithm determines which objects have moved from one frame to the next. In a video, that&#8217;s usually fairly easy to do: most objects don&#8217;t move very far in one-30th of a second. Nor does the algorithm need to know what the object is; it just has to recognize, say, corners and edges, and how their appearance typically changes under different perspectives.</p>
<p>The MIT researchers&#8217; new system essentially treats unrelated images as if they were consecutive frames in a video sequence. When the modified motion estimation algorithm tries to determine which objects have &#8220;moved&#8221; between one image and the next, it usually picks out objects of the same type: it will guess, for instance, that the 2006 Infiniti in image two is the same object as the 1965 Chevy in image one.</p></blockquote>
<p><a href="http://seeit.com">SeeIT.com</a></p>
<p style="padding-left: 30px;"> SeeIT.com is in beta while the company is scaling the index from millions to hundreds of millions of images. You can try it by <a href="http://www.seeit.com/">clicking here</a>, then entering the user name <strong>picture</strong> and password <strong>picture93AE</strong> (exclusive access for Search Engine Land readers). See this<a href="http://www.seeit.com/popularsearches.html">information for new users</a> for more information, including some of the limitations of the current beta release.</p>
<p><a href="http://riya.com">Riya</a></p>
<blockquote><p>Riya started out focusing primarily on facial recognition, but now has a <a href="http://www.riya.com/">beta visual search</a> that lets you find similar faces and objects on many images across the web and then refine your results, using color, shape and texture.</p>
<p>Riya also powers the visually oriented product search service <a href="http://www.like.com/">Like.com</a> that lets you find clothing and a few home furnishing items based on visual similarity. Like also has a “celebrity” search that lets you see what the stars are currently wearing and find similar accoutrements for your own adornment.</p></blockquote>
<p><a href="http://arstechnica.com/old/content/2008/10/darpa-building-search-engine-for-video-surveillance-footage.ars">DARPA building search engine for video surveillance footage</a></p>
<blockquote><p>According to a <a href="https://www.fbo.gov/index?s=opportunity&amp;mode=form&amp;tab=core&amp;id=9a0bebd1e9ca9b36ea5e8a6d293242b5&amp;_cview=0">prospectus</a> written in March but released only this month, the Video and Image Retrieval and Analysis Tool (VIRAT) will enable intel analysts to &#8220;rapidly find video content of interest from archives and provide alerts to the analyst of events of interest during live operations,&#8221; taking both conventional video and footage from infrared scanners as input. The VIRAT project is an effort to cope with a growing data glut that has taxed intelligence resources because of the need to have trained human personnel perform time- and labor-intensive review of recorded video.</p></blockquote>
<p>Check out the &#8220;simple&#8217; diagram accompanying the article. This stuff isn&#8217;t simple.</p>
<p><a href="http://searchengineland.com/videosurf-new-genuinely-radical-video-search-14711">VideoSurf: New, Genuinely Radical Video Search</a></p>
<blockquote><p><a href="http://www.videosurf.com/">VideoSurf</a> is a computer vision search engine that processes all of the kinds of information most video search services do, but then goes a step further, applying a proprietary process using “multigrid fast computation” and some heavy-duty computer processing power to analyze videos, identify people, and extract all kinds of additional information directly from the video itself. Until I saw the demo, I thought this type of technology was still years away.</p></blockquote>
<p><a href="http://www.macnn.com/articles/11/11/23/could.pave.the.way.for.kinect.like.abilities/">Apple wins patent on 3D object-recognition technology</a></p>
<blockquote><p> The USPTO has awarded Apple a patent on <a href="http://macnn.com/rd/235113==http://patft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&amp;Sect2=HITOFF&amp;d=PALL&amp;p=1&amp;u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&amp;r=1&amp;f=G&amp;l=50&amp;s1=8,064,685.PN.&amp;OS=PN/8,064,685&amp;RS=PN/8,064,685" rel="nofollow">3D object-recognition technology</a> that goes well beyond the current <a href="http://macnn.com/rd/235114==http://www.ipodnn.com/articles/09/07/09/face.recognition.patent/" rel="nofollow">face recognition</a> already included in apps such as iPhoto and the iOS 5 camera application, allowing a device to &#8220;build&#8221; a 3D face or object by analyzing the curves, contours and shadows of a 2D image. Such technology would give Kinect-like detection and recognition capabilities to cameras such as those found in iOS and Mac devices.</p></blockquote>
<p>Another benefit of the Polar Rose acquisition, where the technology behind this patent was developed.</p>
<h4>Developer APIs</h4>
<p><a href="http://www.kooaba.com/">Kooaba</a></p>
<blockquote><p>The Swiss company aims to unlock its library of over 10 million images, ranging from album covers to books and movie posters, and provide access to all that precious data via the cloud.</p>
<p>Kooaba hopes that the launch of the API will trigger third-party developers to develop more mobile applications –<a href="http://www.kooaba.com/using-kooaba/on-iphone/">iPhone</a> and <a href="http://www.kooaba.com/using-kooaba/on-android/">Android</a> versions exist already – or tools that tap into social networking services like Facebook and Twitter, etcetera.</p></blockquote>
<p><a href="http://opencv.willowgarage.com/wiki/">OpenCV</a></p>
<blockquote><p>OpenCV (Open Source Computer Vision) is a library of programming functions for real time computer vision.  It has C++, C, Python and soon Java interfaces running on Windows, Linux, Android and Mac. The library has &gt;2500 optimized algorithms<em>.</em></p></blockquote>
<h4>Out at the Leading Edge of technology</h4>
<p id="headline"><a href="http://www.sciencedaily.com/releases/2008/05/080526000936.htm">New Image-Recognition Software Could Let Computers &#8216;See&#8217; Like Humans Do</a></p>
<blockquote><p>Using such small amounts of data per image makes it possible to search for similar pictures through millions of images in a database, using an ordinary PC, in less than a second, Torralba says. And unlike other methods that require first breaking down an image into sections containing different objects, this method uses the entire image, making it simple to apply to large datasets without human intervention.</p></blockquote>
<p><a href="http://www.physorg.com/news/2011-12-artificial-intelligence-images.html">Developing artificial intelligence systems that can interpret images</a></p>
<blockquote><p> Torralba is also attempting to develop systems that can scan a short video clip and predict what is likely to happen next, based on what people or objects are in the scene. To do this, the systems will need to understand what actions each object or person in the scene is capable of making, and what their limitations are. This will allow the systems to make predictions about what each of these entities is likely to do in the near future.</p></blockquote>
<p>We already have facial detection in our software, and identifying the person is definitely coming. There are technologies to recognize a smile in a cheap iPhone app, or recognize human emotion currently exploited for focus group work. There are computers patrolling prison yards and making sure doctors and nurses wash their hands between patients. There is no doubt in my mind that pre-edit processing will give editors name, context and emotion metadata. And smart companies, like us, will exploit that as input for automating certain editing tasks.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2012/01/can-a-computer-really-recognize-an-individual-face-or-a-car/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Adobe Prelude</title>
		<link>http://www.philiphodgetts.com/2012/01/adobe-prelude/</link>
		<comments>http://www.philiphodgetts.com/2012/01/adobe-prelude/#comments</comments>
		<pubDate>Sat, 28 Jan 2012 21:45:13 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=4598</guid>
		<description><![CDATA[Pre-edit Log and rough cut tool from Adobe previewed at Supermeet.
]]></description>
			<content:encoded><![CDATA[<p>At the San Francisco Supermeet Friday 27th January, Adobe&#8217;s Al Mooney revealed a sneak peek at a new application for the Creative Suite called Prelude.<span id="more-4598"></span></p>
<p>Essentially Prelude is an ingest, logging and rough cut tool. It&#8217;s designed to make it easy to add log notes (a.k.a. metadata) quickly and easily, then perform a rough first string out to send to a craft (skilled) editor for the real work. Now obviously, any tool that encourages the entry of metadata is good by me!</p>
<p>So far, there&#8217;s not page for Prelude I can link to but from what I remember the key features are:</p>
<ul>
<li>Works directly off non-tape media</li>
<li>Media can be previewed and selects trimmed before ingest to Prelude</li>
<li>Icons can be &#8220;skimmed&#8221; (I think Adobe has a different term) FCP X style.</li>
<li>Will copy media from the card to a specified location</li>
<li>Will (optionally) do proxy (or really any specified format) generation in parallel with ingest</li>
<li>Has great tools for subclipping and adding log notes easily</li>
<li>Can build simple cuts-only timelines</li>
<li>Exports direct to Premiere Pro or to Final Cut Pro 7 XML.</li>
</ul>
<p>That&#8217;s about what&#8217;s known right now. Given that Al finished off by saying &#8220;See you at NAB&#8221; I can only guess we&#8217;ll learn more then (or perhaps before).</p>
<div id="attachment_4600" class="wp-caption aligncenter" style="width: 867px"><a href="http://www.philiphodgetts.com/wp-content/uploads/2012/01/IMG_13491.jpg"><img class="size-full wp-image-4600" title="IMG_1349" src="http://www.philiphodgetts.com/wp-content/uploads/2012/01/IMG_13491.jpg" alt="" width="857" height="600" /></a><p class="wp-caption-text">Adobe&#39;s Al Mooney previews Prelude. Shown here is the simple timeline assembly tool.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2012/01/adobe-prelude/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Text is the New Timecode</title>
		<link>http://www.philiphodgetts.com/2012/01/text-is-the-new-timecode/</link>
		<comments>http://www.philiphodgetts.com/2012/01/text-is-the-new-timecode/#comments</comments>
		<pubDate>Tue, 17 Jan 2012 00:39:36 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Metadata]]></category>
		<category><![CDATA[The Technology of Production]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=4574</guid>
		<description><![CDATA[Text metadata is becoming increasingly valuable but where are we at, and what technologies and software are available.]]></description>
			<content:encoded><![CDATA[<p>Although I&#8217;ve shamelessly stolen the title from Joe B (@zbutcher on Twitter) I think it does represent a shift in the way we work with our source media.</p>
<p>Now, before I start let me be clear. I am NOT saying timecode is unimportant. I&#8217;m NOT saying that timecode is passé and suddenly irrelevant. Timecode remains incredibly important for any tape based access.</p>
<p>What I am saying is that text search &#8211; or phonetic search derived from text &#8211; is becoming a highly viable, and in many ways superior, way to search and find content. Timecode&#8217;s primary role was in being able to identify any given frame from a tape by tape and frame number. There&#8217;s nothing wrong with that approach, but as humans we don&#8217;t think in &#8220;reel and Timecode&#8221;, which is why text is a superior option.</p>
<p><span id="more-4574"></span></p>
<p>In this technology summary, I&#8217;m going to consider:</p>
<ul>
<li>What tools are at our finger tips right now and their relative merits,</li>
<li>Why speech transcription is ultimately more valuable than phonetic search (long term),</li>
<li>Developments in speech transcription, and</li>
<li>Transcription technologies.</li>
</ul>
<h3>Software and tools available now</h3>
<p>There are two broad approaches to text search in the marketplace today: those that transcribe the speech into text and those that use phonetic search. In the first category we have Adobe Premiere/Audition/Soundbooth using Autonomy&#8217;s technology to transcribe speech to text. In the second we have Avid and Boris using Nexidia&#8217;s phonetic search technology in Media Composer (for Phrasefind and Script Sync) and Soundbite (formerly Get!) respectively. Our own <a href="http://assistedediting.intelligentassistance.com/prEdit/">prEdit</a> edits video using a text transcript.</p>
<p>Phonetic search makes no attempt to understand meaning. Essentially, Nexidia&#8217;s technology &#8220;understands&#8221; what an audio waveform would look like when the text is input. Nexidia scans all the audio files ahead of search and indexes them for search. When you input a word, it estimates the waveform then finds matches for that waveform by comparing the index with the target waveform. This technique is great when you don&#8217;t have a transcript, or have a transcript without any time stamps.</p>
<p>This technology is also used to align a text file (script) with the audio in the audio and video files in Avid&#8217;s ScriptSync with the optional ScriptSync package. This also works great if you have a transcript of interviews to align the audio in the video with the transcript. However, if you have a transcript available, working with prEdit may be faster, manipulating the audio and video by editing and modifying text.</p>
<p>Going the other way &#8211; from speech to text &#8211; provides an additional advantage in that we have meaning associated with the text, and we have a transcript that carries through production into distribution. If you want searchable text for distribution then Adobe&#8217;s professional tools are the only automated tools: phonetic search isn&#8217;t viable because it would require distributing the Nexidia engine into distribution (and with their licensing model, I don&#8217;t see that happening any time soon).</p>
<p>However, the speech to text engine used by Adobe (licensed from Autonomy) is still a work in progress: results can be quite good but the average result is less spectacular, sometimes completely useless. That is why Adobe have added, and strongly pushed, the <a href="http://www.video2brain.com/en/videos-3771.htm">ability to provide the speech transcript engine a guide script</a>. Surprisingly, a guide script alone &#8211; even one where it is an exact transcript &#8211; does not transcribe perfectly. The best results require a trip via Adobe Story. This has the advantage of keeping all punctuation and paragraph breaks  (and automatic subclipping into paragraph subclips in prEdit). For a comparison on accuracy you might be interested in <a href="http://forums.adobe.com/thread/633439?decorator=print&amp;displayFullThread=true">Colin Brougham&#8217;s comparisons</a>.</p>
<p>The big disadvantage to this approach is that it requires a transcript, the very thing most people want to automate because of the cost.</p>
<h3>Why speech transcription is more valuable than phonetic search</h3>
<p>Speech transcription carries meaning. Phonetic search does not carry meaning. This is an important distinction, because it means that speech transcription is valuable metadata as well as a production tool, while phonetic search is a useful production tool, but has no value as metadata away from the Nexidia engine.</p>
<p>Speech transcription can be carried with the media throughout its life, even into edited versions. Indeed, this is Adobe&#8217;s intent. They tend to focus on the speech transcript metadata as part of a distribution strategy more than its use in postproduction. The speech transcript metadata is carried inside media files as XMP metadata. (There are other alternatives for speech transcript metadata in distribution files, but I&#8217;ll discuss that further down.)</p>
<p>Speech transcription can be searched by anyone, at any stage, without needing a proprietary engine.</p>
<p>Speech transcription can be used to derive keywords and other expressions of meaning, which is valuable not only for automating some types of production (some types folks, only some) but extremely valuable as metadata for later finding content.</p>
<p>Transcribed speech is the input to prEdit. Briefly prEdit allows you to easily add metadata (log notes), break interviews into thought segments and eliminate less useful material, before searching and building a story by dragging and dropping text blocks. Editing can continue through the story building process and narration added (converted to voice instantly). At any time you can preview a clip or clips, the full story, or any selected part of the story before exporting to Final Cut Pro 7 or Premiere Pro CS 5.5 or later.</p>
<p>So, while phonetic search is a great post-production tool, transcription into real text has a wider range of uses and uses outside post-production. The only trouble is, speech transcription is expensive!</p>
<h3>Developments in speech transcription</h3>
<p>It&#8217;s a great time to live if you&#8217;re interested in speech transcription. The most significant developments have been hybrid computer-human approaches used by <a href="http://3PlayMedia.com">3PlayMedia</a>, <a href="http://www.speakertext.com/">SpeakerText</a> and <a href="http://www.speechpad.com/">SpeechPad</a> to reduce the cost and time of transcription. These companies particularly are focused on the need for transcription for video in distribution, but are excellent choices for transcriptions for prEdit or other postproduction needs.</p>
<p>Until we get a fully automatic speech transcription with adequate accuracy, these will help. Even with human correction some uncommon words or names will not be transcribe accurately.</p>
<h3>Transcription technologies</h3>
<p>I&#8217;ve talked about Nexidia and the phonetic search technology and why my long term preference is for speech transcription, so it&#8217;s no surprise that I spend some time following what&#8217;s happening with the technology. What is interesting to me is that the two companies who are generally recognized as having the most accurate under the widest range of conditions are not (yet) available for postproduction work.</p>
<p>Google has been amassing huge numbers of examples of speech for recognition &#8211; an important first step to accurate speech recognition &#8211; via the (now defunct) Goog411 initiative. Google have been using this technology for <a href="http://googleblog.blogspot.com/2009/11/automatic-captions-in-youtube.html">automated captioning for YouTube videos</a> and for voicemail transcriptions within Google Voice. As a Google voice customer I&#8217;d say that the results are definitely much better than the attempts by Vonage (laughable) and comparable or better than Premiere Pro&#8217;s use of Autonomy.</p>
<p>More information on Google&#8217;s voice recognition plans can be found in this <a href="http://techcrunch.com/2011/02/13/the-power-of-voice-a-conversation-with-the-head-of-googles-speech-technology/">Techcruch article with Mike Cohen</a>, head of Google&#8217;s Voice recognition efforts. Also interesting to note is that Google is slowly <a href="http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/">opening up an API for their speech recognition efforts</a>, starting with Chrome version 11. How open that is for third party developers to use, remains unknown, but it&#8217;s an interesting direction from the search giant. If the API became open, and this is one of the two most accurate speech transcription technologies, why wouldn&#8217;t savvy developers like us, start to use it and integrate it into our software?</p>
<p>Equally prominent in the speech transcription/recognition community is Nuance, probably best known for powering Dragon Dictate, Dragon Naturally Speaking (and the variations) and the speech recognition component of Apple&#8217;s Siri technology. (Siri adds a lot of powerful tools on top of this basic recognition layer, but if the speech isn&#8217;t recognized accurately nothing good can come from it downstream.) There is no public API for any of Nuance&#8217;s technology (nor Siri for that matter). Nuance tends to do direct deals with companies who want to license its technology &#8211; a fairly standard practice in the technology world.</p>
<p>My fondest hope is that Apple&#8217;s license from Nuance will be extended to OS X and a speech recognition framework be included in OS X for developers. It&#8217;s a fond hope, not anything real!</p>
<p>Google and Nuance have the most accurate technologies that do not require speech training. I&#8217;m a Dictate user but that product uses training to obtain high (very high) accuracy in transcription. To be able to be accurate without needing training is what we need for interview transcriptions for post.</p>
<p style="padding-left: 30px;"><strong>Tip:</strong> If you do have Dictate or any of the PC variants of Nuance&#8217;s products, one technique is to listen to the interview and speak it with your own (trained) voice. I&#8217;m not yet there, but it is possible to speak-as-you-hear (the basis of the common ear-bud presenter trick) for fast, accurate transcription.</p>
<p>Beyond the giants, there are many other technology companies, or open source projects, in the speech recognition field that are worth mentioning. One thing that should be noted is that most of these technologies are cloud based (as is Apple&#8217;s Siri). They work as long as they have a connection to their primary servers.</p>
<p>I&#8217;ll let these folk speak for themselves. Whichever technologies prevail, we&#8217;re definitely seeing a surge in the accuracy and flexibility of speech recognition, that is going to factor in post production in the coming years, beyond where we are at right now.</p>
<p><a href="http://www.creaceed.com/ceedvocalsdk/">SeedVocal SDK</a></p>
<p>Speech Recognition for your iPhone application. The claim is that the technology is:</p>
<blockquote>
<ul>
<li>currently unavailable from Apple APIs</li>
<li>easy to add to your application</li>
<li>convenient for all types of apps: games, fun and promo applications or utilities</li>
<li>suitable for the iPhone and the iPod Touch</li>
<li>cheaper than you might expect.</li>
</ul>
</blockquote>
<p><a href="http://cmusphinx.sourceforge.net/sphinx4/">Sphinx4</a></p>
<blockquote><p>Sphinx-4 is a state-of-the-art speech recognition system written entirely in the Java<sup>TM</sup>programming language. It was created via a joint collaboration between the Sphinx group at Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs (MERL), and Hewlett Packard (HP), with contributions from the University of California at Santa Cruz (UCSC) and the Massachusetts Institute of Technology (MIT).</p>
<p>Sphinx-4 started out as a port of Sphinx-3 to the Java programming language, but evolved into a recognizer designed to be much more flexible than Sphinx-3, thus becoming an excellent platform for speech research.</p></blockquote>
<p id="post-9300"><a title="Permanent Link to Speech Recognition with Javascript; speechapi.com" href="http://ajaxian.com/archives/speech-recognition-with-javascript-speechapi-com" rel="bookmark">Speech Recognition with Javascript; speechapi.com</a></p>
<blockquote><p>With <a href="http://www.speechapi.com/">speechapi.com&#8217;s</a> javascript API, it is possible to build interesting speech-web mashups that include both speech-to-text as well as text-to-speech.</p>
<p>A combination of several technologies and open source tools make this possible. In the browser, Flash is used to access the microphone and stream the audio to an RTMP server. Red5 is used because its a versatile media server that has the benefit of being open source and free.</p></blockquote>
<p><a href="http://julius.sourceforge.jp/en_index.php">Open-Source Large Vocabulary CSR Engine Julius</a></p>
<blockquote><p><strong>&#8220;Julius&#8221;</strong> is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers. Based on word N-gram and context-dependent HMM, it can perform almost real-time decoding on most current PCs in 60k word dictation task. Major search techniques are fully incorporated such as tree lexicon, N-gram factoring, cross-word context dependency handling, enveloped beam search, Gaussian pruning, Gaussian selection, etc. Besides search efficiency, it is also modularized carefully to be independent from model structures, and various HMM types are supported such as shared-state triphones and tied-mixture models, with any number of mixtures, states, or phones. Standard formats are adopted to cope with other free modeling toolkit such as HTK, CMU-Cam SLM toolkit, etc.</p></blockquote>
<p>I have no idea what that means either!</p>
<p><a href="http://scripto.org/">Scripto</a></p>
<blockquote><p>Scripto is a light-weight, open source, tool that will allow users to contribute transcriptions to online documentary projects.  The tool will include a versioning history and full set of editorial controls, so that project staff and manage public contributions.  The design and development of the tool is being supported by grant funding from the <a title="National Endowment for the Humanities" href="http://www.neh.gov/">National Endowment for the Humanities</a>, <a href="http://www.neh.gov/ODH/">Office Digital Humanities</a>, and the <a href="http://www.archives.gov/nhprc/">National Historical Publication and Records Commission</a>.</p></blockquote>
<p><a href="http://www.encodingall.com/feature/media-transcription/">Encoding All</a></p>
<blockquote><p>From an audio or video media, transcription can automatically generate a file ofsubtitles, the keyword list in XML format and the entire plain text.  (That&#8217;s the translation from the French &#8211; speech recognition is not limited to English!)</p></blockquote>
<p>And interesting, although not currently speech recognition:</p>
<p><a href="http://www.soundhound.com/">Soundhound</a></p>
<blockquote><p>Currently, SoundHound’s specialty is delivering information about music. Users can sing or hum a tune into <a href="http://gigaom.com/2010/04/27/soundhounds-pet-project-a-music-search-engine/">its SoundHound app</a> and the app returns the song name, as well as other information. Last week, the company released its Hound app, which can identify when a user says the name of an artist or album.</p>
<p>The slightly frivolous-seeming “name that tune” aspects of SoundHound’s applications belie the seriousness of the technology and business underneath it all. SoundHound has raised $16 million in venture capital and currently has 55 full-time employees. Investors have been attracted to the company by the future potential of SoundHound’s core technology, Mohajer told me. “We own all of our technology, while a lot of other apps in this space license their core technology,” he said. “We built everything in-house and we own all of our intellectual property.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2012/01/text-is-the-new-timecode/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Artificial Intelligence Predicts What Will Happen Next in a Video.</title>
		<link>http://www.philiphodgetts.com/2011/12/artificial-intelligence-predic/</link>
		<comments>http://www.philiphodgetts.com/2011/12/artificial-intelligence-predic/#comments</comments>
		<pubDate>Mon, 12 Dec 2011 23:21:52 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/12/artificial-intelligence-predic/</guid>
		<description><![CDATA[Just imagine what this technology will do when applied to pre-editing products.]]></description>
			<content:encoded><![CDATA[<p>Artificial Intelligence Predicts What Will Happen Next In A Video <a href="http://t.co/TPopRqQq" rel="nofollow">http://t.co/TPopRqQq</a></p>
<p>Many thanks to my friend Don Berube for pointing this out.  While the headline slightly overstates the case, it&#8217;s clear we&#8217;re heading for an era when computers in general will understand meaning and the content of images. <span id="more-4485"></span></p>
<blockquote><p>Face Detection and Recognition is now prevalent in digital cameras which can detect many faces and even recognize those you tag, giving them preference as to focus and exposure. But Antonio Torralba is more concerned with what’s not the face in a photo: his Artificial Intelligence (A/I/) software tags all objects in an image and makes the image more easily searchable so as to be able to place the photo in an appropriate context. This ability will eventually mean that robots can recognize their surroundings, say in a house or office building, based on what furniture and objects they see around them.</p>
<p>Going one step further with this research, Torralba is also developing systems that can scan a short video clip and predict what is likely to happen next, based on the people and objects in the scene. This should eventually allow robots to anticipate how their actions will influence future events.</p></blockquote>
<p>Seems like magic to me, but I seen enough of these type of articles &#8211; fully acknowledging they&#8217;re at the cutting edge of research and many years from practical application in an editing situation &#8211; to know that the future will be computer aided way beyond what we have now.</p>
<p>I&#8217;ve long said that the application of at least basic editing algorithms (such as in our <a href="http://assistedediting.intelligentassistance.com/FirstCuts/"><em>First Cuts</em> </a>software) isn&#8217;t that difficult being largely a matter of modeling the behavior of human edits and what drives their decisions. The problem is having the source information &#8211; metadata if you will &#8211; to feed the algorithm. Speech transcription, facial detection, emotion detection, and now software to tag all objects in the image and predict what will happen next all point to a much more computer-assisted editing future than most expect.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/12/artificial-intelligence-predic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Who&#8217;s looking at you? Apparently everything!</title>
		<link>http://www.philiphodgetts.com/2011/11/facial-recognition-app-detects/</link>
		<comments>http://www.philiphodgetts.com/2011/11/facial-recognition-app-detects/#comments</comments>
		<pubDate>Wed, 09 Nov 2011 22:53:26 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Assisted Editing]]></category>
		<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/11/facial-recognition-app-detects/</guid>
		<description><![CDATA[A new camera app that won't take a picture until it sees a smile; and a stand that follows you around, but not triggered by axial recognition!]]></description>
			<content:encoded><![CDATA[<p>Two stories today that caught my attention are:</p>
<p><strong><a href="http://t.co/sj6zAr1B">Facial Recognition App Detects, Captures Smiles</a></strong> Technology intrudes more &amp; more into &#8220;human&#8221; territory</p>
<p><a href="http://t.co/a3XOY1MP"><strong>Meet Swivl, The Motion Tracking iPhone Dock That Always Keeps You On Camera</strong> </a>More and more automatics!</p>
<p>Now, it would be really cool if Swivl tracked you and kept you on camera using facial detection but it does not: instead it uses a hand held transmitter/controller to &#8220;know&#8221; where to point the camera.  Even with that it will make a great addition to a video blogger, web episode producer as the producer/talent can move and have the camera follow them as they do.</p>
<p><span id="more-4405"></span>The SmileClick App is much more interesting:</p>
<blockquote><p>SmileClick is a newly launched, unique application that uses proprietary facial recognition software to detect how big a photo subject is smiling. Once the app determines that the person to be photographed is sporting a large, genuine smile, it automatically snaps the picture at the right time, helping to create happy memories.</p></blockquote>
<p>Waltham MA based <strong> </strong><strong><a href="http://www.affectiva.com/">Affectiva</a> </strong>takes it one step further:</p>
<blockquote><p>&#8230;has facial recognition software that can accurately determine the difference between a happy smile, an embarrassed smile and a smirk!</p></blockquote>
<p>As well as other emotions. Currently used only in focus groups and the like, wouldn&#8217;t it make a great source of metadata if, as well as the person&#8217;s face being (accurately) identified, there was another tag &#8220;smiling&#8221;, &#8220;embarrassed&#8221;, &#8220;annoyed&#8221;, et. al. I see it as inevitable but a few years off.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/11/facial-recognition-app-detects/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The New iPhone&#8217;s Face Recognition Capabilities.</title>
		<link>http://www.philiphodgetts.com/2011/10/the-new-iphones-face-recognit/</link>
		<comments>http://www.philiphodgetts.com/2011/10/the-new-iphones-face-recognit/#comments</comments>
		<pubDate>Wed, 05 Oct 2011 21:45:01 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/10/the-new-iphones-face-recognit/</guid>
		<description><![CDATA[As well as "redefining privacy" facial recognition is great metadata!]]></description>
			<content:encoded><![CDATA[<p>The New iPhone&#8217;s Face Recognition Capabilities Could Redefine Privacy <a href="http://t.co/WayE1Abv" rel="nofollow">http://t.co/WayE1Abv</a></p>
<p>Following on the heels of yesterday&#8217;s post about <a title="Facial recognition in the cloud" href="http://www.philiphodgetts.com/2011/10/facial-recognitiion-in-the-clo/">facial recognition in the cloud</a> here&#8217;s information on how Apple are applying the technology they gained when they acquired Polar Rose last September, at least within iOS frameworks.</p>
<blockquote><p>When coders dug through Apple&#8217;s beta versions of iOS5 they found what were deemed to be &#8220;<a href="http://gigaom.com/apple/apples-ios-facial-recognition-could-lead-to-kinect-like-interaction/" target="_blank">highly sophisticated</a>&#8221; API systems that let an iPhone automatically track eye positions and mouth positions (so the angle to the user, and possibly where their attention is being directed could be calculated) as well as passing key data on to a face recognition algorithm that would be accessible to all apps&#8230;not just Apple&#8217;s own.</p></blockquote>
<p>Combine this with the Nuance-licensed voice recognition technology in Siri &#8211; also new with iOS 5 and iPhone 4S &#8211; and we have the foundation of a very powerful metadata generation system that would automate naming people in clips and form the basis of speech transcription and then keyword extraction.</p>
<p>In my dreams these are technologies that will come to Final Cut Pro X 10.2 or 10.3 in future years.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/10/the-new-iphones-face-recognit/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Why are roles superior to tracks?</title>
		<link>http://www.philiphodgetts.com/2011/09/roles-superior-to-tracks/</link>
		<comments>http://www.philiphodgetts.com/2011/09/roles-superior-to-tracks/#comments</comments>
		<pubDate>Tue, 20 Sep 2011 15:04:42 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Apple Pro Apps]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=4268</guid>
		<description><![CDATA[Final Cut Pro X's first update has one very powerful new metadata feature that makes the absence of tracks more palatable to those who used them as metadata.]]></description>
			<content:encoded><![CDATA[<p>When I <a title="How Tracks evolved from function to Metadata" href="http://www.philiphodgetts.com/2011/09/how-tracks-evolved-from-function-to-metadata/">wrote on Saturday</a> about how tracks have evolved from their compositing role, to one where they became defacto metadata I had no idea Apple were about to release the first Final Cut Pro X update (I only found out on Monday). I also wrote in <em><a href="http://www.philiphodgetts.com/books/conquering-metadata-fcpx/">Conquering the metadata foundations of Final Cut Pro X</a>:</em></p>
<blockquote><p>Apple teases us with Audio Role metadata that seems to have no current use within Final Cut Pro X.</p>
<p>If you open the Info Pane and select the Info tab, you will see the pop-up menu in Figure 20.1. showing the Roles</p>
<p>One magic way for this to be useful would be as a solution to the missing audio output options. In the near-magic, near-future I expect that a future version of Final Cut Pro X will use this Audio Role metadata to route audio outputs. At version 1 Final Cut Pro X’s audio output options are very basic, and there’s improvement coming, for sure.</p></blockquote>
<p>So you can imagine how excited I was to open Final Cut Pro X 10.0.1 and see that those fairly limited audio roles (no doubt implemented specifically to enable export to OMF/AAF) had evolved into not only customizable Audio Roles, but to have the same option available for Video clips as well. And a Title Role comes rolled in!</p>
<p><span id="more-4268"></span></p>
<div id="attachment_4270" class="wp-caption alignright" style="width: 310px"><a href="http://www.philiphodgetts.com/wp-content/uploads/2011/09/Roles-instead-of-Clip-Names.png"><img class="size-medium wp-image-4270" title="Roles instead of Clip Names" src="http://www.philiphodgetts.com/wp-content/uploads/2011/09/Roles-instead-of-Clip-Names-300x187.png" alt="" width="300" height="187" /></a><p class="wp-caption-text">Display the role, or the clip name.</p></div>
<p>Commenter <a href="http://media.keepertech.com/" rel="external nofollow">Andrew Richards</a> nailed it! Second commenter <a href="http://www.philiphodgetts.com/2011/09/how-tracks-evolved-from-function-to-metadata/#comment-91172">Marcus R. Moore</a> also called out a real need. Well in the new release, not only can you use Roles in a Project to enable or disable clips assigned that Role, but selecting a Role highlights all Clips in the Project that have that Role assigned (either Audio or Video). Marcus kind of also gets his wish, because there&#8217;s a new option in the Project View settings to display the Role on clips, as an alternate to displaying the Clip name. It&#8217;s a non-destrcutive toggle.</p>
<p>There can be different Roles assigned for Audio and Video on the same clip and Roles can have sub-roles.</p>
<div id="attachment_4269" class="wp-caption aligncenter" style="width: 310px"><a href="http://www.philiphodgetts.com/wp-content/uploads/2011/09/Roles.png"><img class="size-medium wp-image-4269 " title="Roles" src="http://www.philiphodgetts.com/wp-content/uploads/2011/09/Roles-300x83.png" alt="" width="300" height="83" /></a><p class="wp-caption-text">When selected all shots tagged with B-roll highlight in the Project. They can be deactivated temporarily with a single checkbox. (click to enlarge)</p></div>
<p>In fact, I think the implementation of Roles in Final Cut Pro X really completes the metadata story, and makes the Magnetic Timeline more valuable.</p>
<p>Here&#8217;s some of the ideas we&#8217;ve been thinking of:</p>
<ul>
<li>Multi-language Titles. Each language is a Role that is applied to the Title. With a single click on a checkbox, all titles from a language set can be enabled or disabled.</li>
<li>Identifying A-roll and B-roll so one can turn off all B-roll to see the underlying A-roll story.</li>
<li>Being able to selectively turn on or off the tracks that make up the audio mix, because now we can not just assign a Role of &#8220;SFX&#8221; or Dialog, but audio could be tagged with a Role that was, for example &#8220;Background Farm noise submix&#8221;. When (if) Apple implement a mixer in Final Cut Pro X then those clips could be assigned a single fader by Role name. Much smarter than track number mixing.</li>
</ul>
<p>Roles now appear in the Event Browser&#8217;s list view as an optional column. This also means they are now available as a filterable field in the Filters window, or for Smart Collections.</p>
<p>I think (i.e. my opinion) that Roles provide all the advantages of using Tracks as de-facto metadata by making the job explicit and allowing us to use that metadata in a Smart Collection.</p>
<h3>In other Final Cut Pro X update news</h3>
<p>I truthfully have been focused on only part of the new version &#8211; the XML import and export specifically &#8211; so that we can have a new product to be announced at LAFCPUG @ DVExpo on Wednesday night, so I haven&#8217;t explored the other changes too much. I did go through every menu and dialog to find differences, but I&#8217;m sure this is not comprehensive.</p>
<p><strong>Xsan is now officially supported for shared storage.</strong> At this point it&#8217;s shared storage of Events or Projects, not shared Events or Projects. Each Event or Project can only have one user active at a time.</p>
<p><strong>Trial version.</strong> Check it out for yourself without commitment or cost.  Apparently fully functional but time limited, so you can even finish something for yourself.</p>
<p><strong>Improvements to Share</strong> with better movie export options (from memory)</p>
<p><strong>API </strong>for camera manufacturers to add their own format support. So, if we expect or want Alexa RAW or R3D native, then those companies apparently will be stepping up to make it happen. Definitely an improvement though, because users should not have to wait for a Final Cut Pro X release in order to get support for a new format. The manufacturer can develop it themselves and have it available when their camera is launched.</p>
<p>This release was pretty much what I expected: XML and at least one other banner feature, a lot of tweaking and (presumably) bug fixes.  They&#8217;ve knocked off a good number of features promised in the earlier FAQ, and Roles are an adequate (I say vastly better) way of achieving the same goal as track layouts did.</p>
<p>So, it seems they&#8217;re on the right path.</p>
<p><strong>UPDATE:</strong> Turns out there&#8217;s more to Roles than meets the eye. Not only can you enable/disable particular roles in the Final Cut Pro X timeline, but there are new export options. The Export Media dialog now has options for exporting &#8220;Roles as Multitrack QuickTime Movie&#8221;, &#8220;Roles as Separate Files&#8221;, &#8220;Video Roles Only as Separate Files&#8221; and &#8220;Audio Roles Only as Separate Files&#8221;.</p>
<p><strong>UPDATE 2:</strong> Here&#8217;s the <a href="http://support.apple.com/kb/HT4589">knowledge base article</a> listing the new features and bug fixes.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/09/roles-superior-to-tracks/feed/</wfw:commentRss>
		<slash:comments>38</slash:comments>
		</item>
		<item>
		<title>How Tracks evolved from function to Metadata</title>
		<link>http://www.philiphodgetts.com/2011/09/how-tracks-evolved-from-function-to-metadata/</link>
		<comments>http://www.philiphodgetts.com/2011/09/how-tracks-evolved-from-function-to-metadata/#comments</comments>
		<pubDate>Sat, 17 Sep 2011 21:15:20 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=4258</guid>
		<description><![CDATA[As NLEs have gained more tracks, the tracks have taken on significance as metadata.]]></description>
			<content:encoded><![CDATA[<p>While watching the<a href="http://www.youtube.com/watch?v=g6Jl19uSZ6s&amp;feature=channel_video_title"> LAFCPUG &#8220;X Night&#8221; videos</a> I noted that Michael Wohl advises that &#8220;all edits should be in V1&#8243; and not doing that is a sign of a failure to commit.  Similarly reviewing some FCP 1 release videos, they once again (and again from Michael) seem to advocate a mostly single track approach.  That would certainly parallel historic ways of working with video or film where only one track, or an A/B configuration was standard.</p>
<p>I think the use of tracks has evolved since then. One of the reasons that people became concerned when Final Cut Pro X took away the traditional track layout was simply because those tracks have become valuable metadata. A typical audio example might be</p>
<p><span id="more-4258"></span></p>
<ul>
<li>Audio 1-4  Dialog allowing for stereo channel overlap</li>
<li>Audio 5-8 Music, allowing for overlapping stereo</li>
<li>Audio 9-12 (or more) Spot Effects.</li>
</ul>
<p>That&#8217;s an indicative example, not prescriptive.</p>
<p>Similarly in the video tracks I&#8217;ve used layouts that had:</p>
<ul>
<li>On-camera footage on Video 1</li>
<li>B-roll on Video 2 (and 3)</li>
<li>Titles on Video 4 (and if there were more than one language, one track for titles in each language).</li>
</ul>
<p>With that configuration one could easily turn off b-roll, or titles, to view the underlying video. Or turn on only one language track of titles at a time.</p>
<p>Some usage was as a workaround for the lack of something like Final Cut Pro X&#8217;s Auditions feature, but that&#8217;s not really a metadata-focused use and it doesn&#8217;t count here.</p>
<p>Simply because they were there, tracks evolved from being there, without specific purpose, to being used to carry very valuable metadata.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/09/how-tracks-evolved-from-function-to-metadata/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Metadata looms large at IBC</title>
		<link>http://www.philiphodgetts.com/2011/08/metadata-looms-large-at-ibc-ht/</link>
		<comments>http://www.philiphodgetts.com/2011/08/metadata-looms-large-at-ibc-ht/#comments</comments>
		<pubDate>Mon, 22 Aug 2011 22:18:12 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/08/metadata-looms-large-at-ibc-ht/</guid>
		<description><![CDATA[Broadcast Engineering looks at what's coming up around metadata at IBC, and I'll be sharing at DV Expo.]]></description>
			<content:encoded><![CDATA[<p>Metadata looms large at IBC <a rel="nofollow" href="http://t.co/gUiIvHX">http://t.co/gUiIvHX</a> I&#8217;ll be teaching Metadata &amp; Asset Mngmnt  #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23dvExpo">dvExpo</a> dvExpo.com code SFC11 to save money.</p>
<p>As expected, metadata is going to be a dominant topic at IBC, whether identified as such, or under the heading of Media Asset Management (MAM), because all media management is done using metadata, which is why I&#8217;ve combined them in my <strong><a href="http://www.dvexpo.com/conference/session_detail.php?sid=934">Using Metadata For Production and Asset Management</a> </strong>day at DV Expo. (Check out<a href="http://www.philiphodgetts.com/2011/07/my-dv-expo-topics-metadata-a/"> my other sessions at DV Expo</a>.)</p>
<p><span id="more-4154"></span>Philip Hunter previews where metadata is heading, with a surprise resurgence of interest in MPEG-7 or the principles involved, at least.</p>
<blockquote><p>MPEG-7 evolved from ideas developed in the IT world of object-oriented programming, where an objective was to describe different types of data including images and sound. Video did not really figure then, so MPEG-7 picked up some of the object-oriented ideas and took them further by adding a temporal dimension for the moving image. This led to the creation of nine subclasses to categorize content within an AV sequence. Spatial and temporal segments are categorised separately and then combined as necessary</p></blockquote>
<p>I promise, I will translate these concepts into language people can understand &#8211; at least people who aren&#8217;t high level geeks, but still need to understand and use these ideas.</p>
<p>Hunter also reiterates the differing needs between metadata in the production phase, and what&#8217;s needed for distribution. He also notes the cost of producing metadata and therefore the necessity for moving metadata between use cases.</p>
<p>Finally he previews the vendors who will have new metadata or MAM related products at IBC.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/08/metadata-looms-large-at-ibc-ht/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Can a computer Predict a Hit Movie or Song?</title>
		<link>http://www.philiphodgetts.com/2011/08/can-a-computer-predict-a-hit-m/</link>
		<comments>http://www.philiphodgetts.com/2011/08/can-a-computer-predict-a-hit-m/#comments</comments>
		<pubDate>Thu, 04 Aug 2011 22:38:06 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/08/can-a-computer-predict-a-hit-m/</guid>
		<description><![CDATA[Apparently computer can predict a movie, song or TV show's earning potential, far more accurately than anything previous.]]></description>
			<content:encoded><![CDATA[<p>Can a computer Predict a Hit Movie or Song? <a rel="nofollow" href="http://tinyurl.com/3g8ovfk">http://tinyurl.com/3g8ovfk</a> If  you mean profitability,  yes. Fascinating use of neural networks.</p>
<p>This is a long, and not new, article that rambles through a fascinating story of how a lawyer, &#8220;Mr Pink&#8221;, &#8220;Mr Brown&#8221; and &#8220;Mr Bootstrap&#8221; collectively cracked the code for predicting the profitability of movies, TV shows and (separately) another team shows the likelihood of whether a song is going to be a hit.</p>
<p>The specifics of how they achieved both breakthroughs is interesting: have the computer software (usually some sort of neural net) analyze existing successes &#8211; music or movie.  It then analyzes new music or movie proposals to determine whether it is likley be be a hit (music) or how much money it will make at the box office (movie).</p>
<p><span id="more-4105"></span>The head of hit-predictor Platinum Blue, Mike McCready claims:</p>
<blockquote><p>Record executives have tended to be Humean: though they can tell you how they feel when they listen to a song, they don’t believe anyone can know with confidence whether a song is going to be a hit, and, historically<strong>, fewer than twenty per cent of the songs picked as hits by music executives</strong> have fulfilled those expectations. Platinum Blue thinks it can do better. It has a proprietary computer program that uses “spectral deconvolution software” to measure the mathematical relationships among all of a song’s structural components: melody, harmony, beat, tempo, rhythm, octave, pitch, chord progression, cadence, sonic brilliance, frequency, and so on. On the basis of that analysis, the firm believes i<strong>t can predict whether a song is likely to become a hit with eighty-per-cent accuracy.</strong></p>
</blockquote>
<p>The emphasis is mine in case you didn&#8217;t pick that the software was four times more likely to predict a hit song than experienced record company executives &#8211; the people who make final decisions on these things! (Quite separately, how does anyone with a 20% success rate keep their job? Just asking.)</p>
<p>What is interesting is that it takes no &#8220;industry knowledge&#8221;:</p>
<blockquote><p>McCready didn’t care about who the artist was, or the cleverness of the lyrics. He didn’t even have a way of feeding lyrics into his computer. He cared only about a song’s underlying mathematical structure.</p>
</blockquote>
<p>Similarly the movie profitability prediction software was built without regard to &#8220;artistic&#8221; elements at all. It picks up on plot points, settings, and other aspects to determine &#8211; based entirely on what&#8217;s come before &#8211; how successful a movie will be.</p>
<p>Both technologies can be used to fine tune and re-predict the likelihood of a success. Now, I have all sorts of concerns about the application of the technology and how it might &#8220;force out&#8221; small, niche movies or music that wouldn&#8217;t make the hit status but would otherwise still be a satisfactory story and a profitable project.</p>
<p>In both instances it is metadata at the foundation of the input to the software: for music metadata about mathematic relationships within the music; for movies the details of plot points, setting, story, characters, etc. It&#8217;s another way that metadata makes the data more valuable by fine tuning the music or the plot/characters to make the project more profitable. Not exactly classic metadata!</p>
<p>By the way, the first half of the article I linked to above is the most salient &#8211; mostly in the middle. The end is a verbatim account of a meeting and doesn&#8217;t really go anywhere. I kept looking for the conclusion.</p>
<p>And don&#8217;t forget, if you&#8217;re interested in Metadata, I have a book &#8211; <em><a href="http://www.philiphodgetts.com/books/conquering-metadata-fcpx/">Conquering the Metadata Foundations of Final Cut Pro X</a> &#8211; </em>and an upcoming full day seminar at DV Expo &#8211; <em><a href="http://www.dvexpo.com/conference/session_detail.php?sid=934">Using Metadata for Production and Asset Management</a> </em>coming up on September 21.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/08/can-a-computer-predict-a-hit-m/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>My DV Expo Topics</title>
		<link>http://www.philiphodgetts.com/2011/07/my-dv-expo-topics-metadata-a/</link>
		<comments>http://www.philiphodgetts.com/2011/07/my-dv-expo-topics-metadata-a/#comments</comments>
		<pubDate>Tue, 26 Jul 2011 21:45:41 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[The Business of Production]]></category>
		<category><![CDATA[The Technology of Production]]></category>
		<category><![CDATA[Video Technology]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/07/my-dv-expo-topics-metadata-a/</guid>
		<description><![CDATA[DV Expo is coming up September 20-22, and I'm teaching three one-day classes.]]></description>
			<content:encoded><![CDATA[<p>My DV Expo topics</p>
<p><strong>9-5 September 20 </strong><a href="http://www.dvexpo.com/conference/session_detail.php?sid=933"><strong>Basic Tech for Producers (and recent Film School Graduates)</strong></a></p>
<p>In this session, technology expert and DV magazine contributor Philip Hodgetts will cover the technological choices in production and post in a non-geeky way to help producers — and others without a technical background — make good technology choices for their productions. From formats to software choices; selecting cameras to creating Web video; designing graphics that will work and much more.  PRICE: $195 ($245 after Aug 31) <a href="https://elandregistration.com/v2/elandreg/reg1.php?e=5pd%2FOVveL1o%3D">Click here to register now. </a></p>
<p><strong>9-5 September 21 </strong><a href="http://www.dvexpo.com/conference/session_detail.php?sid=934"><strong>Using Metadata For Production and Asset Management</strong></a></p>
<p>Metadata is becoming increasingly important throughout the production cycle&#8211;from camera to asset management. In this session learn about the types of metadata in use; how each major NLE (Final Cut Pro 7, Final Cut Pro X, Premiere Pro CS 5.5 and Media Composer 5.5) handles metadata and how we can use that metadata to speed postproduction and VFX. Once post is done, assets need to be management through through distribution and repurposing. What tools are available, how are they used and how do they fit into the metadata structures promoted by SMPTE and other standards bodies.  PRICE: $195 ($245 after Aug 31) <a href="https://elandregistration.com/v2/elandreg/reg1.php?e=5pd%2FOVveL1o%3D">Click here to register now. </a></p>
<p><strong>9-5 September 22 </strong><a href="http://www.dvexpo.com/conference/session_detail.php?sid=935"><strong>Avoiding Postproduction Nightmares</strong></a></p>
<p><strong><span style="font-weight: normal;">Post expert and DV magazine contributor Philip Hodgetts details the most common (and costly) problems inadvertently created during production that will be “fixed in post.” From color correction to audio, and editing to the final QC pass on deliverables, he’ll not only reveal the tricks of the trade that he’d use to save your production, but also explain how you can avoid these costly issues in the first place. </span> <span style="font-weight: normal;">PRICE: $195 ($245 after Aug 31) | </span><a href="https://elandregistration.com/v2/elandreg/reg1.php?e=5pd%2FOVveL1o%3D"><span style="font-weight: normal;">Click here to register now. </span></a></strong></p>
<p><strong><span style="font-weight: normal;">In the light of full disclosure, I certainly expect to be paid but I always deliver good value. There will be some overlap between the Basic Tech and Avoiding Postproduction Nightmares sessions as they both seek to make the technology understandable, but with a different focus to each day&#8217;s class.</span></strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/07/my-dv-expo-topics-metadata-a/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Smile, You&#8217;re On Everyone&#8217;s Camera</title>
		<link>http://www.philiphodgetts.com/2011/07/smile-youre-on-everyones-ca/</link>
		<comments>http://www.philiphodgetts.com/2011/07/smile-youre-on-everyones-ca/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 21:12:04 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/07/smile-youre-on-everyones-ca/</guid>
		<description><![CDATA[What does ubiquitous facial recognition have to offer postproduction?]]></description>
			<content:encoded><![CDATA[<p>Smile, You&#8217;re On Everyone&#8217;s Camera <a rel="nofollow" href="http://tinyurl.com/5w2p4qb">http://tinyurl.com/5w2p4qb</a></p>
<p>The article is about ubiquitous facial recognition spurred by a report of a new app for police that allows facial recognition at five feet away. To be clear, many consumer still cameras, and some software, does facial detection: that is there is one or more faces in the picture. Some even recognize when people are smiling, but they do not identify the individual. Apple&#8217;s iPhoto and others try and do facial recognition but my experience to date is that it&#8217;s been very hit or miss. Apple purchased a Swedish company last year to improve it&#8217;s facial recognition technology.</p>
<p>Clearly others already have better technology and it has been pitched for law enforcement work for a long time. However, it&#8217;s the postproduction implications that interest me. If we can have a software tool identify all the people in our footage, at lest to the stage of identifying each instance of the individual. Reading through the article it is likely the name could be discovered or derived from Facebook or other social network or public record. At worst the person would need to be manually named only once.</p>
<p>For a metadata-based application each clip could be tagged with the person&#8217;s ID for as long as they&#8217;re in the shot.</p>
<p>We would end up with &#8216;bins&#8221; for each individual.</p>
<p>Identifying people in shots is Derived Metadata and then can be used as input into other smart algorithms to take more of the boring out of post.</p>
<p>There are a lot of other interesting applications and implications of this increasingly popular (and capable) technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/07/smile-youre-on-everyones-ca/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What did UPS just deliver? The first copies of my new book.</title>
		<link>http://www.philiphodgetts.com/2011/06/what-did-ups-just-deliver-the/</link>
		<comments>http://www.philiphodgetts.com/2011/06/what-did-ups-just-deliver-the/#comments</comments>
		<pubDate>Tue, 28 Jun 2011 01:24:56 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Apple Pro Apps]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/06/what-did-ups-just-deliver-the/</guid>
		<description><![CDATA[The parallels between this production and the philosophy of Final Cut Pro X.]]></description>
			<content:encoded><![CDATA[<p>What did UPS just deliver? The very 1st copies of  my new FCP X metadata book, Conquering the metadata foundations of Final Cut Pro X. Amazon shortly. PDF is $4.95 under the Books menu above!</p>
<p><span id="more-3932"></span>Apple provided me with early access preview to Final Cut Pro X in a private meeting on June 13 &#8211; in the morning.  That afternoon and next morning I read the manual cover to cover (while this shiny new loaner Four Core i7 laptop that Apple had loaned me laughed at me).  Then I explored it as much as I could that day and next morning.</p>
<p>Thursday I outlined the chapters and worked out how to integrate what I already had from the Mastering Metadata webinar I taught for Larry Jordan a few months back. Revised it became the second half of the book &#8211; the deeper background foil to the first half&#8217;s very practical approach.</p>
<p>Friday morning I started writing; continued Saturday, Sunday and Monday and finished late afternoon Monday. A cover still needed to be designed, and some graphic work done, and of course it needed to be proof read, a task ably performed by my partner in most things, Greg Clarke.</p>
<p>What&#8217;s interesting is the parallels what seems like a design philosophy behind Final Cut Pro X. From what I&#8217;ve done in Final Cut Pro X, it seems to me to encourage a much more polished project along the way. It&#8217;s as easy to add an animated, high production value title as it would be to add a placeholder to remind you to do it later, as I&#8217;ve always done.</p>
<p>With the way clips (and titles) stick to other clips, I feel we&#8217;re encouraged to add polish as we go, by making ti no extra pain to add the polish: a little color touch up; maybe a reposition; animate some clips. These are all so easy in Final Cut Pro X that I have to think this is part of the design philosophy.</p>
<p>What&#8217;s that got to do with the book? Every other book I&#8217;ve written or attempted to write, the draft was written in a Word processor and careful track had to be kept, with strict formatting guidelines, of what images were being added later in layout. It&#8217;s not until much later in the process, after the text is finalized, that it goes to layout.</p>
<p>That process wasn&#8217;t going to work in this case, so as I wrote on my computer, I would have Apple&#8217;s loaner beside me, snap a screen shot with the built in tools that are not perfect but functional. Rename it, drag it to iChat and dropped it via Bonjour to my own laptop. Then dragged the imaged directly from iChat to Pages and into the layout. It turned out to be a very efficient workflow.</p>
<p>When I needed to take a break from that writing, Greg would review what I&#8217;d written, and it&#8217;s much more precise and readable because of it. Those would be the times I wrote last week&#8217;s blog posts, among other things.</p>
<p>Greg also, in parallel set up the book page, and all the consequent eCommerce stuff.</p>
<p>After I finished on Monday, I started on the cover design, which took a couple of hours while Greg gave the new material a revision before we both had one final read through and check for layout issues.</p>
<p>I left the final version as a PDF with cover for Greg when I fell asleep around 1:30 Tuesday morning, and it was on sale by 9:30 am.</p>
<p>Then the cover design got a slight revision and it was off to upload to Amazon Createspace and prep it for printing. And today, five working days after I finished writing, printed copies are on my doorstep.</p>
<p>It&#8217;s a wonderful time to be alive, don&#8217;t you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/06/what-did-ups-just-deliver-the/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Show me the (meta) data</title>
		<link>http://www.philiphodgetts.com/2011/06/show-me-the-meta-data-http/</link>
		<comments>http://www.philiphodgetts.com/2011/06/show-me-the-meta-data-http/#comments</comments>
		<pubDate>Mon, 27 Jun 2011 20:09:48 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/06/show-me-the-meta-data-http/</guid>
		<description><![CDATA["Data about data is more important than ever" Seth Godin on metadata.]]></description>
			<content:encoded><![CDATA[<p>Show me the (meta) data <a rel="nofollow" href="http://tinyurl.com/5ts48nw">http://tinyurl.com/5ts48nw</a></p>
<p>Given that Final Cut Pro X is so heavily built on metadata, I thought this article was relevant. Although Godin is talking about the metadata we leave behind as we work or play across the Internet (and our purchase history), it helps with the understanding of what metadata is, in a broader sense.</p>
<p>In any case the conclusion is very relevant:</p>
<blockquote><p>Data about data is more important than ever, and being on the side of the person creating that data is a smart place to be.</p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/06/show-me-the-meta-data-http/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Why did Apple base Final Cut Pro X on Metadata?</title>
		<link>http://www.philiphodgetts.com/2011/05/why-did-apple-base-final-cut-pro-x-on-metadata/</link>
		<comments>http://www.philiphodgetts.com/2011/05/why-did-apple-base-final-cut-pro-x-on-metadata/#comments</comments>
		<pubDate>Tue, 31 May 2011 21:08:54 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Apple Pro Apps]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=3654</guid>
		<description><![CDATA[Or Why metadata is increasingly important in modern production because it can be used to automate the dull tasks.]]></description>
			<content:encoded><![CDATA[<p>Last September, in my <em><a href="http://www.philiphodgetts.com/2010/09/what-should-apple-do-with-final-cut-pro/">What should Apple do with Final Cut Pro</a></em> article, one of the bullet points was:</p>
<blockquote><p>Better media and metadata management.</p></blockquote>
<p>Right in with 64 bit, all processors used and the use of the GPU. I immediately qualified myself:</p>
<blockquote><p>Ok, there probably aren’t that many people clamoring for better metadata management, but it’s a significant part of better media management, and crucially important for the future of automation in post production.</p></blockquote>
<p>Then toward the end of that article, under the heading of what I thought that Apple <em>should</em> do, other than what everyone expected, I said:</p>
<blockquote><p><strong>More metadata automation</strong>. Well, part of me hopes they won’t because that’s my field, but it would be nice to see source metadata being used to auto-populate Titles or Master Templates (like iMovie for iPhone does).</p></blockquote>
<p>Truthfully, I was indulging in some wishful thinking. I still don&#8217;t think we&#8217;ll get &#8211; at least not with Final Cut Pro X v1 &#8211; auto-populating titles or Master Templates, but I am very pleasantly surprised how far Apple have &#8220;come around to my way of thinking&#8221;.</p>
<p>OK, let&#8217;s call it parallel development then, as I&#8217;m fairly sure that Apple had their metadata-centric rewrite well under way by the time I was writing, but it is gratifying to have one&#8217;s position validated. For a company that didn&#8217;t really show much sign of &#8220;getting&#8221; metadata with Final Cut Pro 1-7, they have certainly embraced it for Final Cut Pro X.</p>
<p><span id="more-3654"></span>Final Cut Pro X is entirely metadata based. You find clips initially via their date/format/source metadata. Each metadata category gets its own smart collections. From there, each time a keyword is applied to a clip, a new Smart Collection is created. Add the keyword to another clip  (they autocomplete even for multiple keywords at once, just like our <em>prEdit</em> does) and it goes into the same bin (whether the keyword is applied to the full clip or a range). Or drag the clip or range to a Smart Collection and have the keyword applied.</p>
<p>Media organization/location, Bins and Subclips are all metadata functions in Final Cut Pro X. There&#8217;s no need to assign a Bin for imported footage (a.k.a. Log and Capture/Transfer target bin) because all media finds its way into Keyword Collections and Smart Collections based on metadata.</p>
<p>I also suspect that audio routing to replicate the functionality of fixed tracks will also use metadata, so that similarly tagged clips are always output to the same channels, i.e. dialog, music, effects, etc.</p>
<p>To be more precise, we&#8217;re talking about two different types of metadata. I&#8217;ve written extensively about the Six Types of Metadata we use in postproduction. <a href="http://www.philiphodgetts.com/2009/01/what-are-the-different-types-of-metadata-we-can-use-in-production-and-post-production/">Initially identifying four</a>: Source, Added, Derived and Inferred; and then later adding <a href="http://www.philiphodgetts.com/2009/06/what-is-the-fifth-type-of-metadata/">Analytical</a> and <a href="http://www.philiphodgetts.com/2009/06/i-think-theres-a-sixth-type-of-metadata/">Transform</a>. You can find out more in my <em><a href="http://www.larryjordan.biz/app_bin/Store/catalog/product_info.php?products_id=215">Mastering Metadata for Postproduction</a> </em>webinar that I did for Larry Jordan and Associates back in March.</p>
<p>In Final Cut Pro X Apple are using Source, Added and Derived Metadata. They also use Source and Derived for automation, beyond simply automatically creating Bins (as Smart Collections).</p>
<p>The information that comes with the file from the camera is clearly<strong> Source Metadata</strong>: camera, format, frame rate, codec, date shot, transfer date, and other information comes from the camera with no additional work. Interestingly there is more camera metadata available than apparently there are columns that can be displayed in Final Cut Pro X. In the NAB sneak peek we caught a brief look at the optional columns for the Browser and there are missing columns for metadata I know is available from the camera. Maybe our <em><a href="http://assistedediting.intelligentassistance.com/MetadataExplorer/">Metadata Explorer</a></em> isn&#8217;t dead after all?</p>
<p>Obviously keywords are <strong>Added Metadata</strong>: it has to be added by someone. Keywords and especially Range-based Keywords, provide much more flexible media sorting than Clips and Subclips ever could.</p>
<p>More interesting is that Apple have gone beyond those two types of metadata: something I&#8217;d assume any modern NLE to embrace one way or the other. Apple have already embraced <strong>Derived</strong> <strong>Metadata</strong>. My definition:</p>
<blockquote><p><strong>Derived Metadata</strong> is calculated using a non-human external information source and includes location from GPS, facial recognition, or automatic transcription.</p></blockquote>
<p>In Final Cut Pro X Apple <strong>derive:</strong></p>
<ul>
<li>Facial Detection;</li>
<li>Shot detection (plausibly derived from the size of any faces, and amount of sky area in the picture, among other Source criteria);</li>
<li>Stabilization and rolling shutter correction based on detecting and removing the artifacts (saved purely as metadata);</li>
<li>Automatic neutral white balance, based essentially on the auto-white balance technologies that are available (saved purely as metadata);</li>
<li>Audio cleanup based on the detection of common source problems and applying a correction.</li>
</ul>
<p>These functions, being metadata driven, are optional. Turn them off or have them performed only when you need them (but I would let them be done in the background during ingest, and just turn off automation when it doesn&#8217;t work to my favor).</p>
<p>Metadata driven automation is something that highly established editors well into their career tend to not want, while those in short term, faster turnaround jobs tend to appreciate it more. When demonstrating our <em><a href="http://assistedediting.intelligentassistance.com/FirstCuts/">First Cuts</a></em> software the response of editors with traditional experience is generally kind of negative; but the response from those earlier in their career, or who have tight turnaround, is more &#8220;Thanks for providing these great tools&#8221;.</p>
<p>You see, this metadata stuff isn&#8217;t just theory to me. I live in it every day. Every one of our Assisted Editing applications manipulates or uses metadata to make editing easier: to take the boring out of post.</p>
<p>We believe in metadata. I believe that the automations Apple have already built into Final Cut Pro X are just the beginning. The focus on metadata is good for us as one of the problems we&#8217;ve faced in the past is to have our metadata needs force-fitted into Final Cut Pro&#8217;s earlier inflexible metadata structures.</p>
<p>Now, with metadata embraced so comprehensively in Final Cut Pro X, perhaps it will make it easier for people to understand our goal of taking the boring out of post by automating as much as possible using the metadata provided or derived/inferred.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/05/why-did-apple-base-final-cut-pro-x-on-metadata/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
		<item>
		<title>prEdit: Edit paper cuts without the pain with any type of transcript!</title>
		<link>http://www.philiphodgetts.com/2011/05/predit-edit-paper-cuts-without-the-pain-with-any-type-of-transcript/</link>
		<comments>http://www.philiphodgetts.com/2011/05/predit-edit-paper-cuts-without-the-pain-with-any-type-of-transcript/#comments</comments>
		<pubDate>Thu, 19 May 2011 19:21:25 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Assisted Editing]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/?p=3618</guid>
		<description><![CDATA[New version works with any industry standard transcript, in any format.]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m incredibly proud to announce that <a href="http://assistedediting.intelligentassistance.com/prEdit/">prEdit</a> &#8211; our paper cut editing tool for Final Cut Pro and Premiere Pro users &#8211; has been updated to version 1.5 with some great new features.</p>
<p>prEdit 1.0.x required the transcript to be generated in Adobe Premiere Pro or Soundbooth; or be processed through either of those applications to lock text to video. While prEdit 1.5 still supports that workflow, it now also works with:</p>
<blockquote><p>Text, DOC and RTF formats that have timecode at (least) the start of each speaker. More frequent Timecode entries will make the text/video match more accurate.</p>
<p><a href="http://www.3playmedia.com/">3Play Media&#8217;s</a> JSON format, also available from <a href="http://www.mediasilo.com/">Media Silo</a>, is supported and has absolute word accuracy. We recommend the JSON format for new transcripts in the future, for now prEdit works with what you&#8217;ve already got. (3Play Media also offer more conventional formats with the JSON format, so you can have multiple options for the same transcription fee.)</p>
</blockquote>
<p><span id="more-3618"></span>We tested with every type of text, rtf and Microsoft Word Doc format that we could find and made sure that prEdit accurately read in the transcript accurately identifying the timecodes regardless of how that timecode is formatted.</p>
<p>If you&#8217;re working with transcripts you can swap over to prEdit at any time because any log notes entered in Final Cut Pro or Premiere Pro will be editable in prEdit. New log notes can be added in prEdit&#8217;s more efficient interface &#8211; with autocomplete &#8211; and merged with work done in the NLE.</p>
<p>Since I last wrote about  prEdit we added the ability to write in narration to link the interview segments. Narration is immediately turned into an audio file using the current (at the time) system voice for a temp audio file used during previews and for when the project is sent to Final Cut Pro or Premiere Pro CS5 or later.</p>
<p>As well as working with any transcript <a href="http://assistedediting.intelligentassistance.com/prEdit/">prEdit</a> 1.5:</p>
<ul>
<li>Let’s producers or editors cut transcripts into selects in seconds</li>
<li>Add and update log notes with auto-complete logging fields</li>
<li>Uses notes already added in Final Cut Pro or Premiere Pro</li>
<li>Previews the video for any clip, subclip, paper cut or section of paper cut</li>
<li>Exports to Excel spreadsheets and Final Cut Pro, or Premiere Pro Sequences.</li>
</ul>
<p>There&#8217;s a <a href="http://assistedediting.com/downloads">free demo</a> and prEdit is only $395 for a massive boost in productivity. As Lee Schneider said:</p>
<blockquote><p>Thanks for prEDIT!  It&#8217;s a wonderful application that &#8220;thinks&#8221; exactly like an editor does.  The workflow is smooth and intuitive and it speeds the process of logging by theme and topic that makes it much easier to assemble sequences.</p>
</blockquote>
<p> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/05/predit-edit-paper-cuts-without-the-pain-with-any-type-of-transcript/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Semantic Text Startup for Textual Analysis</title>
		<link>http://www.philiphodgetts.com/2011/03/semantic-text-startup-4-cliff/</link>
		<comments>http://www.philiphodgetts.com/2011/03/semantic-text-startup-4-cliff/#comments</comments>
		<pubDate>Sat, 19 Mar 2011 00:40:07 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Assisted Editing]]></category>
		<category><![CDATA[Interesting Technology]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/03/semantic-text-startup-4-cliff/</guid>
		<description><![CDATA[Automatically derives keywords, "Cliff Notes" summaries, key point and important facts. I'll bet you can guess what I want to do with it!]]></description>
			<content:encoded><![CDATA[<p>Semantic Text Startup for: Cliff notes, keywords, key points and important facts derived from raw text. <a rel="nofollow" href="http://tinyurl.com/5sr5myk">http://tinyurl.com/5sr5myk</a></p>
<p>One of the technologies I&#8217;ve been following, because I think it&#8217;s relevant to my goals with Assisted Editing (to take the boring out of postproduction). One piece of the &#8220;boring&#8221; is deriving keywords and concepts from spoken word (transcribed, of course).</p>
<p>Technologies like this, and others developed for the Library and Archivist industries, are becoming very sophisticated.</p>
<p>In an Assisted Editing context, the extraction of keywords (particularly) from a &#8220;chunk&#8221; of transcribed spoken word (let&#8217;s say an interview for a documentary), removes the need for a human to enter the keywords.</p>
<p>Having keywords is valuable because you can search for all instances of the keyword (to find common themes), which is something prEdit really does well, whether you&#8217;re going to build the initial outline manually in a tool like prEdit or Final Cut Pro, or use an Assisted Editing tool to get to a rough first assemble.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/03/semantic-text-startup-4-cliff/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Master Metadata for Post-Production</title>
		<link>http://www.philiphodgetts.com/2011/02/master-metadata-for-post-produ/</link>
		<comments>http://www.philiphodgetts.com/2011/02/master-metadata-for-post-produ/#comments</comments>
		<pubDate>Thu, 24 Feb 2011 17:51:30 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[The Technology of Production]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/02/master-metadata-for-post-produ/</guid>
		<description><![CDATA[A FREE webinar I'm presenting in conjunction with Larry Jordan &#038; Associates.]]></description>
			<content:encoded><![CDATA[<p>Master Metadata for Post-Production &#8211; 2 Mar 2011, 11:00 AM EST #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23webinar">webinar</a> <a rel="nofollow" href="http://t.co/NXv0XvZ">http://t.co/NXv0XvZ</a></p>
<p>It&#8217;s repeated during the day.</p>
<p><span id="more-2935"></span></p>
<p>“<strong>Master Metadata for Post-Production</strong>.”  <a href="http://www.larryjordan.biz/sem/webinar_2010b.php#mar02">http://www.larryjordan.biz/sem/webinar_2010b.php#mar02</a> Metadata can be used for tracking media content, usage and rights management, but metadata &#8211; log notes &#8211; can also drive post production by automating tasks that aren&#8217;t a part of the creative process. Metadata driven post production frees editors to focus on creative outcomes. Nowhere is this more important than in Final Cut Pro.</p>
<p>Most editors avoid creating and updating metadata because it is viewed as boring and unessential to editing. However, metadata has taken on an increasing importance in production and post production with the shift to tapeless media. Today, metadata is being acquired on set to pass through to post and VFX departments. Metatdata is essential to tracking existing assets. And, metadata is required for distribution and rights management control.  Knowing how to organize, enter, and search on metadata is essential.</p>
<p>My webinar will explain why metadata is so important and provide some simple ways to begin adding metadata to your own projects. In addition, he will explain the concept of metadata, outline the various types of metadata in use today, such as Dublin Core, XMP, MPEG-7, MPEG-2, and SMPTE-2021 (BXF) &#8211; and tell you which ones matter. As well as these metadata standards, the presentation will outline the six types of metadata used in post production to help you take the boring parts out. He’ll consider which commonly used applications supports metadata, and how to preserve metadata when moving between applications.</p>
<p>I will show how to efficiently enter metadata in Final Cut Pro, reducing the amount of work needed, and how to use Intelligent Assistance&#8217;s prEdit to even more competently enter metadata and work with text transcript metadata, before the final section of the webinar when he will consider what are the future possibilities of metadata in post production.</p>
<p>Did I mention it&#8217;s FREE?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/02/master-metadata-for-post-produ/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Can A Computer Do Your Job?</title>
		<link>http://www.philiphodgetts.com/2011/02/can-a-computer-do-your-job-ht/</link>
		<comments>http://www.philiphodgetts.com/2011/02/can-a-computer-do-your-job-ht/#comments</comments>
		<pubDate>Tue, 01 Feb 2011 20:41:11 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Assisted Editing]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/02/can-a-computer-do-your-job-ht/</guid>
		<description><![CDATA[This Business Insider article suggests that it can, and in some cases, better.]]></description>
			<content:encoded><![CDATA[<p>Can A Computer Do Your Job? <a rel="nofollow" href="http://tinyurl.com/4nt5kf8">http://tinyurl.com/4nt5kf8</a></p>
<p>The examples in the article are surprisingly &#8220;high end&#8221;, pitting humans choosing potential University entrants against a simple algorith, and the algorithm wins.</p>
<p>Could a computer do your job as a &#8216;creative&#8217; individual? An editor, writer or producer?</p>
<p><span id="more-2843"></span></p>
<p>If your role is truly creative probably not. But as we try and take the boring out of post, we&#8217;ve realized a lot of so-called &#8216;creative work&#8217; isn&#8217;t. It&#8217;s not creative synchronizing separate system audio and video. It&#8217;s not creative stringing out a first assemble of scripted content. That&#8217;s grunt work, to get to a point where the final story can be crafted out of the script and shoot. (The first assemble bears little relationship to the final edit.)</p>
<p>We&#8217;ve even established that, with the right metadata a basic first cut can be pulled out of log notes &#8211; that&#8217;s First Cuts, which we released more than two years ago.</p>
<p>So, can a computer do your job? If someone can analyze what you do, it can be replicated in software. Fortunately most &#8220;creative&#8217; tasks are hard to analyze as to why something &#8220;works&#8221; &#8211; intuition is hard to replicate (at least until we understand what mechanisms happen beneath the intuition).</p>
<blockquote><p>Anyway, with hindsight, I found most of the facade put forth by various departments (eg, auto lending, health care lending), was very misleading. Everyone made the simple complicated. I think deep down, no one likes to think a computer can do their job, and there are many instances where exceptions matter, so a great deal is made out of these special cases. Yet the false positives made them great anecdotes, but horrible for generalizations. Thus, simple rules dominate their much more costly, confusing, and non-quantitative product created by teams of analysts.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/02/can-a-computer-do-your-job-ht/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Episode 18 of The Terence and Philip Show</title>
		<link>http://www.philiphodgetts.com/2011/01/whats-going-to-happen-in-2011/</link>
		<comments>http://www.philiphodgetts.com/2011/01/whats-going-to-happen-in-2011/#comments</comments>
		<pubDate>Fri, 21 Jan 2011 20:05:50 +0000</pubDate>
		<dc:creator>Philip</dc:creator>
				<category><![CDATA[Assisted Editing]]></category>
		<category><![CDATA[Business & Marketing]]></category>
		<category><![CDATA[Item of Interest]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Monetizing]]></category>
		<category><![CDATA[The Business of Production]]></category>
		<category><![CDATA[The Technology of Production]]></category>

		<guid isPermaLink="false">http://www.philiphodgetts.com/2011/01/whats-going-to-happen-in-2011/</guid>
		<description><![CDATA[What's going to happen in 2011?]]></description>
			<content:encoded><![CDATA[<p>What&#8217;s going to happen in 2011? <a rel="nofollow" href="http://tinyurl.com/48f979s">http://tinyurl.com/48f979s</a></p>
<p>Our longest show ever!</p>
<blockquote><div>
<p>In this episode Terence and Philip, with Greg Huson from <a href="http://www.secrethq.com/">Secret HQ</a>, look forward to what we can expect in 2011.  You might want to pace yourself on this one as we&#8217;ve set a new record for show length.</p>
<p>What will the Microsoft Kinect be used for? Who&#8217;ll be releasing new software this year &#8211; will we see new versions of Media Composer, Final Cut Pro or Adobe Creative Suite? Will Avid open up to 3rd party hardware? What will be in those releases? Is this the year Metadata (finally) takes off?</p>
<p>How many movies have to not make money from 3D before the fad is over? Or will 3D TV spark 3D production? Will we see RED Epic this year and will it be a success? What will develop with large sensor cameras?</p>
<p>How will the collapse of State Governments affect production subsidies? Will runaway production come back to LA?</p>
<p>When will the tipping point come when distribution breaks out of broadcast and cable channel models? Is ivi going to be ruled legal? What&#8217;s the future of Netflix? Is a social network a replacement for channel guides?</p>
<p>What do we wish we could predit for this year? More use of metadata for production automation and where it comes from? What if we didn&#8217;t do a first string-out manually? This leads to a discussion of the philosophy of editing.</p>
<p>What will be this year&#8217;s surprise? Another DSLR? Another daVinci/Smoke on Mac?</p>
<p>What will happen in distribution? What&#8217;s the future of DVD Extras?</p>
<p>What isn&#8217;t going to happen that needs to happen?</p>
</div>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.philiphodgetts.com/2011/01/whats-going-to-happen-in-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

