![]() Even if such a small team was able to build a high quality ASR system in such a short time frame (highly unlikely), it’s not like it’s something you can just build and then expect to have it run smoothly. While it’s a nice dream, the world of software development is inherently messy. I’ll just pay them for 6 months to 1 year and then I’ll have my ASR system finished for use in perpetuity.” “Well,” you might be thinking, “that’s not so bad. Therefore, you’re looking at a yearly annual expenditure of at least $300k-600k for a lean, bare-bones development team. The salary range you’ll need to pay for these positions, at least if you want to attract decent engineers, will easily be in the range of $100k-200k per person and more if you want to pay for more experienced people. ![]() Usually the people you’ll need to hire to build an ASR system will include at least one of each of a Machine Learning Scientist/Researcher (PhD level), a software engineer for creating APIs and deployment, and a data engineer to help warehouse and manage all the text, audio, and other training data. You’ll need to hire multiple people to distribute the workload. You’ll typically have to hire more than one person because the chances of finding all of this domain expertise in a single individual is rare, not to mention that even if they have it, building an ASR is not typically a one-man job. The technology behind ASR is machine learning, which involves a huge amount of math, data, and software domain expertise. The first thing you’ll need to realize about building an ASR is that it’s not a simple task that you can offshore or hand off to inexperienced workers. However, we’ve found that for almost all businesses with typical usage, paying for an on-demand speech recognition service is a much higher value than building your own. It’s a tricky question, and the decision may skew very clearly one way or the other if you’re on either of the extreme ends of usage. buying on-demand access via a service such as Rev.ai. If your company operates in a domain that requires frequent speech to text transcriptions, you’re probably wondering whether there’s a long term payoff in building your own automated speech recognition system (ASR) vs. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |