IF YOU REGISTER FOR A FREE TRIAL OF THE SERVICE, THE APPLICABLE PROVISIONS OF THIS AGREEMENT ALSO GOVERN YOUR USE OF THOSE SERVICES

Terms of Service

These Terms of Service ("Terms") are between you ("you" or "Customer") and Knowi, Inc. ("Knowi," "we," "us," or "our"). Please read them carefully. They form a contract between you and Knowi that governs your access to and use of the Knowi Services. You may use the Services only if you have the power to form a contract with Knowi and are not barred under any applicable laws from doing so. Your use of or registration for any of the Services will constitute your agreement to be bound by these Terms. If you do not agree to be bound by these Terms, you must not use the Services. If you are using the Services on behalf of an organization, unless that organization has a separate paid contract in effect with us, you are agreeing to these Terms for that organization, and representing to Knowi that you have the authority to bind that organization to these Terms (in which event, "you" and "you" will refer to the organization). If you are using the Services on behalf of an organization that has a separate paid contract in effect with us, the terms of that contract will govern your use of the Services.

These Terms may be modified from time to time. The date of the most recent revisions will appear on this page, so please check back often. Your use of or continued access to the Services after any changes constitutes your acceptance of those changes, whether or not you have reviewed them. If you do not agree to changes to the Terms, you must stop using the Services and cancel your user account.

For ease of reference, these Terms are broken into the following sections:

Definitions
Availability of the Services
Your Responsibilities relating to Use of the Services
Fees and Payment
Cancellation of Services
Confidentiality
Ownership
No Warranty
Indemnification
Limitation of Liability
Suspension and Termination of your Use of the Services
General Provisions

Definitions

"Account"

means an online account created by you or on your behalf within the Services.

"Administrator"

means a User you identify as having administrative rights including, without limitation, the permission to add licenses, cancel licenses and define the scope of the Services.

"Affiliate"

means, with respect to a party, any entity which directly or indirectly controls, is controlled by, or is under common control with such party (where "control" means ownership or control, directly or indirectly, of more than 50% of the voting interests of the subject entity).

"Content"

means data, text, audio, video, images or other content.

"Documentation"

means written or online user documentation that describe the functionality, operation, and use of the Services, and that Knowi provides or makes generally available to customers of the Services.

"Services"

refers, collectively, to the hosted storage solution we provide for online storage, sharing and processing of Content, the Software, the Website, and Documentation.

"Software"

means the software used, provided or made available by Knowi for use in connection with the Services. Software includes the Knowi Data Connector Software which is that portion of the Software that is installed on Customer's local server, desktop, mobile or other device (for example, mobile apps, desktop apps, and group apps) and enables a User to submit data to Knowi.

"User"

means an individual who accesses the Services and Software.

"Website"

means any websites owned or operated by Knowi, including those located at and www.knowi.com.
Availability of the Services
- Services.
  
  We will make the Services available for your use on a non-exclusive basis and in strict compliance with these Terms and all applicable laws. Your use includes allowing Users to transmit, store, share, retrieve, and process Content through the Services solely through an Account registered to you and in accordance with the orders you place with Knowi. In the event that your Users exceed the quantity or User type for which you paid, you agree to pay for your additional Users at Knowi's then-current pricing.
- Software Provided for Use with the Services.
  
  Subject to your continued compliance with these Terms, we grant you the nonexclusive, nontransferable, worldwide, personal license to install and use the Knowi Data Connector for the sole purpose of submitting data into Knowi Service.
- Support for the Services.
  
  Knowi will provide the level of support you select in your order from those we make available.
- Updates to the Services.
  
  We reserve the right, in our sole discretion, to change, update, and enhance the Services at any time including to add functionality or features to, or remove them from, the Services. We may also suspend the Services or stop providing the Services all together.
- Free Trials.
  
  If you register on our website or via a Service Order for a Free Trial, we will make the Service available to you under the Free Trial until the earlier of (a) the end of the Free Trial period for which you registered to use the Service, or (b) the start date of any Full Knowi Service subscription ordered by you for such Service, or (c) termination by us in our sole discretion. Additional Free Trial terms and conditions may appear on the Free Trial registration web page. Any such additional terms and conditions are incorporated into this Agreement by reference and are legally binding. We reserve the right, in our absolute discretion, to determine your eligibility for a Free Trial, and, subject to applicable laws, to withdraw or to modify a Free Trial at any time without prior notice and with no liability, to the greatest extent permitted under law. ANY DATA YOU ENTER INTO THE SERVICE, AND ANY CONFIGURATION CHANGES MADE TO THE SERVICE BY OR FOR YOU, DURING YOUR FREE TRIAL WILL BE PERMANENTLY LOST UNLESS YOU PURCHASE A SUBSCRIPTION TO THE SAME SERVICE AS THOSE COVERED BY THE FREE TRIAL OR EXPORT SUCH DATA, BEFORE THE END OF THE FREE TRIAL PERIOD. IF YOUR SUBSCRIPTION DOES NOT INCLUDE FEATURES AVAILABLE IN THE FREE TRIAL, YOU MUST EXPORT YOUR DATA BEFORE THE END OF THE TRIAL PERIOD OR YOUR DATA WILL BE PERMANENTLY LOST. Please review the applicable Documentation for the Service during the Free Trial period so that you become familiar with the functionality and features of the Service before you make your purchase.
Your Responsibilities relating to Use of the Services.
- Passwords and Account.
  
  To obtain access to certain Services, you will be required to obtain an Account with Knowi by completing a registration form and designating a user ID and password. Until you apply for and are approved for an Account, your access to the Services will be limited to those areas of the Services, if any, that Knowi makes available to the general public. You agree and represent that all registration information you provide is accurate, complete, and current, and that you will update it promptly when that information changes. Knowi may withdraw Account approval at any time in its sole discretion, with or without cause. You are responsible for safeguarding the confidentiality of your User ID and passwords, and for all activities that take place with your Account. Knowi will not be liable for any loss or damage arising from any unauthorized use of your Account.
- Notices from Knowi.
  
  You acknowledge that once you have registered with us, we may send you communications or data regarding the Services using electronic means. These may include, but are not limited to (i) notices about your use of the Services, including any notices concerning violations of use, (ii) updates to the Services, (iii) promotional information and materials regarding Knowi's products and services, and information the law requires us to provide. We give you the opportunity to opt-out of receiving certain of these communications from us by following the opt-out instructions provided in the message. However, even if you opt-out, you understand that we may continue to provide you with required information by e-mail at the address you specified when you signed up for the Services or via access to a website that we identify. Notices we e-mail to you will be deemed given and received when the e-mail is sent. If you don't agree to receive required notices via e-mail, you must stop using the Services. If you provide Knowi with legal notices, you must transmit it to us via email to legal@cloud9charts.com. Any such notice, in either case, must specifically reference that it is a notice given under these Terms.
- Notices from You regarding Unauthorized Use.
  
  You agree to notify us promptly in writing when you become aware of any unauthorized use of an Account, the Content or the Services, including if you suspect there has been any loss, theft or other security breach of your password or user ID. If there is an unauthorized use by a third party which obtained access to the Services through you or your Users, whether directly or indirectly, you agree to take all steps necessary to terminate the unauthorized use. You also agree to provide Knowi with any cooperation and assistance related to that unauthorized use which we reasonably request.
- Content.
  
  Knowi does not monitor any data transmitted or processed hrough, or stored in, the Services. You agree that you:
  - are responsible for the accuracy and quality of all Content that is transmitted or processed through, or stored in, your Account;
  - will ensure that the Content (including its storage and transmission) complies with these Terms, and applicable laws and regulations;
  - will promptly handle and resolve any notices and claims from a third party claiming that any Content violates that party's rights, including regarding take-down notices pursuant to the Digital Millennium Copyright Act;
  - will maintain appropriate security, protection and backup copies of the Content, which may include (A) the use of encryption technology to protect the Content from unauthorized access and (B) routine archiving of the Content. Knowi will have no liability of any kind as a result of any deletion, loss, correction, or destruction of Content or damage to or failure to store or encrypt any Content.
- Use Restrictions.
  
  You are responsible for Users' compliance with these Terms and for the quality, accuracy and legality of the Content. You will not, and will ensure that your Users do not
  - use the Services in any manner or for any purpose other than as expressly permitted by these Terms including, without limitation, allowing Power Users to use the logins of your Business Partner Users;
  - sell, rent, resell, lease, or sublicense the Services to any third party;
  - modify, tamper with or otherwise create derivative works of the Services;
  - reverse engineer, disassemble or decompile the Services, or attempt to derive source code from the Services;
  - remove, obscure or alter any proprietary right notice related to the Services;
  - use the Services to send unsolicited or unauthorized junk mail, spam, chain letters, pyramid schemes or any other form of duplicative or unsolicited messages;
  - store or transmit Content: (A) containing unlawful, defamatory, threatening, pornographic, abusive, or libelous material, (B) containing any material that encourages conduct that could constitute a criminal offense, or (C) that violates the intellectual property rights or rights to the publicity or privacy of others;
  - use the Services to store or transmit viruses, worms, time bombs, Trojan horses or other harmful or malicious code, files, scripts, agents or programs;
  - interfere with or disrupt servers or networks connected to the Services or the access by other Knowi client to the servers or networks, or violate the regulations, policies or procedures of those networks;
  - access or attempt to access Knowi's other accounts, computer systems or networks not covered by these Terms, through password mining or any other means; or
  - access or use the Services in a way intended to avoid incurring fees, exceeding usage limits and the like.
- Third Party Services and Content.
  
  All transactions using the Services are between the transacting parties only. The Services may contain features and functionalities linking or providing you with certain functionality and access to third party content, including Web sites, directories, servers, networks, systems, information and databases, applications, software, programs, products or services, and the Internet as a whole. You acknowledge that Knowi is not responsible for such content or services. We may also provide some content to you as part of the Services. However, Knowi is neither an agent of any transacting party nor a direct party in any such transaction. Any of those activities, and any terms associated with those activities, are solely between you and the applicable third-party. Similarly, we are not responsible for any third party content you access with the Services, and you irrevocably waive any claim against Knowi with respect to such sites and third-party content. Knowi has no liability, obligation or responsibility for any such correspondence, purchase or promotion between Customer and any such third-party. You are solely responsible for making whatever investigation you feel is necessary or appropriate before proceeding with any transaction with any of these third parties and your dealings with any third party related to the Services, whether online or offline, including the delivery of and payment for goods and services. In the event you have any problems resulting from your use of a third party service, or suffer data loss or other losses as a result of problems with any of your other service providers or any third-party services, we are not responsible unless the problem was the direct result of our breaches.
Fees and Payment
- Fees.
  
  You agree to pay, using a valid credit card (or other form of payment which we may accept from time to time), the charges and fees (such as recurring monthly or annual fees) set forth in Schedule A, Taxes (as defined below), and other charges and fees incurred in order to access the Services. You will pay Fees in the currency we quoted for your account (and we reserve the right to change the quoted currency at any time). We will automatically charge your credit card or other account at the start of the billing period and at the start of each renewal period. Except as specifically set forth in this section, all Services are prepaid for the period selected (monthly, annually or otherwise) and are non-refundable. This includes accounts that are renewed.
- Fees for Upgrade.
  
  If you upgrade or expand consumption of the Services , additional fees may be due at Knowi's then-current pricing. If additional fees are due, those fees will be immediately charged to your credit card or other account and will apply for the entire month in which the Services Upgrade occurred. If you have paid for an annual period, Services Upgrades will be coterminous with the affected Services period.
- Fee Increases.
  
  We will notify you in advance, either through a posting on this Website or by email to the address you have most recently provided to us, if we increase Fees or institute new charges or fees. Any increase in Fees will take effect at the beginning of the next renewal subscription term for the Services. For example, if you pay monthly, your use of the Services will be charged at the new price when Services are renewed in the month that follows the notice. If you don't agree to these changes, you must cancel and stop using the Services.
- Invoicing and Payment Terms.
  
  You agree to keep all information in your billing account current. You may change your payment method or modify your billing account information at any time by using the means provided on the Website. Your notice to us will not affect charges we submit to your billing account before we reasonably could act on your request. In the event that we invoice you, then all fees will be due and payable upon receipt. We reserve the right to charge, and you agree to pay, a late fee on past due amounts. The late fee will be equal to the lesser of 1.5% of the unpaid amount each month or the maximum amount allowed by applicable law. We may use a third party to collect past due amounts. You must pay for all reasonable costs we incur to collect any past due amounts, including reasonable attorneys' fees and other legal fees and costs. In addition, we may suspend your access to the Services, or cancel the Services, if your account is past due.
- Taxes.
  
  Fees are exclusive of Taxes and you will pay or reimburse Knowi for all Taxes arising out of these Terms, whether assessed at the time of your purchase or are thereafter determined to have been due. For purposes of these Terms, "Taxes" means any sales, use and other taxes (other than taxes on Knowi's income), export and import fees, customs duties and similar charges applicable to the transactions contemplated by these Terms that are imposed by any government or other authority. You agree to promptly provide Knowi with legally sufficient tax exemption certificates for each taxing jurisdiction for which you claim exemption.
Cancellation of Services

To cancel the Services, you must provide us with at least 30 days' notice and follow the process we specify on the Website. If you cancel, the Services will end at the end of your current Services period following the 30 days' notice. If you fail to cancel as required, we will automatically renew the Service for the same term and will charge your payment information on file with us commencing on the first day of the renewal term.
Confidentiality
- Description of Confidential Information.
  
  In connection with each party's rights and obligations under these Terms, each party (as the "disclosing party") may disclose to the other party (as the "recipient") certain of its confidential or proprietary information ("Confidential Information"). In the case of Knowi, the Services, these Terms and any other proprietary or confidential information we provide to you constitute Knowi Confidential Information. In the case of Customer, Content provided to Knowi by Customer constitutes Customer Confidential Information.
- Protection of Confidential Information.
  
  Each party as recipient agrees: (i) to exercise at least the same degree of care to safeguard Confidential Information of the disclosing party as the recipient exercises to safeguard the confidentiality of its own confidential information, but not less than reasonable care; (ii) to use the disclosing party's Confidential Information only in connection with exercising its rights and performing its obligations under these Terms; and (iii) to not disclose or disseminate the disclosing party's Confidential Information to any third party and that the only employees and contractors who will have access to the disclosing party's Confidential Information will be those with a need to know who have agreed to abide by the obligations set forth in this Section pursuant to a written confidentiality agreement.
- Protection of Content.
  
  We agree to maintain appropriate administrative, physical, and technical safeguards to protect the security, confidentiality, and integrity of the Content. The third party data center providers utilized by Knowi in the provision of the Services will maintain at a minimum SSAE 16 audit certification or its equivalent. Except as requested by you in connection with customer support, we will not (i) modify Content, (ii) disclose Content except pursuant to the requirements of a governmental agency, by operation of law, to investigate occurrences that may involve violations of system or network security, or as you expressly permit in writing, or (iii) access Content except to provide the Services or to address other service or technical problems.
- >Exceptions to Confidentiality.
  
  Information will not be deemed Confidential Information of either of us under these Terms if such information: (i) is or becomes rightfully known to the recipient without any obligation of confidentiality or breach of these Terms; (ii) becomes publicly known or otherwise ceases to be secret or confidential, except through a breach of these Terms by the recipient of such Confidential Information; or (iii) is independently developed by the recipient of such Confidential Information without breach of these Terms. Confidential Information will remain the property of the disclosing party.
Ownership
- Ownership by Customer.
  
  As between Customer and Knowi, Customer or its licensors own all right, title and interest in and to the Content. Customer hereby grants Knowi the right to transmit, use, modify, adapt, reproduce, display or disclose the Content solely (i) to provide the Services to Customer or any User, (ii) to comply with any request of a governmental or regulatory body (including subpoenas or court orders) or as otherwise required by law, (iii) for statistical use (provided that such data is not personally identifiable), and (iii) as necessary to monitor and improve the Services. Customer represents and warrants that Customer has all rights in the Content necessary to grant these rights and use the Services, and that the transmission, storage, retrieval, and processing of the Content do not violate any law or these Terms.
- Ownership by Knowi.
  
  As between Knowi and Customer, Knowi or its licensors own and reserve all right, title and interest in and to the Services and all hardware, software and other items used to provide the Services. No title to or ownership of any proprietary rights related to the Services is transferred to Customer or any User pursuant to these Terms or any transaction contemplated by these Terms. Knowi reserves all rights not explicitly granted to Customer. Knowi is free to use any comments, suggestions, recommendations, and other feedback you provide with respect to the Services for any purpose, without obligation.
No Warranty

KNOWI PROVIDES THE SERVICES "AS IS," "WITH ALL FAULTS," AND "AS AVAILABLE." TO THE MAXIMUM EXTENT PERMITTED BY APPLICABLE LAW, KNOWI MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND, WHETHER EXPRESS, IMPLIED, STATUTORY OR OTHERWISE. KNOWI SPECIFICALLY DISCLAIMS, WITHOUT LIMITATION, ANY WARRANTY THAT THE SERVICES WILL BE UNINTERRUPTED, ERROR-FREE OR FREE OF HARMFUL COMPONENTS, THAT THE CONTENT WILL BE SECURE OR NOT OTHERWISE LOST OR DAMAGED, OR ANY IMPLIED WARRANTY OF MERCHANTABILITY, SATISFACTORY QUALITY, FITNESS FOR A PARTICULAR PURPOSE, OR NON-INFRINGEMENT, AND ANY WARRANTY ARISING OUT OF ANY COURSE OF PERFORMANCE, COURSE OF DEALING OR USAGE OF TRADE. SOME JURISDICTIONS DO NOT ALLOW THE FOREGOING EXCLUSIONS. IN SUCH AN EVENT SUCH EXCLUSION WILL NOT APPLY SOLELY TO THE EXTENT PROHIBITED BY APPLICABLE LAW.
Indemnification

To the maximum extent permitted by applicable law, you agree to defend, indemnify, and hold harmless Knowi, its officers, directors, employees, and agents, against any cost, loss, damage, or other liability arising from any third party demand or claim that any Content provided by you, or your use of the Services, in breach of these Terms: (a) infringes a registered patent, registered trademark, or copyright of a third party, or misappropriates a trade secret (to the extent that such misappropriation is not the result of Knowi's actions) or (b) violates applicable law or these Terms. Knowi will provide you with notification of any such claim or demand that is subject to your indemnification obligation.
Limitation of Liability

TO THE FULLEST EXTENT PERMITTED BY APPLICABLE LAW, IN NO EVENT (a) WILL THE LIABILITY OF KNOWI, ITS AFFILIATES, OFFICERS, EMPLOYEES, OR AGENTS FOR ANY AND ALL CLAIMS RELATING TO THE SERVICES EXCEED THE GREATER OF $100.00 OR THE TOTAL AMOUNT OF FEES THAT YOU PAID US DURING THE PREVIOUS THREE MONTH PERIOD AND (b) WILL KNOWI, ITS AFFILIATES, OFFICERS, EMPLOYEES, OR AGENTS BE LIABLE FOR ANY INDIRECT, INCIDENTAL, SPECIAL, PUNITIVE, COVER OR CONSEQUENTIAL DAMAGES (INCLUDING, WITHOUT LIMITATION, DAMAGES FOR LOST PROFITS, REVENUE, GOODWILL, USE OR CONTENT) HOWEVER CAUSED, UNDER ANY THEORY OF LIABILITY, INCLUDING, WITHOUT LIMITATION, CONTRACT, TORT, WARRANTY, NEGLIGENCE OR OTHERWISE, EVEN IF KNOWI HAS BEEN ADVISED AS TO THE POSSIBILITY OF SUCH DAMAGES. SOME STATES DO NOT ALLOW THE LIMITATION OR EXCLUSION OF LIABILITY FOR INCIDENTAL OR CONSEQUENTIAL DAMAGES, SO THE ABOVE EXCLUSIONS OR LIMITATIONS MAY NOT APPLY TO YOU. YOU MAY ALSO HAVE OTHER RIGHTS THAT VARY FROM STATE TO STATE.
Suspension and Termination of your Use of the Services
- General.
  
  Knowi reserves the right to temporarily suspend or terminate your access to the Services at any time in Knowi's sole discretion, with or without cause, and with or without notice, without incurring liability of any kind. For example, we may suspend or terminate your access to or use of the Services for: (i) the actual or suspected violation of these Terms; (ii) the use of the Services in a manner that may cause Knowi to have legal liability or disrupt others' use of the Services; (iii) the suspicion or detection of any malicious code, virus or other harmful code in your Account; (iv) downtime, whether scheduled or recurring; (e) your use of excessive storage capacity or bandwidth; or (v) unplanned technical problems and outages. If, in our determination, the suspension might be indefinite or we have elected to terminate your access to the Services, we will use commercially reasonable efforts to notify you through the Services. You acknowledge that if your access to the Services is suspended or terminated, you may no longer have access to the Content that is stored with the Services.
- Termination for Lack of Activity.
  
  In addition to our other rights of termination, if your Account is not currently subject to a paid subscription plan with us, we may terminate your Account if: (i) you do not engage in any activity in the Account within 30 days after registering for the Services, or (ii) you do not engage in any activity in an Account for 120 consecutive days. In the event of such termination, any of your Content may be lost.
- Post-Termination Obligations.
  
  Upon termination of these Terms for any reason, all of your rights to use or access the Services will cease. You agree, within five days of such termination, to destroy all copies of the Software, the Documentation, and any Confidential Information of Knowi, including any Documentation in written or electronic form and any Software stored on your servers or other systems. In addition, if requested by Knowi, you will promptly provide to Knowi a written certification signed by an authorized representative certifying that all copies of the Software and any written or electronic documentation and Confidential Information of Knowi have been destroyed. For 30 days following the expiration of the Termination of these Terms or the applicable subscription term for which you have paid, and subject to your prior written request, we will grant you with limited access to the Services solely for purposes of your retrieval of the Content. After that 30-day period, Knowi has no further obligation to maintain the Content and will delete the Content unless legally prohibited.
- Survival.
  
  The terms of any sections that by their nature are intended to extend beyond termination will survive termination of these Terms for any reason.
General Provisions
- Governing Law.
  
  These Terms will be construed and enforced in all respects in accordance with the laws of the State of California, without reference to its choice of law rules. Any dispute between the parties will be brought in a court in Alameda County and each party irrevocably waives any claim that such court does not have personal jurisdiction over the party. All use of the Services is expressly governed by any applicable export and import laws, and Customer must comply with all such laws. Claims arising out or related to these terms must be filed within one year of the date on which the claim arose unless local law requires a longer time to file claims. If a claim is not filed accordingly, then it is permanently barred.
- Government Users.
  
  If you are a U.S. government entity, you acknowledge that any Software and Documentation are provided as "Commercial Items" as defined at 48 C.F.R. 2.101, and are being licensed to U.S. government end users as commercial computer software subject to the restricted rights described in 48 C.F.R. 2.101 and 12.212.
- Independent Contractors; Third Party Beneficiaries.
  
  You and we are independent contractors, and nothing in these Terms creates a partnership, employment relationship or agency. There are no third-party beneficiaries of these Terms. Knowi may subcontract portions of the Services provided that Knowi shall remain responsible for all such obligations under these Terms.
- Waiver.
  
  Our failure to enforce any of these Terms will not be considered a waiver of the right to enforce them. Our rights under these Terms will survive any termination.
- Assignment.
  
  You may not assign these Terms or your rights and obligations under them, in whole or in part, to any third party without our prior written consent, and any attempt by you to do so will be invalid.
- Severability.
  
  Should any part of these Terms be held invalid or unenforceable, that portion will be construed consistent with applicable law and the remaining portions will remain in full force and effect.
- Force Majeure.
  
  Neither party will be liable to the other for any delay or failure to perform its obligations under these Terms (excluding payment obligations) if the delay or failure arises from any cause or causes beyond that party's reasonable control.
- Public Announcement.
  
  Knowi reserves the right to release a press announcement regarding the parties' relationship, and to include Customer's name on Knowi's customer lists on Knowi's web site and in any other marketing materials.
- Entire Agreement and Changes.
  
  These Terms, including fees for Services on the Website, constitutes the entire agreement, and supersedes any and all prior agreements, between the parties with regard to the subject matter hereof. Knowi reserves the right to modify or replace these Terms at any time in its sole discretion. Knowi will indicate at the top of these Terms the date these Terms were last updated. Any changes will be effective upon posting the revised version of these Terms on the Services (or such later effective date as may be indicated at the top of the revised Terms). Customer's continued access or use of any portion of the Services constitutes Customer's acceptance of such changes. If Customer doesn't agree to any of the changes, Customer must cancel and stop using the Services.
- Privacy.
  
  In order to operate and provide the Services, Knowi collect certain information about Customer. As part of the Services, Knowi may also automatically upload information about Customer's computer or other device, Customer's use of the Services, and the Services performance. Knowi will use and protect that information as described in the privacy policy located on the Website ("Privacy Policy"). Customer further acknowledges and agrees that Knowi may access or disclose information about Customer, including the content of Customer communications, in order to: (i) comply with the law or respond to lawful requests or legal process; (ii) protect the rights or property of Knowi or Knowi's customers, including the enforcement of Knowi's agreements or policies governing Customer's use of the Services; or (iii) act on a good faith belief that such access or disclosure is necessary to protect the personal safety of Knowi employees, customers, or the public.
- DMCA.
  
  We respect the intellectual property of others, and reserve the right to delete or disable Content that appears to violate these terms or applicable law. The Digital Millennium Copyright Act of 1998 (the "DMCA") provides recourse for copyright owners who believe that material appearing on the Internet infringes their rights under U.S. copyright law. If you believe in good faith that Content infringes your copyright, you (or your agent) may send us a notice requesting that the Content be removed or access to it blocked. Federal law requires that your notification include the following information: (i) a physical or electronic signature of a person authorized to act on behalf of the owner of an exclusive right that is allegedly infringed; (ii) identification of the copyrighted work claimed to have been infringed or, if multiple copyrighted works at a single online site are covered by a single notification, a representative list of such works at that site; (iii) identification of the material that is claimed to be infringing or to be the subject of infringing activity and that is to be removed or access to which is to be disabled and information reasonably sufficient to permit us to locate the material; (iv) information reasonably sufficient to permit us to contact you, such as an address, telephone number, and, if available, an electronic mail; (v) a statement that you have a good faith belief that use of the material in the manner complained of is not authorized by the copyright owner, its agent, or the law; and (vi) a statement that the information in the notification is accurate, and under penalty of perjury, that you are authorized to act on behalf of the owner of an exclusive right that is allegedly infringed.

The notification must be sent to legal@cloud9charts.com

We provide the above contact information for purposes of the DMCA only and reserve the right to respond only to correspondence that is relevant to this purpose.

What information do we collect?

We collect information from you when you register on our site, products and services, place an order, subscribe to our newsletter or fill out a form. When ordering or registering on our site, as appropriate, you may be asked to enter your name, e-mail address or credit card information. You may, however, visit our site anonymously.

What do we use your information for?

Any of the information we collect from you may be used to personalize your experience, to improve our service offerings or to improve customer service, or to process transactions or to send periodic emails if indicated as such or transactional emails pertaining your order.

Your information, whether public or private, will not be sold, exchanged, transferred, or given to any other company for any reason whatsoever, without your consent, other than for the express purpose of delivering the purchased product or service requested.

How do we protect your information?

We implement a variety of security measures to maintain the safety of your personal information when you place an order or enter, submit, or access your personal information. We offer the use of a secure server. All supplied data and sensitive/credit information is transmitted via Secure Socket Layer (SSL) technology and then encrypted into our Payment gateway providers database only to be accessible by those authorized with special access rights to such systems, and are required to keep the information confidential.

After a transaction, your credit card information will not be stored on our servers.

Do we use cookies?

Yes (Cookies are small files that a site or its service provider transfers to your computers hard drive through your Web browser (if you allow) that enables the sites or service providers systems to recognize your browser and capture and remember certain information

We use cookies to understand and save your preferences for future visits and compile aggregate data about site traffic and site interaction so that we can offer better site experiences and tools in the future.

Do we disclose any information to outside parties?

We do not sell, trade, or otherwise transfer to outside parties your personally identifiable information. This does not include trusted third parties who assist us in operating our website, conducting our business, or servicing you, so long as those parties agree to keep this information confidential. We may also release your information when we believe release is appropriate to comply with the law, enforce our site policies, or protect ours or others rights, property, or safety. However, non-personally identifiable visitor information may be provided to other parties for marketing, advertising, or other uses.

California Online Privacy Protection Act Compliance

Because we value your privacy we have taken the necessary precautions to be in compliance with the California Online Privacy Protection Act. We therefore will not distribute your personal information to outside parties without your consent.

Childrens Online Privacy Protection Act Compliance

We are in compliance with the requirements of COPPA (Childrens Online Privacy Protection Act), we do not collect any information from anyone under 13 years of age. Our website, products and services are all directed to people who are at least 13 years old or older.

Online Privacy Policy Only

This online privacy policy applies only to information collected through our website, products and services and not to information collected offline.

Terms and Conditions

Please also visit our Terms and Conditions section establishing the use, disclaimers, and limitations of liability governing the use of our website, products and services.

Your Consent

By using our site, products and services you consent to our privacy policy.

Changes to our Privacy Policy

If we decide to change our privacy policy, we will post those changes on this page.

Contacting Us

If there are any questions regarding this privacy policy you may contact us by emailing us at support@knowi.com

Machine Learning

Buddha once said, "To reach Enlightenment, you must turn data into insight and insight into action". Ok, he didn't say that, but Knowi can help you blend hindsight with foresight and drive actions from your data.

Overview

Currently, Knowi supports Classification, Regression and Time-Series Anomaly Detection type Machine Learning use cases, with clustering and deep learning coming soon. We also have a data preparation wizard that will guide you through the steps necessary to clean your data prior to any supervised modeling activities.

Anomaly detection is often used to identify unusual patterns that do not conform to expected behavior (called outliers). There ares many applications in business, from intrusion detection to system health monitoring and from fraud detection in credit card transactions to fault detection in operating environments.

For supervised learning, algorithms are selected based on the type of prediction response:

if your response is continuous numbers, then you will be using regression algorithms.
if your response is categories or classes, then you will be using classification algorithms.

For example, if you are building a model to predict the $ amount by which a person is likely to default on a credit card payment, then it's regression. However, if your you just want to know if they are likely to default or not then it's classification.

To start the Machine Learning process, simply select the Machine Learning icon, create your workspace and let Knowi guide you through the steps required to create your Machine Learning models!

Trigger Notification and Actions

Triggers and actions can be applied to the results. For example, you can send an alert or a webhook into your application for the users with a high risk of default for the use case above. The process for setting up triggers and alerts on a query with machine learning remains the same as a normal dataset/query. For more details, see Alerts.

Workspaces

Creating Workspaces

The very first thing required when starting a Machine Learning project in Knowi is to create a workspace. A workspace can be thought of as a folder that will contain all your subsequent machine learning models for the particular use case in question.

Workspace

Once the workspace is created and the required type of modeling determined, the user is then required to either select or upload their training dataset. This dataset will include historical data relating to the predictor variable they wish to predict. The example flow below is for supervised learning (classification and regression).

The user is then able to perform Cloud9QL upon the training dataset, select the variable they wish to predict and also analyze their data to not only see the columns present in their training dataset, but they can also view statistical information about the data in each column by clicking on the icon in each column header.

select

Once the data is uploaded and the attribute to be predicted has been selected, the user then selects Prepare Data. This will then guide the user step by step through some tasks designed to help clean the data items ready for the machine learning algorithms to use.

Editing Workspaces

When entering the Machine Learning module, the user will automatically be taken to a list of their current workspaces and published models. To edit a previously created workspace, simply click on the edit icon next to the workspace name.

edit

Data Preparation

Once your data is loaded and your predictor attribute selected, the next step is to ensure that your data is ready for the machine learning algorithms to successfully run against.

Knowi will lead you through a series of data preparation steps (some are mandatory and some are optional) prior to running the algorithms of your choice.

Note that the results of each step are saved. If a user leaves the data preparation area and returns later the system will direct them to the next step in the process automatically. The user also always has access to view their data by clicking in the top right hand corner of the box.

Data Types

Firstly, we need to ensure that all data types are correct. The user has the option to modify the data types, if necessary. The user simply selects the correct data type per column and then selects 'Next Step'

Data_types

Outliers

The next step is the identification of potential outliers in your data. Knowi will highlight these values and allow the user to either remove all of them, remove selected values or skip the step completely.

Note that the user also always has the ability to go back to the Cloud9QL processing area and inspect their data again.

outliers

Missing Values

It is important that the training dataset does not have any missing values (null values). Rows containing missing values will either need to be removed or imputed (calculated) using the mean of the associated column. The user has the option to remove or impute values. This step is mandatory.

The system will allow the user to:
1. enter a % above which all rows with this percentage of missing values will be removed from the dataset (eg, remove all rows where there are >25% missing numerical values)
2.enter a % above which all columns with this percentage of missing numerical values will be removed from the dataset (eg, remove all columns where there are >9% missing values)
3.impute the remaining missing values on a column by column basis

missing

Rescaling

If your numerical attributes are comprised of different scales (for example, weight, height, age, etc.) then you have the option of rescaling this data. This is not required, but may boost performance. Try creating different models for your non-rescaled, standardized and normalized data and see which ones achieve higher accuracy.

Two methods of rescaling are offered; Normalization (when you do not know the distribution of your data or the distribution is not Gaussian; this will set all values across the board to be between 0 and 1) and Standardization (if your data is Gaussian; this will transform the data to have a mean of 0 and a standard deviation of 1).

Simply select the data items to rescale, the method and chose 'Next Step'.

This step may be skipped entirely.

rescaling

Discrete Grouping

Some algorithms, such as Decision Trees, work better with discrete data. This means taking numerical data and converting it into logical, ordered groups or bins of data (ordinal attributes). It is most useful if you believe there are natural groupings within your column data or if your numerical data has a large range of values (for example, -infinity > 7,000,000,000).

This step is optional and can be skipped.

discrete

Dummy Variables

Some algorithms only work with numerical data and do not support nominal or ordinal data. It will therefore be necessary to convert these values into real values. Each category will be transformed into a column (or attribute) and 0 or 1 will be inserted as the value. This is called widening your dataset.

For example, a column called Gender typically has permissible string entries for 'Male', 'Female' and 'Not Specified'. If the value in a particular case is 'Male', then this would become three columns (one for each category), Gender:Male (with a value of 1), Gender:Female (with a value of 0), Gender:Not Specified (with a value of 0)

Existing column below would become three columns:

existing column	value	new column	value
Gender	Male	Male	1
		Female	0
		Not Specified	0

dummy

This concludes the data preparation activity. Any decisions made along the way have been saved and a user can jump back to any previous step and make changes, if they wish.

The next step in our machine learning journey is to now select the model features that will help predict the outcome.

Feature Selection

Once all data has been prepared, the user is now asked to select the features (data attributes) to feed into the model creation.

features

Feature selection is a crucial part of machine learning and a user will typically create many different models using many different combinations of features before finding the best fit.

The user has two options at this point, to either manually select their features or to let Knowi auto-select features for them based upon correlation and information gain algorithms that we run against the dataset.

It is highly recommended to run your model several times with different features selected.

select_features

Once the features have been selected, the user then selects the algorithms to run and train their models.

Model Creation

After selecting the features, the user is then able to select the algorithms they wish to use to train their model(s).

The user can select one or more algorithms and can also repeat using different features and settings each time.

The algorithms displayed depend on whether the user specified Classification or Regression as the workspace type at workspace creation time.

start_train

Clicking on the settings cog will allow the user to enter algorithm specific parameters.

Once all required algorithms and their settings have been entered, the user then selects 'Train'.

The models and their corresponding results will then appear in the Results section.

results

Each model result has 3 icons associated with it. These allow you to inspect the results of the model and also publish the chosen model. Published models can then be used against a live Knowi queries to predict against incoming data.

	view the data results of the trained model and see the predicted output against the original predictor input
	view the statistical results of each model
	publish the chosen model and make it available for use against incoming data

publish

To use the model against a live Knowi query, the user selects the 'Use Model' option corresponding to the model they wish to use. The system will then take them to the query list page where they can select the appropriate query and associate the model to be used at query run time.

use

Classification Machine Learning

In the terminology of machine learning, classification is considered an instance of supervised learning, i.e. learning where a training set of correctly identified observations is available.

An algorithm that implements classification, especially in a concrete implementation, is known as a classifier.

Knowi currently support 4 different classifiers:

Decision Tree
Logistic Regression
KNN
Naive Bayes

As an example, to predict whether a client will default on their next payment period based on their prior payment behavior:

Download and use the data from UCI Machine Learning Repository. This dataset contains 30,000 client Credit Card data with 24 attributes including:
Personal characteristics such as age, education, gender, and marital status
Credit line limit information
Billing/payment history for the 6 months period from April to September of 2005
Navigate over to our Workspaces page and you will be lead through all the necessary steps to create your model.

Time-series Anomaly Detection

Time-series anomaly detection is a feature used to identify unusual patterns that do not conform to expected behavior, called outliers. There are many applications in business, from intrusion detection (identifying strange patterns in network traffic that could signal a hack) to system health monitoring (spotting a malignant tumor in an MRI scan), and from fraud detection in credit card transactions to fault detection in operating environments.

Upon creation of your Anomaly Detection Workspace, the user will be presented with a number of configuration steps.

Select Dataset - the user is able to select an existing time- series dataset or upload a new dataset to analyze (please note that anomaly detection algorithms work only with time series data at this time)
Cloud9QL data manipulation (optional) - this allows the user to post process the data by applying Cloud9QL transformation
Select the Date/Time Dimension - this is the time series feature of the selected dataset that is going to be on the X chart axis
Select the Numeric attribute - this is the numerical feature of the selected dataset that you'd like to monitor. This will be the Y chart axis
Choose your Algorithm - here the user will select one of the many anomaly forecasting algorithms available (see below)

Anomaly forecasting algorithms

Olympic Model (Seasonal Naive) The naive seasonal model where the prediction for next point is a smoothed average over the previous n periods.
Double and Triple Exponential Smoothing Models Both are popular models used to produce smoothed time- series. The exponential smoothing variant add trend and seasonality into the model. The ETS model used automatically picks the best 'fit' exponential smoothing model.
Moving Average Model Here, the forecast is based on an artificially constructed time series in which the value for a given time period is replaced by the mean of that value and the values for some number of the preceding and succeeding time periods.
Weighted Moving Average and Naive Forecasting Models The forecast for both of these models is based on an artificially constructed time series in which the value for a given time period is replaced by the mean of that value and the values for some number of the preceding and succeeding time periods. The Weighted Moving Average is a special case of the moving average model.
Regression Model Models the relationship between x & y using one or more variable.
ARIMA Model Uses the Autoregressive Integrated Moving Average method.

As soon as the above steps have been completed and the Run Analysis option selected an anomaly detection model is trained and applied to the data. The precision of the model increases over time as more data is made available.

The anomaly detection visualization itself consists of a configurable blue band range of expected values (acceptable threshold limit) along with the actual metric data points. Any values outside of the blue band range are considered anomalies and will appear in red.

anomaly-results

Configuring the Anomaly Detection Algorithm

The width of the blue band of the expected values can be configured by setting the threshold attribute explicitly on the settings modal dialog. This Anomaly detection threshold is the mean absolute percentage deviation from the expected value. The default threshold value set is 50% but this can be modified.

anomaly-settings

Saving the Anomaly detection visualization

As an option you can save the anomaly detection visualization results as widget that can then be shared on one or more dashboards. To do this, simply select teh Save Widget option and enter a widget name. The widget will now appear in the general widget list for subsequent use outside of the Machine Learning module.

However all anomaly related information available within the widget settings bar will not be readily available for user edit. All anomaly detection settings have to be changed via the anomaly workspace directly.

Setting an Anomaly Detection alert

One crucial feature around the anomaly detection is the ability to configure alerts that provide automatic notification when new anomalies are detected.

Channels such as email, webhook and slack can be easily set up by selecting the alerts button from the control list.

By default the look back interval is set to equals to the alert frequency, so any anomaly will be communicated within that interval only. As soon as at least 1 anomaly is detected the system will trigger the alert.

There are several fixed email placeholders that may be used in the email template to add additional information:

%DATASET_NAME% - represents the dataset name selected
%ANOMALY_SIZE% - represents the number of anomalies within the look back interval
%FREQUENCY% - represents the frequency of the alert chosen
%ANOMALY_RESULTS% - represents the detailed information about the anomalies, including expected range and actual metric value

Adding additional analyses

The workspace can contain one or more anomaly detection models. To add another into the workspace, simply choose the Add Analysis button.

add_analysis

Regression Machine Learning

In regression problems, we are trying to predict continuous values as the output. This differs from classification, where the output is a category or class. There are a number of different types of regression problems we support using the following algorithms:

Linear Regression (OLS)
Radial Base Functions
Regression Trees (e.g. Random Forest)
Support Vector Regression (SVR)

As an example, we will build a predictive model to predict house price (price is a number from some defined range, so it will be regression task). We will be using linear regression to predict sales price based on multiple attributes.

You can download the house price dataset here.

Let's suppose you want to sell your house and you are wondering what you can get for it. You usually look for other homes similar to yours, in the same area and close to the same age as yours. We will do something similar, but with Linear Regression Machine Learning.

Attribute Information:

CRIM per capita crime rate by town
ZN proportion of residential land zoned for lots over 25,000 sq.ft.
INDUS proportion of non-retail business acres per town
CHAS Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
NOX nitric oxides concentration (parts per 10 million)
RM average number of rooms per dwelling
AGE proportion of owner-occupied units built prior to 1940
DIS weighted distances to five Boston employment centers
RAD index of accessibility to radial highways
TAX full-value property-tax rate per $10,000
PTRATIO pupil-teacher ratio by town
B 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town
LSTAT % lower status of the population
PRICE True value of owner-occupied homes in $1000's

We will be training our model using PRICE.

Now, navigate over to our Workspaces page and you will be lead through all the necessary steps to create your model.

Algorithms

Radial Base Function Network

A radial basis function network is an artificial neural network that uses radial basis functions as activation functions. It is a linear combination of radial basis functions. They are used in function approximation, time series prediction, and control.

A radial basis function (RBF) is a real-valued function whose value depends only on the distance from the origin, so that ∅(x)=∅(||x||); or alternatively on the distance from some other point c, called a center, so that ∅(x,c)=∅(||x-c||). Any function ∅ that satisfies the property is a radial function. The norm is usually Euclidean distance, although other distance functions are also possible. For example by using probability metric it is for some radial functions possible to avoid problems with ill conditioning of the matrix solved to determine coefficients wi (see below), since the ||x|| is always greater than zero.

Ordinary Least Squares (OLS)

In linear regression, the model specification is that the dependent variable is a linear combination of the parameters. The residual is the difference between the value of the dependent variable predicted by the model, and the true value of the dependent variable. Ordinary least squares obtains parameter estimates that minimize the sum of squared residuals, SSE (also denoted RSS).

K-Nearest Neighbor

The k-nearest neighbor algorithm (k-NN) is a method for classifying objects by a majority vote of its neighbors, with the object being assigned to the class most common amongst its k nearest neighbors (k is typically small). k-NN is a type of instance-based learning, or lazy learning where the function is only approximated locally and all computation is deferred until classification.

The simplest k-NN method takes a data set of feature vectors and labels with Euclidean distance as the similarity measure.

The best choice of k depends upon the data; generally, larger values of k reduce the effect of noise on the classification, but make boundaries between classes less distinct. A good k can be selected by various heuristic techniques, e.g. cross-validation. In binary problems, **it is helpful to choose k to be an odd number as this avoids tied votes**.

The nearest neighbor algorithm has some strong consistency results. As the amount of data approaches infinity, the algorithm is guaranteed to yield an error rate no worse than twice the Bayes error rate (the minimum achievable error rate given the distribution of the data). k-NN is guaranteed to approach the Bayes error rate, for some value of k (where k increases as a function of the number of data points).

The user can also provide a customized distance function.

Often, the classification accuracy of k-NN can be improved significantly if the distance metric is learned with specialized algorithms such as Large Margin Nearest Neighbor or Neighborhood Components Analysis.

Alternatively, the user may provide a k-nearest neighbor search data structure. Besides the simple linear search, KD-Tree, Cover Tree, and LSH (Locality-Sensitive Hashing) for efficient k-nearest neighbor search are also available.

A KD-tree (short for k-dimensional tree) is a space-partitioning dataset structure for organizing points in a k-dimensional space. Cover tree is a data structure for generic nearest neighbor search (with a metric), which is especially efficient in spaces with small intrinsic dimension. The cover tree has a theoretical bound that is based on the dataset's doubling constant. LSH is an efficient algorithm for approximate nearest neighbor search in high dimensional spaces by performing probabilistic dimension reduction of data.

Nearest neighbor rules in effect compute the decision boundary in an implicit manner. In general, the larger k, the smoother the boundary.

Naive Bayes

The Naive Bayes Classifier technique is based on the so-called Bayesian theorem and is particularly suited when the dimensionality of the inputs is high. Despite its simplicity, Naive Bayes can often outperform more sophisticated classification methods.

NaiveBayesIntro

To demonstrate the concept of Na�ve Bayes Classification, consider the example displayed in the illustration above. As indicated, the objects can be classified as either GREEN or RED. Our task is to classify new cases as they arrive, i.e., decide to which class label they belong, based on the currently exiting objects.

Since there are twice as many GREEN objects as RED, it is reasonable to believe that a new case (which hasn't been observed yet) is twice as likely to have membership GREEN rather than RED. In the Bayesian analysis, this belief is known as the prior probability. Prior probabilities are based on previous experience, in this case the percentage of GREEN and RED objects, and often used to predict outcomes before they actually happen.

The users can change the following settings:

Generation Model	Multinomial or Bernoulli. Th multinomial model generates one term in each position of the document. The multivariate Bernoulli model or Bernoulli model generates an indicator for each term , either indicating presence of the term in the document or indicating absence.
Add k-smoothing	By default, we use add-one or Laplace smoothing, which simply adds one to each count to eliminate zeros.

Support Vector Regression

Support vector machines can be used as a regression method, maintaining all the main features of the algorithm. In the case of regression, a margin of tolerance ∈ is set in approximation. The goal of SVR is to find a function that has at most ∈ deviation from the response variable for all the training data, and at the same time is as flat as possible. In other words, we do not care about errors as long as they are less than ∈, but will not accept any deviation larger than this.

Regression Tree

Classification and Regression Tree techniques have a number of advantages over many of those alternative techniques.

Simple to understand and interpret.
In most cases, the interpretation of results summarized in a tree is very simple. This simplicity is useful not only for purposes of rapid classification of new observations, but can also often yield a much simpler "model" for explaining why observations are classified or predicted in a particular manner.
Able to handle both numerical and categorical data.
Other techniques are usually specialized in analyzing datasets that have only one type of variable.
Tree methods are nonparametric and nonlinear.
The final results of using tree methods for classification or regression can be summarized in a series of (usually few) logical if-then conditions (tree nodes). Therefore, there is no implicit assumption that the underlying relationships between the predictor variables and the dependent variable are linear, follow some specific non-linear link function, or that they are even monotonic in nature. Thus, tree methods are particularly well suited for data mining tasks, where there is often little a priori knowledge nor any coherent set of theories or predictions regarding which variables are related and how. In those types of data analytics, tree methods can often reveal simple relationships between just a few variables that could have easily gone unnoticed using other analytic techniques.

One major problem with classification and regression trees is their high variance. Often a small change in the data can result in a very different series of splits, making interpretation somewhat precarious. Besides, decision-tree learners can create over-complex trees that cause over- fitting. Mechanisms such as pruning are necessary to avoid this problem. Another limitation of trees is the lack of smoothness of the prediction surface.

Logistic Regression

Logistic regression (logit model) is a generalized linear model used for binomial regression. Logistic regression applies maximum likelihood estimation after transforming the dependent into a logit variable. A logit is the natural log of the odds of the dependent equaling a certain value or not (usually 1 in binary logistic models, the highest value in multinomial models). In this way, logistic regression estimates the odds of a certain event (value) occurring.

logit

Logistic regression has many analogies to ordinary least squares (OLS) regression. Unlike OLS regression, however, logistic regression does not assume linearity of relationship between the raw values of the independent variables and the dependent, does not require normally distributed variables, does not assume homoscedasticity, and in general has less stringent requirements.

Compared with linear discriminant analysis, logistic regression has several advantages:

It is more robust: the independent variables don't have to be normally distributed, or have equal variance in each group
It does not assume a linear relationship between the independent variables and dependent variable.
It may handle nonlinear effects since one can add explicit interaction and power terms. However, it requires much more data to achieve stable, meaningful results.

Logistic regression also has strong connections with neural network and maximum entropy modeling. For example, binary logistic regression is equivalent to a one-layer, single-output neural network with a logistic activation function trained under log loss. Similarly, multinomial logistic regression is equivalent to a one-layer, softmax- output neural network.

Logistic regression estimation also obeys the maximum entropy principle, and thus logistic regression is sometimes called "maximum entropy modeling", and the resulting classifier the "maximum entropy classifier".

Decision Tree

A decision tree can be learned by splitting the training set into subsets based on an attribute value test. This process is repeated on each derived subset in a recursive manner called recursive partitioning. The recursion is completed when the subset at a node all has the same value of the target variable, or when splitting no longer adds value to the predictions.

The settings cog allows the user to enter options for the following:

Maximum number of leaf nodes
Minimum number of leaf nodes
Splitting Rule
- Gini impurity: a measure of how often a randomly chosen element from the set would be incorrectly labeled if it were randomly labeled according to the distribution of labels in the subset
- Entropy: Information gain is based on the concept of entropy used in information theory. For categorical variables with different number of levels, however, information gain are biased in favor of those attributes with more levels. Instead, one may employ the information gain ratio, which solves the drawback of information gain

Decision tree techniques have a number of advantages over many alternative techniques.

Simple to understand and interpret:
In most cases, the interpretation of results summarized in a tree is very simple. This simplicity is useful not only for purposes of rapid classification of new observations, but can also often yield a much simpler "model" for explaining why observations are classified or predicted in a particular manner.

Able to handle both numerical and categorical data:
Other techniques are usually specialized in analyzing datasets that have only one type of variable.

Nonparametric and nonlinear:
The final results of using tree methods for classification or regression can be summarized in a series of (usually few) logical if-then conditions (tree nodes). Therefore, there is no implicit assumption that the underlying relationships between the predictor variables and the dependent variable are linear, follow some specific non-linear link function, or that they are even monotonic in nature. Thus, tree methods are particularly well suited for data mining tasks, where there is often little a priori knowledge nor any coherent set of theories or predictions regarding which variables are related and how. In those types of data analytics, tree methods can often reveal simple relationships between just a few variables that could have easily gone unnoticed using other analytic techniques.

The Six Steps of Creating a Machine Learning Model

The UCI Machine Learning Repository contains many full data sets that can be used to test and train machine learning models. One such example is the Breast Cancer Wisconsin (Diagnostic) Data Set which relates whether breast cancer is benign or malignant to 10 specific aspects of the tumor. Based on this dataset, we can develop a model that will be able to determine the likelihood of breast cancer being benign or malignant.

The process of using machine learning to analyze data is made easy with Knowi Adaptive Intelligence. Given a training dataset, Knowi can apply either classification or regression algorithms to build valuable insights from the data.

Here is a step-by-step guide about how to turn that data into a powerful machine learning model using Knowi:

Create the Workspace and Upload Data

To start the machine learning process, go to www.knowi.com. If you are not already a Knowi user, sign up for a free trial to complete this tutorial. Once in, go into the machine learning section that can be found on the left-hand side of the screen. From there, start a new workspace and you will be given a choice of either making a classification or regression model. For the case of the breast cancer example, the workspace will be classification due to the nature of the data where the variable that we are predicting will always fall into either of two categories. Next, upload the Breast Cancer Wisconsin (Diagnostic) Data Set.

Step 1

Choose Response Variable and View Full Dataset

After uploading, and possibly manipulating the file, chose the Attribute to Predict from the drop-down list. In the case of the breast cancer data, the attribute that is being predicted is the class of the tumor. Following the choice of the prediction variable, the initial analysis takes place by using the Analyze Data button. This displays the data on the screen and allows an opportunity to scroll through the data looking for patterns.

Step 2 Step 2b

Prepare the Data
- After analyzing, data preparation begins. Data preparation is an optional, wizard driven process that involves going through a step-by-step process where the program confirms the training set datatypes, identifies and allows for the removal of outliers, reports missing data with the option to remove or impute values, allows for rescaling of the data, groups into discrete bins and, finally, provides the option to create dummy variables. All decisions can be changed by moving backwards and forwards through the steps at any time.
- For the Breast Cancer data, a small amount of rescaling and grouping were necessary to increase accuracy.

Step 3

Feature Selection

Whether you came in with prepared data, or just finished the process, the next step is to select which variables to be used in the model. To make this decision it is essential to check back at the data, looking for patterns and correlations.

Step 4

Create and Compare the Models

At this point you are left with choosing between the available algorithms (i.e. Decision Tree, Logistic Regression, K-Nearest Neighbor, or Naive Bayes). Knowi makes it easy to choose all available and compare them with useful attributes such as accuracy or the absolute deviation. Pressing the little eye next to the model created in the results section will show a preview of the input data along with the predictions of the program. Next to the eye there is a plus sign that, when pressed, will display the details of that specific model. It is beneficial to produce many models and tweak settings each time to find the best one for the situation. All past models are saved in the history and can be viewed, compared, and even published.

Step 5

Publish

The last step is publication. This step involves the button next to the plus sign. Upon publishing, a prompt to name the model will be displayed. It is possible to publish as many models as needed from the same data. All models that are created can be viewed and compared directly in the 'Published Models' tab within Machine Learning.

Step 6

How to Apply a Model to a Query

Now you have officially created a machine learning model that can seamlessly be applied to any query. To integrate it into a dataset simply press 'Apply Model' while performing a query and this will add a field where all the machine learning models will be available to be selected and used. Pressing the preview button on the screen will show the data along with the predictions made by the model.

Apply Model

Apply Model2

Actions from Insight Made Easy

With those six steps you have a machine learning model that can be integrated into any workflow and create new visualizations and insights that will drive downstream actions. The applications of the machine learning model are endless and can be tailored to the individual need. Once a model is made, and put in place, there are many actions that can be performed to gain meaning and spark reactions. This is done through trigger notifications. A trigger notification is a notification that will act in the case that a certain condition is met. In the scope of the breast cancer machine learning model, an alert can be set to email a doctor the patient's information in the situation that the model found a tumor to be malignant. This enables more than just insights, it generates action.

Summary:

The process of creating a model within Knowi is so easy that anyone can do it, and it starts with simply uploading a dataset. Data can be uploaded from a file, SQL and NoSQL sources, along with REST-APIs. Following the uploading of a file, Knowi has built-in algorithms available, or the option to create your own, along with a designated page to review multiple factors and evaluate the best algorithm for your situation. Using this method, the Breast Cancer training data was loaded from the UCI Machine Learning Repository into a Knowi workspace, then analyzed with the built-in data prepping tools. The resulting model was ready to be integrated into any workflow and autonomously perform actions based on the results, such as sending an alert to a doctor depending on the outcome of the test. Give Knowi a try and see how easy visualizing and learning from your data can be.

References

Dheeru, D., & Karra Taniskidou, E. (2017). UCI Machine Learning Repository. Retrieved from University of California, Irvine, School of Information and Computer Sciences: http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

Knowi. (2017). Adaptive Intelligence for Modern Data. Retrieved from Knowi Web site: www.knowi.com/